What Is Big Data? Craig C. Douglas University of Wyoming
|
|
|
- Eric Bell
- 10 years ago
- Views:
Transcription
1 What Is Big Data? Craig C. Douglas University of Wyoming
2 What Is Big Data?... It Depends Unit Approximately 10 n Related to Kilobyte (KB) 1,000 bytes 3 Circa 1952 computer memory 32 KB Apollo 11 computer memory (1969) Megabyte (MB) 1,000 KB 6 Circa 1976 supercomputer memory Gigabyte (GB) 1,000 MB typical 16 GB memory smck Terabyte (TB) 1,000 GB largest SSD in a laptop Petabyte (PB) 1,000 GB ,000 DVD s or the enmre digital library of all known books wriuen in all known languages Exabyte (EB) 1,000 PB EB copied to disk in 2010 (est.) ZeUabyte (ZB) 1,000 EB 21 2 ZB copied to disk in 2011 (est.) 32 GB Smart phone memory (2014) 2
3 What Is Big Data?... It Depends What if Mme counts? Given a Mme period t, How much data can be read and wriuen? This changes over Mme as technology changes. What if the quanmty of data counts? How long does it take to read and write data? This changes over Mme as technology changes. DefiniMon of Big Data is fluid, not stamc. 3
4 Some Sources of Big Data InteracMons with dynamic databases Internet data City or regional transportamon flow control Environment and disaster management Oil/gas fields or pipelines, seismic imaging Credit cards and online businesses Government or industry regulamon/stamsmcs Dynamic data- driven apps 4
5 Why is Big Data a Hot Topic? Open posimons in data analymcs by 2020 (USA) up to 200,000 open posimons might only be 140,000 open posimons Bureau of Labor StaMsMcs projects that 70% of all newly created jobs across all STEM fields during 2010 s, across engineering, the physical sciences, the life sciences, and the social sciences, will be in computer science 5
6 Unprecedented OpportuniMes Significant contribumons to the development of these transformamve technologies have been made from diverse fields including: mathemamcs, natural sciences engineering social sciences arts and entertainment industries business world 6
7 Unprecedented OpportuniMes Algorithm and sofware development belong to computer science over the past 50 years: Computer science researchers have designed and implemented the algorithms and data structures, languages, models, tools, and abstracmons that have enabled these transformamonal technology developments 7
8 Quick summary SimulaMon oriented computamonal science is transformamonal science, but is only a niche in the grand scheme of things. Big data compumng capabilimes must be broadly available in any insmtumon that strives to compete in the coming decade. If not, an insmtumon will simply cease to be compemmve, similar to not joining the ARPAnet or CSnet in the 1970 s and 1980 s. 8
9 Some InteresMng Problems An Open Source, secure Hadoop replacement suitable for hospitals and medical records. Must be HPPA compliant. Must scale well for very large databases. Must have individual access capabilimes. Must not have complexity O(disk access) on a DFS. Should use OpenMP and MPI. Should use cache aware hashing methods. Will be useful well beyond medical records. 9
10 Some InteresMng Problems Dynamic Data- Driven ApplicaMon Systems and Big Data A natural fit and there is no agreed upon sofware for DDDAS or DDDAS- BD or DBDDAS. DDDAS has been applied to many, many fields. DDDAS researchers agree something should be produced: not considered an applicamon and too applied to be considered networking research. Need to find a niche or a program officer in a funding agency willing to think outside of the box. Many Big Data issues long common to DDDAS. 10
11 Some InteresMng Problems Sensors and telemetry SensorML was supposed to provide a standard way of describing sensor data and be able to get the data and deliver it to applicamons. It went commercial ($$$...$$$) afer the original PI remred. A true Open Source, interna5onally recognized standard would benefit one area of Big Data and DDDAS. 11
12 Some InteresMng Problems Reservoirs (oil, gas, water) Dynamic reservoir meshing VerMcal wells with micro sensors provide updates to fracked reservoirs. Speed up the meshing to including in a reservoir simulator Mme (e.g., go from a year to a day). Dynamically improve predicmons. Corporate oil/gas fields or pipelines (even small ones) produce excessive amounts of data Open Source data mining tools for specific problem 12
13 Some InteresMng Problems Audio and photographic data mining World s largest databases based on VoIP and phone monitoring by many governments (e.g., P.R. China, France, Germany, Kingdom of Saudi Arabia, United Kingdom, USA, ). Keeps disk drive makers in business and lowers hard disk prices very significantly. Another problem: Find all file duplicates in a file system efficiently. Similar to sentence problem earlier. Has commercial (e.g., Bing, satellite transmission) and research ramificamons that are not nefarious. 13
An Open Dynamic Big Data Driven Applica3on System Toolkit
An Open Dynamic Big Data Driven Applica3on System Toolkit Craig C. Douglas University of Wyoming and KAUST This research is supported in part by the Na3onal Science Founda3on and King Abdullah University
Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics
Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,
Introduction to the Mathematics of Big Data. Philippe B. Laval
Introduction to the Mathematics of Big Data Philippe B. Laval Fall 2015 Introduction In recent years, Big Data has become more than just a buzz word. Every major field of science, engineering, business,
Definition of Computers. INTRODUCTION to COMPUTERS. Historical Development ENIAC
Definition of Computers INTRODUCTION to COMPUTERS Bülent Ecevit University Department of Environmental Engineering A general-purpose machine that processes data according to a set of instructions that
Doing Multidisciplinary Research in Data Science
Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University [email protected] http://ce.qu.edu.az/~aadamov 16 May
Computer Logic (2.2.3)
Computer Logic (2.2.3) Distinction between analogue and discrete processes and quantities. Conversion of analogue quantities to digital form. Using sampling techniques, use of 2-state electronic devices
Mass Storage Structure
Mass Storage Structure 12 CHAPTER Practice Exercises 12.1 The accelerating seek described in Exercise 12.3 is typical of hard-disk drives. By contrast, floppy disks (and many hard disks manufactured before
Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016
Big Data! Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016 Big Data: Data Analytical Tools for Decision Support 2 Outline Introduce
Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis. Opportunities in Power Distribution Network System Optimization (from EDA Perspective)
Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis Opportunities in Power Distribution Network System Optimization (from EDA Perspective) Outline! SmartGrid: What it is! Power Distribution Network
BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013
BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013 PERSONAL BACKGROUND Founder of the first specialist Service Management & Helpdesk System provider in Europe Past President of AFSMI
SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS
Sean Lee Solution Architect, SDI, IBM Systems SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS Agenda Converging Technology Forces New Generation Applications Data Management Challenges
Introduction to Predictive Analytics. Dr. Ronen Meiri [email protected]
Introduction to Predictive Analytics Dr. Ronen Meiri Outline From big data to predictive analytics Predictive Analytics vs. BI Intelligent platforms What can we do with it. The modeling process. Example
Algorithms and Methods for Distributed Storage Networks 7 File Systems Christian Schindelhauer
Algorithms and Methods for Distributed Storage Networks 7 File Systems Institut für Informatik Wintersemester 2007/08 Literature Storage Virtualization, Technologies for Simplifying Data Storage and Management,
lesson 1 An Overview of the Computer System
essential concepts lesson 1 An Overview of the Computer System This lesson includes the following sections: The Computer System Defined Hardware: The Nuts and Bolts of the Machine Software: Bringing the
CSCA0102 IT & Business Applications. Foundation in Business Information Technology School of Engineering & Computing Sciences FTMS College Global
CSCA0102 IT & Business Applications Foundation in Business Information Technology School of Engineering & Computing Sciences FTMS College Global Chapter 2 Data Storage Concepts System Unit The system unit
Digital Earth: Big Data, Heritage and Social Science
Digital Earth: Big Data, Heritage and Social Science The impact on geographic information and GIS Geographic Information Systems Analysis for Decision Support Impact of Big Data Digital Earth Citizen Engagement
Age of Big data. Presented by: Mohammad Iqbal BCM -2014
Age of Presented by: Mohammad Iqbal BCM -2014 Agenda Big? Big evolution from Big? Name Symbol Value Kilobyte KB 10^3 BIG DATA Megabyte MB 10^6 Gigabyte GB 10^9 Terabyte TB 10^12 Petabyte PB 10^15 So large
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP
The Big Deal about Big Data Mike Skinner, CPA CISA CITP HORNE LLP Mike Skinner, CPA CISA CITP Senior Manager, IT Assurance & Risk Services HORNE LLP Focus areas: IT security & risk assessment IT governance,
Data Centric Systems (DCS)
Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems
# Not a part of 1Z0-061 or 1Z0-144 Certification test, but very important technology in BIG DATA Analysis
Section 9 : Case Study # Objectives of this Session The Motivation For Hadoop What problems exist with traditional large-scale computing systems What requirements an alternative approach should have How
Bursting to a Hybrid Cloud for Services OFC 2015
Bursting to a Hybrid Cloud for Services OFC 2015 Big Data applications Big Compute in the cloud Why burst to the cloud? Opportunities 2 Big Data Apps Need Big Compute Life Sciences Bioinformatics Next
Applications for Business Intelligence, Predictive Analytics and Big Data
Finance, Management, & Operations Applications for Business Intelligence, Predictive Analytics and Big Data Patrick Bogan, Chief Information Officer, Fuzion Analytics Kyle Korzenowski, Chief Information
Cloud Computing Where ISR Data Will Go for Exploitation
Cloud Computing Where ISR Data Will Go for Exploitation 22 September 2009 Albert Reuther, Jeremy Kepner, Peter Michaleas, William Smith This work is sponsored by the Department of the Air Force under Air
Pervasive Location Analytics and A Billion Dollar Opportunity. Jitender Aswani, Portfolio Strategist, Business Analytics, SAP
[ Pervasive Location Analytics and A Billion Dollar Opportunity Jitender Aswani, Portfolio Strategist, Business Analytics, SAP Agenda 1. Big Data & Pervasive Nature of Location-Based Solutions 2. Location
So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI
So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI Disclosures James E. Tcheng, MD, FACC, FSCAI Affiliations / Financial Relationships / Other RWI ACC Chair, Informatics and Health IT Task Force
How To Use Big Data In Healthcare
Big data challenges hll and opportunities in healthcare: h application to detecting faint signals Dr. Greg Slabaugh City University London School of Informatics Data, data, and more data According to IBM,
Discovering Computers 2008. Chapter 7 Storage
Discovering Computers 2008 Chapter 7 Storage Chapter 7 Objectives Differentiate between storage devices and storage media Describe the characteristics of magnetic disks Describe the characteristics of
BIG DATA CHALLENGES AND PERSPECTIVES
BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,
www.pwc.com Game On: How Information is Changing the Rules of Insurance
www.pwc.com Game On: How Information is Changing the Rules of Insurance Game On: How Information is Changing the Rules of Insurance The ability to extract meaningful insights from information assets is
Architectures for massive data management
Architectures for massive data management Apache Kafka, Samza, Storm Albert Bifet [email protected] October 20, 2015 Stream Engine Motivation Digital Universe EMC Digital Universe with
Technology in Action. Alan Evans Kendall Martin Mary Anne Poatsy. Tenth Edition. Copyright 2014 Pearson Education, Inc. Publishing as Prentice Hall
Technology in Action Alan Evans Kendall Martin Mary Anne Poatsy Tenth Edition Copyright 2014 Pearson Education, Inc. Publishing as Prentice Hall Technology in Action Chapter 2 Looking at Computers Understanding
Reduction of Data at Namenode in HDFS using harballing Technique
Reduction of Data at Namenode in HDFS using harballing Technique Vaibhav Gopal Korat, Kumar Swamy Pamu [email protected] [email protected] Abstract HDFS stands for the Hadoop Distributed File System.
Big Data: Public Sector Opportunities, Challenges, and Implications
Big Data: Public Sector Opportunities, Challenges, and Implications Timothy M. Persons, Ph.D. Chief Scientist U.S. Government Accountability Office [email protected] / www.gao.gov / @GAOChfScientist Presentation
Laurence Liew General Manager, APAC. Economics Is Driving Big Data Analytics to the Cloud
Laurence Liew General Manager, APAC Economics Is Driving Big Data Analytics to the Cloud Big Data 101 The Analytics Stack Economics of Big Data Convergence of the 3 forces Big Data Analytics in the Cloud
Data Centric Computing Revisited
Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data
Data-Flow Awareness in Parallel Data Processing
Data-Flow Awareness in Parallel Data Processing D. Bednárek, J. Dokulil *, J. Yaghob, F. Zavoral Charles University Prague, Czech Republic * University of Vienna, Austria 6 th International Symposium on
Discovering Computers 2011. Living in a Digital World
Discovering Computers 2011 Living in a Digital World Objectives Overview Differentiate among various styles of system units on desktop computers, notebook computers, and mobile devices Identify chips,
Real Time Big Data Processing
Real Time Big Data Processing Cloud Expo 2014 Ian Meyers Amazon Web Services Global Infrastructure Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure
Cloud storage Megas, Gigas and Teras
Cloud storage Megas, Gigas and Teras I think that I need cloud storage. I have photos, videos, music and documents on my computer that I can only retrieve from my computer. If my computer got struck by
Big Data Analytics. Genoveva Vargas-Solar http://www.vargas-solar.com/big-data-analytics French Council of Scientific Research, LIG & LAFMIA Labs
1 Big Data Analytics Genoveva Vargas-Solar http://www.vargas-solar.com/big-data-analytics French Council of Scientific Research, LIG & LAFMIA Labs Montevideo, 22 nd November 4 th December, 2015 INFORMATIQUE
Database Fundamentals
Database Fundamentals Computer Science 105 Boston University David G. Sullivan, Ph.D. Bit = 0 or 1 Measuring Data: Bits and Bytes One byte is 8 bits. example: 01101100 Other common units: name approximate
Using Ultra-Large Data Sets in Healthcare New Questions-New Answers
Using Ultra-Large Data Sets in Healthcare New Questions-New Answers David Hartzband, D.Sc.. Director, Technology Research, RCHN Community Health Foundation & Lecturer, Engineering Systems Division Massachusetts
BIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
Big Data System and Architecture
CHANGE, a 2012 DAC workshop 2nd International Workshop on Computing in Heterogeneous, Autonomous 'N' Goal-oriented Environments Moscone Center, San Francisco, California, June 3, 2012 Big Data System and
How To Store Data On A Computer (For A Computer)
TH3. Data storage http://www.bbc.co.uk/schools/gcsebitesize/ict/ A computer uses two types of storage. A main store consisting of ROM and RAM, and backing stores which can be internal, eg hard disk, or
Determining Your Computer Resources
Determining Your Computer Resources There are a number of computer components that must meet certain requirements in order for your computer to perform effectively. This document explains how to check
Chapter 4 System Unit Components. Discovering Computers 2012. Your Interactive Guide to the Digital World
Chapter 4 System Unit Components Discovering Computers 2012 Your Interactive Guide to the Digital World Objectives Overview Differentiate among various styles of system units on desktop computers, notebook
CHAPTER 2: HARDWARE BASICS: INSIDE THE BOX
CHAPTER 2: HARDWARE BASICS: INSIDE THE BOX Multiple Choice: 1. Processing information involves: A. accepting information from the outside world. B. communication with another computer. C. performing arithmetic
Big Data and Big Data Modeling
Big Data and Big Data Modeling The Age of Disruption Robin Bloor The Bloor Group March 19, 2015 TP02 Presenter Bio Robin Bloor, Ph.D. Robin Bloor is Chief Analyst at The Bloor Group. He has been an industry
Data De-duplication Methodologies: Comparing ExaGrid s Byte-level Data De-duplication To Block Level Data De-duplication
Data De-duplication Methodologies: Comparing ExaGrid s Byte-level Data De-duplication To Block Level Data De-duplication Table of Contents Introduction... 3 Shortest Possible Backup Window... 3 Instant
Analytical Tools: What Auditors Need to Know About Big Data
Analytical Tools: What Auditors Need to Know About Big Data Timothy M. Persons, Ph.D. Chief Scientist U.S. Government Accountability Office [email protected] / www.gao.gov / @GAOChfScientist Presentation
Data Mining and Machine Learning in Bioinformatics
Data Mining and Machine Learning in Bioinformatics PRINCIPAL METHODS AND SUCCESSFUL APPLICATIONS Ruben Armañanzas http://mason.gmu.edu/~rarmanan Adapted from Iñaki Inza slides http://www.sc.ehu.es/isg
Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect
on AWS Services Overview Bernie Nallamotu Principle Solutions Architect \ So what is it? When your data sets become so large that you have to start innovating around how to collect, store, organize, analyze
High Frequency Trading and NoSQL. Peter Lawrey CEO, Principal Consultant Higher Frequency Trading
High Frequency Trading and NoSQL Peter Lawrey CEO, Principal Consultant Higher Frequency Trading Agenda Who are we? Brief introduction to OpenHFT. What does a typical trading system look like What requirements
DATA ANALYTICS: GO BIG OR GO HOME
SPECIAL ADVERTISING SECTION businessweek.com/adsections DATA ANALYTICS: GO BIG OR GO HOME S1 THE ERA OF BIG DATA a time when petabytes of information on consumer behavior and countless other topics fly
CS246: Mining Massive Datasets Jure Leskovec, Stanford University. http://cs246.stanford.edu
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 2 CPU Memory Machine Learning, Statistics Classical Data Mining Disk 3 20+ billion web pages x 20KB = 400+ TB
The HP IT Transformation Story
The HP IT Transformation Story Continued consolidation and infrastructure transformation impacts to the physical data center Dave Rotheroe, October, 2015 Why do data centers exist? Business Problem Application
Grab some coffee and enjoy the pre-show banter before the top of the hour!
Grab some coffee and enjoy the pre-show banter before the top of the hour! Think Big: How to Design a Big Data Information Architecture Exploratory Webcast January 22, 2014 Guests Robin Bloor Chief Analyst,
Mining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University
Mining Big Data Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University Website: http://www.cse.msu.edu/~ptan Google Trends Big Data Smart Cities Big Data and
UBI data management Granularity and related considerations
UBI data management Granularity and related considerations March 11, 2015 Casualty Actuarial Society RPM Seminar, Dallas USA By : Germain Denoncourt, FCIA, FCAS 1 Agenda Introduction Data quantity data
Crossing the Performance Chasm with OpenPOWER
Crossing the Performance Chasm with OpenPOWER Dr. Srini Chari Cabot Partners/IBM [email protected] #OpenPOWERSummit Join the conversation at #OpenPOWERSummit 1 Disclosure Copyright 215. Cabot Partners
22S:295 Seminar in Applied Statistics High Performance Computing in Statistics
22S:295 Seminar in Applied Statistics High Performance Computing in Statistics Luke Tierney Department of Statistics & Actuarial Science University of Iowa August 30, 2007 Luke Tierney (U. of Iowa) HPC
Chapter 6. Inside the System Unit. What You Will Learn... Computers Are Your Future. What You Will Learn... Describing Hardware Performance
What You Will Learn... Computers Are Your Future Chapter 6 Understand how computers represent data Understand the measurements used to describe data transfer rates and data storage capacity List the components
Introduction to Computer & Information Systems
Introduction to Computer & Information Systems Binnur Kurt [email protected] Istanbul Technical University Computer Engineering Department Copyleft 2005 1 Version 0.1 About the Lecturer BSc İTÜ, Computer
Secondary Storage. Any modern computer system will incorporate (at least) two levels of storage: magnetic disk/optical devices/tape systems
1 Any modern computer system will incorporate (at least) two levels of storage: primary storage: typical capacity cost per MB $3. typical access time burst transfer rate?? secondary storage: typical capacity
The Genealogy Cloud: Which Online Storage Program is Right For You Page 1 2012, copyright High-Definition Genealogy. All rights reserved.
The Genealogy Cloud: Which Online Storage Program is Right For You Thomas MacEntee, of High-Definition Genealogy http://hidefgen.com [email protected] Clouds in Genealogy? What is the Genealogy Cloud?
and processes between Magistrates Courts and Corrections Victoria. The project consisted of two stages.
Magistrates Court of Victoria Video Conference Project (VCP) Project Guide GENERAL PROJECT INFORMATION 1. The project is the installation and trial of modern internet based video conference technology
Logical Operations. Control Unit. Contents. Arithmetic Operations. Objectives. The Central Processing Unit: Arithmetic / Logic Unit.
Objectives The Central Processing Unit: What Goes on Inside the Computer Chapter 4 Identify the components of the central processing unit and how they work together and interact with memory Describe how
Taming Big Data Storage with Crossroads Systems StrongBox
BRAD JOHNS CONSULTING L.L.C Taming Big Data Storage with Crossroads Systems StrongBox Sponsored by Crossroads Systems 2013 Brad Johns Consulting L.L.C Table of Contents Taming Big Data Storage with Crossroads
416 Agriculture Hall Michigan State University 517-355-3776 http://support.anr.msu.edu [email protected]
416 Agriculture Hall Michigan State University 517-355-3776 http://support.anr.msu.edu [email protected] Title: ANR TS How To Efficiently Remove Items In Outlook To Free Up Space Document No. - 162 Revision
A Survey on Big Data Concepts and Tools
A Survey on Big Data Concepts and Tools D. Rajasekar 1, C. Dhanamani 2, S. K. Sandhya 3 1,3 PG Scholar, 2 Assistant Professor, Department of Computer Science and Engineering, Sri Krishna College of Engineering
Copyright (c) 2012, Meta Business Systems. Mario Bojilov Meta Business Systems 20 February 2013
Mario Bojilov Meta Business Systems 20 February 2013 What is Big Data Volume 90% of data in the world was created in the last 2 years What is Big Data Volume 90% of data in the world was created in the
File System Management
Lecture 7: Storage Management File System Management Contents Non volatile memory Tape, HDD, SSD Files & File System Interface Directories & their Organization File System Implementation Disk Space Allocation
Hardware Configuration Guide
Hardware Configuration Guide Contents Contents... 1 Annotation... 1 Factors to consider... 2 Machine Count... 2 Data Size... 2 Data Size Total... 2 Daily Backup Data Size... 2 Unique Data Percentage...
Acronis Backup Deduplication. Technical Whitepaper
Acronis Backup Deduplication Technical Whitepaper Table of Contents Table of Contents Table of Contents... 1 Introduction... 3 Storage Challenges... 4 How Deduplication Helps... 5 How It Works... 6 Deduplication
HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW
HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW 757 Maleta Lane, Suite 201 Castle Rock, CO 80108 Brett Weninger, Managing Director [email protected] Dave Smelker, Managing Principal [email protected]
