What Is Big Data? Craig C. Douglas University of Wyoming
|
|
- Eric Bell
- 8 years ago
- Views:
Transcription
1 What Is Big Data? Craig C. Douglas University of Wyoming
2 What Is Big Data?... It Depends Unit Approximately 10 n Related to Kilobyte (KB) 1,000 bytes 3 Circa 1952 computer memory 32 KB Apollo 11 computer memory (1969) Megabyte (MB) 1,000 KB 6 Circa 1976 supercomputer memory Gigabyte (GB) 1,000 MB typical 16 GB memory smck Terabyte (TB) 1,000 GB largest SSD in a laptop Petabyte (PB) 1,000 GB ,000 DVD s or the enmre digital library of all known books wriuen in all known languages Exabyte (EB) 1,000 PB EB copied to disk in 2010 (est.) ZeUabyte (ZB) 1,000 EB 21 2 ZB copied to disk in 2011 (est.) 32 GB Smart phone memory (2014) 2
3 What Is Big Data?... It Depends What if Mme counts? Given a Mme period t, How much data can be read and wriuen? This changes over Mme as technology changes. What if the quanmty of data counts? How long does it take to read and write data? This changes over Mme as technology changes. DefiniMon of Big Data is fluid, not stamc. 3
4 Some Sources of Big Data InteracMons with dynamic databases Internet data City or regional transportamon flow control Environment and disaster management Oil/gas fields or pipelines, seismic imaging Credit cards and online businesses Government or industry regulamon/stamsmcs Dynamic data- driven apps 4
5 Why is Big Data a Hot Topic? Open posimons in data analymcs by 2020 (USA) up to 200,000 open posimons might only be 140,000 open posimons Bureau of Labor StaMsMcs projects that 70% of all newly created jobs across all STEM fields during 2010 s, across engineering, the physical sciences, the life sciences, and the social sciences, will be in computer science 5
6 Unprecedented OpportuniMes Significant contribumons to the development of these transformamve technologies have been made from diverse fields including: mathemamcs, natural sciences engineering social sciences arts and entertainment industries business world 6
7 Unprecedented OpportuniMes Algorithm and sofware development belong to computer science over the past 50 years: Computer science researchers have designed and implemented the algorithms and data structures, languages, models, tools, and abstracmons that have enabled these transformamonal technology developments 7
8 Quick summary SimulaMon oriented computamonal science is transformamonal science, but is only a niche in the grand scheme of things. Big data compumng capabilimes must be broadly available in any insmtumon that strives to compete in the coming decade. If not, an insmtumon will simply cease to be compemmve, similar to not joining the ARPAnet or CSnet in the 1970 s and 1980 s. 8
9 Some InteresMng Problems An Open Source, secure Hadoop replacement suitable for hospitals and medical records. Must be HPPA compliant. Must scale well for very large databases. Must have individual access capabilimes. Must not have complexity O(disk access) on a DFS. Should use OpenMP and MPI. Should use cache aware hashing methods. Will be useful well beyond medical records. 9
10 Some InteresMng Problems Dynamic Data- Driven ApplicaMon Systems and Big Data A natural fit and there is no agreed upon sofware for DDDAS or DDDAS- BD or DBDDAS. DDDAS has been applied to many, many fields. DDDAS researchers agree something should be produced: not considered an applicamon and too applied to be considered networking research. Need to find a niche or a program officer in a funding agency willing to think outside of the box. Many Big Data issues long common to DDDAS. 10
11 Some InteresMng Problems Sensors and telemetry SensorML was supposed to provide a standard way of describing sensor data and be able to get the data and deliver it to applicamons. It went commercial ($$$...$$$) afer the original PI remred. A true Open Source, interna5onally recognized standard would benefit one area of Big Data and DDDAS. 11
12 Some InteresMng Problems Reservoirs (oil, gas, water) Dynamic reservoir meshing VerMcal wells with micro sensors provide updates to fracked reservoirs. Speed up the meshing to including in a reservoir simulator Mme (e.g., go from a year to a day). Dynamically improve predicmons. Corporate oil/gas fields or pipelines (even small ones) produce excessive amounts of data Open Source data mining tools for specific problem 12
13 Some InteresMng Problems Audio and photographic data mining World s largest databases based on VoIP and phone monitoring by many governments (e.g., P.R. China, France, Germany, Kingdom of Saudi Arabia, United Kingdom, USA, ). Keeps disk drive makers in business and lowers hard disk prices very significantly. Another problem: Find all file duplicates in a file system efficiently. Similar to sentence problem earlier. Has commercial (e.g., Bing, satellite transmission) and research ramificamons that are not nefarious. 13
An Open Dynamic Big Data Driven Applica3on System Toolkit
An Open Dynamic Big Data Driven Applica3on System Toolkit Craig C. Douglas University of Wyoming and KAUST This research is supported in part by the Na3onal Science Founda3on and King Abdullah University
More informationAn Open Framework for Dynamic Big-Data-Driven Application Systems (DBDDAS) Development
Procedia Computer Science Volume 29, 2014, Pages 1246 1255 ICCS 2014. 14th International Conference on Computational Science An Open Framework for Dynamic Big-Data-Driven Application Systems (DBDDAS) Development
More informationSurfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics
Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,
More informationIntroduction to the Mathematics of Big Data. Philippe B. Laval
Introduction to the Mathematics of Big Data Philippe B. Laval Fall 2015 Introduction In recent years, Big Data has become more than just a buzz word. Every major field of science, engineering, business,
More informationBIG DATA & SOCIAL INNOVATION KENNETH THOMAS, CLIENT MANAGER
BIG DATA & SOCIAL INNOVATION KENNETH THOMAS, CLIENT MANAGER 1 MAKING THE RIGHT DECISSION AT THE RIGHT PLACE AT THE RIGHT TIME 2 THE DATA MULTIPLIER EFFECT AT WORK BUSINESS DRIVEN HUMAN DRIVEN MACHINE DRIVEN
More informationDefinition of Computers. INTRODUCTION to COMPUTERS. Historical Development ENIAC
Definition of Computers INTRODUCTION to COMPUTERS Bülent Ecevit University Department of Environmental Engineering A general-purpose machine that processes data according to a set of instructions that
More informationDoing Multidisciplinary Research in Data Science
Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University aadamov@qu.edu.az http://ce.qu.edu.az/~aadamov 16 May
More informationComputer Logic (2.2.3)
Computer Logic (2.2.3) Distinction between analogue and discrete processes and quantities. Conversion of analogue quantities to digital form. Using sampling techniques, use of 2-state electronic devices
More informationCongrats to Game Winners. How can computation use data to solve problems? What topics have we covered in CS 202? Part 1: Completed!
CS 202: Introduction to Computation " UNIVERSITY of WISCONSIN-MADISON Computer Sciences Department Professor Andrea Arpaci-Dusseau How can computation use data to solve problems? Congrats to Game Winners
More informationMass Storage Structure
Mass Storage Structure 12 CHAPTER Practice Exercises 12.1 The accelerating seek described in Exercise 12.3 is typical of hard-disk drives. By contrast, floppy disks (and many hard disks manufactured before
More informationMajed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016
Big Data! Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016 Big Data: Data Analytical Tools for Decision Support 2 Outline Introduce
More informationGi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis. Opportunities in Power Distribution Network System Optimization (from EDA Perspective)
Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis Opportunities in Power Distribution Network System Optimization (from EDA Perspective) Outline! SmartGrid: What it is! Power Distribution Network
More informationBIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013
BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013 PERSONAL BACKGROUND Founder of the first specialist Service Management & Helpdesk System provider in Europe Past President of AFSMI
More informationSCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS
Sean Lee Solution Architect, SDI, IBM Systems SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS Agenda Converging Technology Forces New Generation Applications Data Management Challenges
More informationIntroduction to Predictive Analytics. Dr. Ronen Meiri ronen@dmway.com
Introduction to Predictive Analytics Dr. Ronen Meiri Outline From big data to predictive analytics Predictive Analytics vs. BI Intelligent platforms What can we do with it. The modeling process. Example
More informationAlgorithms and Methods for Distributed Storage Networks 7 File Systems Christian Schindelhauer
Algorithms and Methods for Distributed Storage Networks 7 File Systems Institut für Informatik Wintersemester 2007/08 Literature Storage Virtualization, Technologies for Simplifying Data Storage and Management,
More informationlesson 1 An Overview of the Computer System
essential concepts lesson 1 An Overview of the Computer System This lesson includes the following sections: The Computer System Defined Hardware: The Nuts and Bolts of the Machine Software: Bringing the
More informationCSCA0102 IT & Business Applications. Foundation in Business Information Technology School of Engineering & Computing Sciences FTMS College Global
CSCA0102 IT & Business Applications Foundation in Business Information Technology School of Engineering & Computing Sciences FTMS College Global Chapter 2 Data Storage Concepts System Unit The system unit
More informationDigital Earth: Big Data, Heritage and Social Science
Digital Earth: Big Data, Heritage and Social Science The impact on geographic information and GIS Geographic Information Systems Analysis for Decision Support Impact of Big Data Digital Earth Citizen Engagement
More informationBackground Information Data Uses Strategies and Plans Summary Open Discussion/Questions. Art Cadorine, ISO Pete Marotta, ISO Tracy Spadola, Teradata
1 Background Information Data Uses Strategies and Plans Summary Open Discussion/Questions 2 Art Cadorine, ISO Pete Marotta, ISO Tracy Spadola, Teradata 3 4 1343: first formal policy written in Italy 1494:
More informationLARGE, DISTRIBUTED COMPUTING INFRASTRUCTURES OPPORTUNITIES & CHALLENGES. Dominique A. Heger Ph.D. DHTechnologies, Data Nubes Austin, TX, USA
LARGE, DISTRIBUTED COMPUTING INFRASTRUCTURES OPPORTUNITIES & CHALLENGES Dominique A. Heger Ph.D. DHTechnologies, Data Nubes Austin, TX, USA Performance & Capacity Studies Availability & Reliability Studies
More informationAge of Big data. Presented by: Mohammad Iqbal BCM -2014
Age of Presented by: Mohammad Iqbal BCM -2014 Agenda Big? Big evolution from Big? Name Symbol Value Kilobyte KB 10^3 BIG DATA Megabyte MB 10^6 Gigabyte GB 10^9 Terabyte TB 10^12 Petabyte PB 10^15 So large
More informationArchitecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
More informationThe Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP
The Big Deal about Big Data Mike Skinner, CPA CISA CITP HORNE LLP Mike Skinner, CPA CISA CITP Senior Manager, IT Assurance & Risk Services HORNE LLP Focus areas: IT security & risk assessment IT governance,
More informationData Centric Systems (DCS)
Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems
More informationDISCOVERING ediscovery
DISCOVERING ediscovery Purpose This paper is the first in a series that are designed to educate organisations and increase awareness in the area of ediscovery technology. What is ediscovery? Electronic
More information# Not a part of 1Z0-061 or 1Z0-144 Certification test, but very important technology in BIG DATA Analysis
Section 9 : Case Study # Objectives of this Session The Motivation For Hadoop What problems exist with traditional large-scale computing systems What requirements an alternative approach should have How
More informationBursting to a Hybrid Cloud for Services OFC 2015
Bursting to a Hybrid Cloud for Services OFC 2015 Big Data applications Big Compute in the cloud Why burst to the cloud? Opportunities 2 Big Data Apps Need Big Compute Life Sciences Bioinformatics Next
More informationData Management Nuts and Bolts. Don Johnson Scientific Computing and Visualization
Data Management Nuts and Bolts Don Johnson Scientific Computing and Visualization Overview Data Management Storing data Sharing data Moving data Tracking data (Client responsibility) Where can you obtain
More informationApplications for Business Intelligence, Predictive Analytics and Big Data
Finance, Management, & Operations Applications for Business Intelligence, Predictive Analytics and Big Data Patrick Bogan, Chief Information Officer, Fuzion Analytics Kyle Korzenowski, Chief Information
More informationCloud Computing Where ISR Data Will Go for Exploitation
Cloud Computing Where ISR Data Will Go for Exploitation 22 September 2009 Albert Reuther, Jeremy Kepner, Peter Michaleas, William Smith This work is sponsored by the Department of the Air Force under Air
More informationPervasive Location Analytics and A Billion Dollar Opportunity. Jitender Aswani, Portfolio Strategist, Business Analytics, SAP
[ Pervasive Location Analytics and A Billion Dollar Opportunity Jitender Aswani, Portfolio Strategist, Business Analytics, SAP Agenda 1. Big Data & Pervasive Nature of Location-Based Solutions 2. Location
More informationSo Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI
So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI Disclosures James E. Tcheng, MD, FACC, FSCAI Affiliations / Financial Relationships / Other RWI ACC Chair, Informatics and Health IT Task Force
More informationHow To Use Big Data In Healthcare
Big data challenges hll and opportunities in healthcare: h application to detecting faint signals Dr. Greg Slabaugh City University London School of Informatics Data, data, and more data According to IBM,
More informationDiscovering Computers 2008. Chapter 7 Storage
Discovering Computers 2008 Chapter 7 Storage Chapter 7 Objectives Differentiate between storage devices and storage media Describe the characteristics of magnetic disks Describe the characteristics of
More informationHealth Data Analytics and Decision Support Prof.Dr. Bart De Moor
Bart.DeMoor@iminds.be iminds Department MEDICAL IT Health Data Analytics and Decision Support Prof.Dr. Bart De Moor Department Med IT Trends P3 x P4 medicine Decision support cases MEDICAL IT DEPARTMENT
More informationBIG DATA CHALLENGES AND PERSPECTIVES
BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,
More informationVCHRISTIANA CARE HEALTH SYSTEM VALUE INSTITUTE
Working with VCHRISTIANA CARE HEALTH SYSTEM VALUE INSTITUTE Edward Ewen, MD 11/6/2015 Overview What is BIG DATA? Why is it important to us? How is it used? What are the challenges? What are some examples
More informationwww.pwc.com Game On: How Information is Changing the Rules of Insurance
www.pwc.com Game On: How Information is Changing the Rules of Insurance Game On: How Information is Changing the Rules of Insurance The ability to extract meaningful insights from information assets is
More informationCloud beyond the obvious, an approach for innovation
Cloud beyond the obvious, an approach for innovation Christian Verstraete Chief Technologist Cloud Strategy Our World is Changing Living in the age of tectonic shifts, and welcome to the new style of IT
More informationArchitectures for massive data management
Architectures for massive data management Apache Kafka, Samza, Storm Albert Bifet albert.bifet@telecom-paristech.fr October 20, 2015 Stream Engine Motivation Digital Universe EMC Digital Universe with
More informationTechnology in Action. Alan Evans Kendall Martin Mary Anne Poatsy. Tenth Edition. Copyright 2014 Pearson Education, Inc. Publishing as Prentice Hall
Technology in Action Alan Evans Kendall Martin Mary Anne Poatsy Tenth Edition Copyright 2014 Pearson Education, Inc. Publishing as Prentice Hall Technology in Action Chapter 2 Looking at Computers Understanding
More informationReduction of Data at Namenode in HDFS using harballing Technique
Reduction of Data at Namenode in HDFS using harballing Technique Vaibhav Gopal Korat, Kumar Swamy Pamu vgkorat@gmail.com swamy.uncis@gmail.com Abstract HDFS stands for the Hadoop Distributed File System.
More informationBig Data: Public Sector Opportunities, Challenges, and Implications
Big Data: Public Sector Opportunities, Challenges, and Implications Timothy M. Persons, Ph.D. Chief Scientist U.S. Government Accountability Office personst@gao.gov / www.gao.gov / @GAOChfScientist Presentation
More informationLaurence Liew General Manager, APAC. Economics Is Driving Big Data Analytics to the Cloud
Laurence Liew General Manager, APAC Economics Is Driving Big Data Analytics to the Cloud Big Data 101 The Analytics Stack Economics of Big Data Convergence of the 3 forces Big Data Analytics in the Cloud
More informationData Centric Computing Revisited
Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data
More informationData-Flow Awareness in Parallel Data Processing
Data-Flow Awareness in Parallel Data Processing D. Bednárek, J. Dokulil *, J. Yaghob, F. Zavoral Charles University Prague, Czech Republic * University of Vienna, Austria 6 th International Symposium on
More informationDiscovering Computers 2011. Living in a Digital World
Discovering Computers 2011 Living in a Digital World Objectives Overview Differentiate among various styles of system units on desktop computers, notebook computers, and mobile devices Identify chips,
More informationDATA. SMALL WORD. COLOSSAL IMPLICATIONS. 312.756.1760 info@spr.com www.spr.com SPR Consulting
DATA. SMALL WORD. COLOSSAL IMPLICATIONS. 312.756.1760 info@spr.com www.spr.com SPR Consulting Data, not such a small word after all. Truth is, whether we call it big data or just data, the phenomenon of
More informationReal Time Big Data Processing
Real Time Big Data Processing Cloud Expo 2014 Ian Meyers Amazon Web Services Global Infrastructure Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure
More informationCloud storage Megas, Gigas and Teras
Cloud storage Megas, Gigas and Teras I think that I need cloud storage. I have photos, videos, music and documents on my computer that I can only retrieve from my computer. If my computer got struck by
More informationBig Data Analytics. Genoveva Vargas-Solar http://www.vargas-solar.com/big-data-analytics French Council of Scientific Research, LIG & LAFMIA Labs
1 Big Data Analytics Genoveva Vargas-Solar http://www.vargas-solar.com/big-data-analytics French Council of Scientific Research, LIG & LAFMIA Labs Montevideo, 22 nd November 4 th December, 2015 INFORMATIQUE
More informationDatabase Fundamentals
Database Fundamentals Computer Science 105 Boston University David G. Sullivan, Ph.D. Bit = 0 or 1 Measuring Data: Bits and Bytes One byte is 8 bits. example: 01101100 Other common units: name approximate
More informationUsing Ultra-Large Data Sets in Healthcare New Questions-New Answers
Using Ultra-Large Data Sets in Healthcare New Questions-New Answers David Hartzband, D.Sc.. Director, Technology Research, RCHN Community Health Foundation & Lecturer, Engineering Systems Division Massachusetts
More informationBIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
More informationBig Data System and Architecture
CHANGE, a 2012 DAC workshop 2nd International Workshop on Computing in Heterogeneous, Autonomous 'N' Goal-oriented Environments Moscone Center, San Francisco, California, June 3, 2012 Big Data System and
More informationHow To Store Data On A Computer (For A Computer)
TH3. Data storage http://www.bbc.co.uk/schools/gcsebitesize/ict/ A computer uses two types of storage. A main store consisting of ROM and RAM, and backing stores which can be internal, eg hard disk, or
More informationDetermining Your Computer Resources
Determining Your Computer Resources There are a number of computer components that must meet certain requirements in order for your computer to perform effectively. This document explains how to check
More informationChapter 4 System Unit Components. Discovering Computers 2012. Your Interactive Guide to the Digital World
Chapter 4 System Unit Components Discovering Computers 2012 Your Interactive Guide to the Digital World Objectives Overview Differentiate among various styles of system units on desktop computers, notebook
More informationCHAPTER 2: HARDWARE BASICS: INSIDE THE BOX
CHAPTER 2: HARDWARE BASICS: INSIDE THE BOX Multiple Choice: 1. Processing information involves: A. accepting information from the outside world. B. communication with another computer. C. performing arithmetic
More informationBig Data and Big Data Modeling
Big Data and Big Data Modeling The Age of Disruption Robin Bloor The Bloor Group March 19, 2015 TP02 Presenter Bio Robin Bloor, Ph.D. Robin Bloor is Chief Analyst at The Bloor Group. He has been an industry
More informationData De-duplication Methodologies: Comparing ExaGrid s Byte-level Data De-duplication To Block Level Data De-duplication
Data De-duplication Methodologies: Comparing ExaGrid s Byte-level Data De-duplication To Block Level Data De-duplication Table of Contents Introduction... 3 Shortest Possible Backup Window... 3 Instant
More informationAnalytical Tools: What Auditors Need to Know About Big Data
Analytical Tools: What Auditors Need to Know About Big Data Timothy M. Persons, Ph.D. Chief Scientist U.S. Government Accountability Office personst@gao.gov / www.gao.gov / @GAOChfScientist Presentation
More informationData Mining and Machine Learning in Bioinformatics
Data Mining and Machine Learning in Bioinformatics PRINCIPAL METHODS AND SUCCESSFUL APPLICATIONS Ruben Armañanzas http://mason.gmu.edu/~rarmanan Adapted from Iñaki Inza slides http://www.sc.ehu.es/isg
More informationMigrating NASA Archives to Disk: Challenges and Opportunities. NASA Langley Research Center Chris Harris June 2, 2015
Migrating NASA Archives to Disk: Challenges and Opportunities NASA Langley Research Center Chris Harris June 2, 2015 MSST 2015 Topics ASDC Who we are? What we do? Evolution of storage technologies Why
More informationExtension Technology Academy February 10 th, 2014
Extension Technology Academy February 10 th, 2014 1 Kilobyte (KB) = 1,024 Bytes 1 Megabyte (MB) = 1,024 Kilobytes 1 Gigabyte (GB) = 1,024 Megabytes 1 Terabyte (TB) = 1,024 Gigabytes 64 KB Cloud definition
More informationBig Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect
on AWS Services Overview Bernie Nallamotu Principle Solutions Architect \ So what is it? When your data sets become so large that you have to start innovating around how to collect, store, organize, analyze
More informationHigh Frequency Trading and NoSQL. Peter Lawrey CEO, Principal Consultant Higher Frequency Trading
High Frequency Trading and NoSQL Peter Lawrey CEO, Principal Consultant Higher Frequency Trading Agenda Who are we? Brief introduction to OpenHFT. What does a typical trading system look like What requirements
More informationDATA ANALYTICS: GO BIG OR GO HOME
SPECIAL ADVERTISING SECTION businessweek.com/adsections DATA ANALYTICS: GO BIG OR GO HOME S1 THE ERA OF BIG DATA a time when petabytes of information on consumer behavior and countless other topics fly
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University. http://cs246.stanford.edu
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 2 CPU Memory Machine Learning, Statistics Classical Data Mining Disk 3 20+ billion web pages x 20KB = 400+ TB
More informationThe HP IT Transformation Story
The HP IT Transformation Story Continued consolidation and infrastructure transformation impacts to the physical data center Dave Rotheroe, October, 2015 Why do data centers exist? Business Problem Application
More informationEnvironmental Report Fiscal Year 2014
Environmental Report Fiscal Year 2014 Executive summary Global climate change caused by increasing concentrations of atmospheric carbon dioxide is one of the most significant concerns facing our world
More informationExamine Your Strategy, Weigh Your Options in Choosing Removable Storage. Dave Holmstrom Associate Director of New Products Verbatim Corporation
Examine Your Strategy, Weigh Your Options in Choosing Removable Storage Dave Holmstrom Associate Director of New Products Verbatim Corporation Evaluating Storage Solutions What type of data is being stored?
More informationGrab some coffee and enjoy the pre-show banter before the top of the hour!
Grab some coffee and enjoy the pre-show banter before the top of the hour! Think Big: How to Design a Big Data Information Architecture Exploratory Webcast January 22, 2014 Guests Robin Bloor Chief Analyst,
More informationMining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University
Mining Big Data Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University Website: http://www.cse.msu.edu/~ptan Google Trends Big Data Smart Cities Big Data and
More informationUBI data management Granularity and related considerations
UBI data management Granularity and related considerations March 11, 2015 Casualty Actuarial Society RPM Seminar, Dallas USA By : Germain Denoncourt, FCIA, FCAS 1 Agenda Introduction Data quantity data
More informationCrossing the Performance Chasm with OpenPOWER
Crossing the Performance Chasm with OpenPOWER Dr. Srini Chari Cabot Partners/IBM chari@cabotpartners.com #OpenPOWERSummit Join the conversation at #OpenPOWERSummit 1 Disclosure Copyright 215. Cabot Partners
More informationHOW TO BECOME AN ESI HERO
HOW TO BECOME AN ESI HERO taking the mystery out of ediscovery www.fxhnd.com info@fxhnd.com Electronically Stored Information Boo! But why do I have to learn about all this technology? It s how we communicate
More information22S:295 Seminar in Applied Statistics High Performance Computing in Statistics
22S:295 Seminar in Applied Statistics High Performance Computing in Statistics Luke Tierney Department of Statistics & Actuarial Science University of Iowa August 30, 2007 Luke Tierney (U. of Iowa) HPC
More informationChapter 6. Inside the System Unit. What You Will Learn... Computers Are Your Future. What You Will Learn... Describing Hardware Performance
What You Will Learn... Computers Are Your Future Chapter 6 Understand how computers represent data Understand the measurements used to describe data transfer rates and data storage capacity List the components
More informationIntroduction to Computer & Information Systems
Introduction to Computer & Information Systems Binnur Kurt kurt@ce.itu.edu.tr Istanbul Technical University Computer Engineering Department Copyleft 2005 1 Version 0.1 About the Lecturer BSc İTÜ, Computer
More informationSecondary Storage. Any modern computer system will incorporate (at least) two levels of storage: magnetic disk/optical devices/tape systems
1 Any modern computer system will incorporate (at least) two levels of storage: primary storage: typical capacity cost per MB $3. typical access time burst transfer rate?? secondary storage: typical capacity
More informationThe Genealogy Cloud: Which Online Storage Program is Right For You Page 1 2012, copyright High-Definition Genealogy. All rights reserved.
The Genealogy Cloud: Which Online Storage Program is Right For You Thomas MacEntee, of High-Definition Genealogy http://hidefgen.com hidefgen@gmail.com Clouds in Genealogy? What is the Genealogy Cloud?
More informationThe Trials and Tribulations and ultimate success of parallelisation using Hadoop within the SCAPE project
The Trials and Tribulations and ultimate success of parallelisation using Hadoop within the SCAPE project Alastair Duncan STFC Pre Coffee talk STFC July 2014 SCAPE Scalable Preservation Environments The
More informationKeynote: Studiedag Statistische Software Vlaamse Hogescholenraad Jeroen Derynck - ICT Director @ iminds
Keynote: Studiedag Statistische Software Vlaamse Hogescholenraad Jeroen Derynck - ICT Director @ iminds We see Information Technology as the driving force of innovation in many sectors of society We are
More informationand processes between Magistrates Courts and Corrections Victoria. The project consisted of two stages.
Magistrates Court of Victoria Video Conference Project (VCP) Project Guide GENERAL PROJECT INFORMATION 1. The project is the installation and trial of modern internet based video conference technology
More informationWhat s New About Big Data? Implications for the Accountability Community
What s New About Big Data? Implications for the Accountability Community Timothy M. Persons, Ph.D. Chief Scientist U.S. Government Accountability Office personst@gao.gov / www.gao.gov Presentation to the
More informationLogical Operations. Control Unit. Contents. Arithmetic Operations. Objectives. The Central Processing Unit: Arithmetic / Logic Unit.
Objectives The Central Processing Unit: What Goes on Inside the Computer Chapter 4 Identify the components of the central processing unit and how they work together and interact with memory Describe how
More informationTaming Big Data Storage with Crossroads Systems StrongBox
BRAD JOHNS CONSULTING L.L.C Taming Big Data Storage with Crossroads Systems StrongBox Sponsored by Crossroads Systems 2013 Brad Johns Consulting L.L.C Table of Contents Taming Big Data Storage with Crossroads
More information416 Agriculture Hall Michigan State University 517-355-3776 http://support.anr.msu.edu support@anr.msu.edu
416 Agriculture Hall Michigan State University 517-355-3776 http://support.anr.msu.edu support@anr.msu.edu Title: ANR TS How To Efficiently Remove Items In Outlook To Free Up Space Document No. - 162 Revision
More informationA Survey on Big Data Concepts and Tools
A Survey on Big Data Concepts and Tools D. Rajasekar 1, C. Dhanamani 2, S. K. Sandhya 3 1,3 PG Scholar, 2 Assistant Professor, Department of Computer Science and Engineering, Sri Krishna College of Engineering
More informationCopyright (c) 2012, Meta Business Systems. Mario Bojilov Meta Business Systems 20 February 2013
Mario Bojilov Meta Business Systems 20 February 2013 What is Big Data Volume 90% of data in the world was created in the last 2 years What is Big Data Volume 90% of data in the world was created in the
More informationData Storage for the Digital Content Value Chain Thomas M. Coughlin President Coughlin Associates
Storage and Entertainment Magazine, January 2003 Data Storage for the Digital Content Value Chain Thomas M. Coughlin President Coughlin Associates 1. Introduction The digitization of human content is now
More informationBig Data Just Noise or Does it Matter?
Big Data Just Noise or Does it Matter? Opportunities for Continuous Auditing Presented by: Solon Angel Product Manager Servers The CaseWare Group. Founded in 1988. An industry leader in providing technology
More informationFile System Management
Lecture 7: Storage Management File System Management Contents Non volatile memory Tape, HDD, SSD Files & File System Interface Directories & their Organization File System Implementation Disk Space Allocation
More informationDIGITAL MARKETING STRATEGIES Leveraging The Back-End Tools
DIGITAL MARKETING STRATEGIES Leveraging The Back-End Tools Professional Background RACING INDUSTRY EXPERIENCE: First Job Out of Undergrad: - Arlington Park, Assistant to the VP of Marketing - Sponsorship
More informationHardware Configuration Guide
Hardware Configuration Guide Contents Contents... 1 Annotation... 1 Factors to consider... 2 Machine Count... 2 Data Size... 2 Data Size Total... 2 Daily Backup Data Size... 2 Unique Data Percentage...
More informationAcronis Backup Deduplication. Technical Whitepaper
Acronis Backup Deduplication Technical Whitepaper Table of Contents Table of Contents Table of Contents... 1 Introduction... 3 Storage Challenges... 4 How Deduplication Helps... 5 How It Works... 6 Deduplication
More informationHADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW
HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW 757 Maleta Lane, Suite 201 Castle Rock, CO 80108 Brett Weninger, Managing Director brett.weninger@adurant.com Dave Smelker, Managing Principal dave.smelker@adurant.com
More informationCan we Analyze all the Big Data we Collect?
DBKDA/WEB Panel 2015, Rome 28.05.2015 DBKDA Panel 2015, Rome, 27.05.2015 Reutlingen University Can we Analyze all the Big Data we Collect? Moderation: Fritz Laux, Reutlingen University, Germany Panelists:
More information