Big Data and Future Networks: A Perspective from the United States

1 Big Data and Future Networks: A Perspective from the United States
Hisashi Kobayashi (小林久志)
Princeton University and National Institute of Information and Communications Technology

2 Acknowledgments
Prof. Tadao Saito, Toyota Info Technology Center
Dr. Nozomu Nishinaga, Mr. Masahiro Kiyokawa, and Mr. Hiroaki Yano, NICT
Prof. Mung Chiang, Princeton University
Dr. Evangelos Eleftheriou, IBM Zurich Research Lab
Mr. Kaiser Fung, Author, Numbers Rule Your World
Dr. Kazuo Iwano, Mitsubishi Corporation
Prof. Brian L. Mark, George Mason University
Prof. Dipankar Raychaudhuri, Rutgers University
Prof. Phuoc Tran-Gia, University of Würzburg
Prof. Howard Wactlar, CMU and NSF CISE Directorate
Prof. Philip Yu, University of Illinois at Chicago
2 Big Data and Future Network Design Hisashi Kobayashi

3 Outline
How Much Information? How Big is Data?
President Obama's Open Government Initiative
President Obama's Big Data Initiative
Big Data in Science and Technology Research
- NITRD Program, NSF, DARPA, DOE
Big Data in Enterprises
Call for Data Science and Data Scientists
Big Data and Networks
References

4 HOW MUCH INFORMATION? HOW BIG IS DATA?

5 Source: The World of Data (by IBM)

6 How Much Data was Out There? [Kobayashi et al. 2005]
Online (disk drives, file systems): 300 Petabytes
Offline (magnetic tape, CDs): 8 Exabytes
Analog data (paper, film, videotape): 200 Exabytes
Petabyte: 1,000,000,000,000,000 bytes, or 10^15 bytes
Exabyte: 1,000,000,000,000,000,000 bytes, or 10^18 bytes
cf. Report by a U.C. Berkeley research group.
Source: /

7 Some Big Numbers
0.43 x 10^18 seconds: The age of the universe (13.77 billion years).
5 Exabytes: All words ever spoken by human beings (in text). Roy Williams (Caltech, 1993)
21 Exabytes/month: Global Internet traffic in 2007. Padmasree Warrior (Cisco, March 2010)
160 Exabytes: Digital information created, captured, and replicated worldwide in 2007 (International Data Corporation, 2007)
42 Zettabytes: All words ever spoken by human beings (if digitized in 6kHz 16 bit audio). Mark Liberman (U. Penn, 2003)
Prefixes: kilo 10^3, Mega 10^6, Giga 10^9, Tera 10^12, Peta 10^15, Exa 10^18, Zetta 10^21, Yotta 10^24
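The figures on this slide can be checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes a Julian year of 365.25 days and decimal (power-of-ten) prefixes.

```python
# Assumption: one year = 365.25 days (Julian year).
SECONDS_PER_YEAR = 365.25 * 24 * 60 * 60

# Decimal prefixes expressed as powers of ten.
SI_PREFIXES = {
    "kilo": 3, "mega": 6, "giga": 9, "tera": 12,
    "peta": 15, "exa": 18, "zetta": 21, "yotta": 24,
}


def age_of_universe_seconds(years: float = 13.77e9) -> float:
    """Convert the age of the universe from years to seconds."""
    return years * SECONDS_PER_YEAR


def to_bytes(value: float, prefix: str) -> float:
    """Convert a quantity such as 5 exabytes into a plain byte count."""
    return value * 10 ** SI_PREFIXES[prefix]


if __name__ == "__main__":
    # Age of the universe, to compare with 0.43 x 10^18 seconds.
    print(f"{age_of_universe_seconds():.2e} seconds")
    # Ratio between the audio and text estimates of all spoken words.
    print(to_bytes(42, "zetta") / to_bytes(5, "exa"))
```

Under these assumptions the audio estimate (42 Zettabytes) is several thousand times larger than the text estimate (5 Exabytes), which is the contrast the slide draws.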

8 Source: Asigra Info Graphic

9 Source: The Retailer's Guide

10 Source: January 2011, Davos, Switzerland

11 Every day, we create 2.5 quintillion (2.5 x 10^18) bytes (i.e., 2.5 Exabytes) of data, so much that 90% of the data in the world today has been created in the last two years alone. [IBM]
Raw data has little value by itself. We must process data and extract information in a usable form.
- Big Data tools, e.g., Apache Hadoop, MapReduce
- Data Science (data mining, machine learning)
- Need for advancing statistical analysis techniques that are scalable.
We must then turn the information into valuable action, e.g., Amazon.com, a better government.

12 Open Government Initiative
"My administration is committed to creating an unprecedented level of openness in Government. We will work together to ensure the public trust and establish a system of transparency, public participation, and collaboration. Openness will strengthen our democracy and promote efficiency and effectiveness in Government."
President Barack Obama, 01/21/09

13 Government should be transparent
- Transparency promotes accountability and provides information to citizens.
Government should be participatory
- Knowledge is widely dispersed in society, and public officials benefit from having access to that knowledge.
Government should be collaborative
- We should use innovative tools, methods and systems to cooperate with nonprofit organizations, businesses, and individuals in the private sector.

14 Open Government Directive
1. Publish Government Information Online
2. Improve the Quality of Government Information
3. Create and Institutionalize a Culture of Open Government
4. Create an Enabling Policy Framework for Open Government
-- Peter R. Orszag, Director, Office of Management and Budget, 12/8/09
oranda_2010/m10-06.pdf

15 Source: Howard Wactlar, NSF CISE Directorate, at NIST Big Data Meeting, June

16 President Obama's Big Data Initiative
To advance state-of-the-art technologies to collect, store, preserve, manage, analyze and share Big Data.
To accelerate the pace of discovery in science and engineering, strengthen national security, and transform teaching and learning.
To expand the workforce needed to develop and use Big Data technologies.
More than $200 million in new commitments through six Federal departments and agencies.
- Office of Science and Technology Policy (OSTP) announced on March 29

17 BIG DATA IN SCIENCE AND TECHNOLOGY RESEARCH

18 NITRD (Networking and Information Technology Research and Development) Program
Provides a framework in which many Federal agencies coordinate their R&D efforts on networking and IT.
Operates under the aegis of the NITRD Subcommittee of the National Science and Technology Council (NSTC)'s Committee on Technology.
The National Coordination Office (NCO) supports the NITRD Program by providing technical expertise, planning and coordination, and by serving as the Program's central point of contact.

20 The NITRD Program's focus:
Big Data (BD)
Cyber Physical Systems (CPS)
Cyber Security and Information Assurance (CSIA)
Health Information Technology R&D (Health IT R&D)
Human Computer Interaction and Information Management (HCI&IM)
Etc.

23 Source: Howard Wactlar, NSF CISE Directorate, at NIST Big Data Meeting, June

24 Source: Howard Wactlar, NSF CISE Directorate, at NIST Big Data Meeting, June

25 XDATA Program
Invest $25 million/year
Develop computational techniques and software tools for both semi-structured (e.g., tabular, relational, categorical, meta-data) and unstructured (e.g., text documents, message traffic) data.
- Scalable algorithms for processing imperfect data in distributed data stores
- Effective human-computer interaction tools for rapidly customizable visual reasoning

26 DOE's Scalable Data Management, Analysis and Visualization (SDAV) Institute ($25 million over 5 years)
Project Leader: Dr. Arie Shoshani, Lawrence Berkeley National Laboratory

27 BIG DATA IN ENTERPRISES

29 The Big Data market will exceed $50B worldwide by [...]

30 The Big Data Market: IDC Japan's Forecast
2011: [...] yen
2012: 19.7 billion yen
2016: 76.5 billion yen
The current Big Data market is a little over 0.1% of the overall IT market of 13 trillion yen.

31 Another Forecast is much Bigger (by an order of magnitude)
Source:

32 Big Data: The Management Revolution
Success story of Amazon.com: 30-40% annual growth in [...] [HBR]
Data Analytics (DA) will replace the HiPPO. HiPPO = Highest Paid Person's Opinion [HBR]
Data analysts (or data scientists) are in short supply.
[HBR]: Harvard Business Review, October 2012; Diamond Harvard Business Review, "The First Year of Big Data Competition," February

33 Big Data in Enterprises, cont'd
Big Data exceeds the processing capacity of conventional relational database systems.
Big Data primarily addresses the database (DB)/data warehousing (DWH) aspect of data analysis.
Apache Hadoop is the first technology for Big Data.
-- Distributed data storage
-- Analysis algorithms for parallel data

34 Apache Hadoop
A distributed computational framework that can process a wide range of datasets.
High-performance parallel data processing using MapReduce.
Reliable data storage using the Hadoop Distributed File System (HDFS).
- Data access follows the NoSQL ("Not only SQL") approach
Typical users seem obsessed with quantity, not quality, of data. More thought should be given to how to collect and select data [Kaiser Fung].
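The MapReduce style of parallel processing mentioned above can be illustrated with a word count, the usual introductory example. This is a single-process sketch in plain Python, not Hadoop code: the function names are hypothetical, and a real cluster would run the map and reduce phases on many machines with the shuffle performed by the framework.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def map_phase(document: str) -> List[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups: Dict[str, List[int]]) -> Dict[str, int]:
    """Reduce: combine the values of each key into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(documents: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle and reduce in sequence over all documents."""
    pairs: List[Tuple[str, int]] = []
    for document in documents:
        pairs.extend(map_phase(document))
    return reduce_phase(shuffle(pairs))


if __name__ == "__main__":
    print(word_count(["big data", "Big networks"]))
```

Because each call to `map_phase` touches only one document and each key is reduced independently, both phases can be distributed, which is the property the slide refers to as high-performance parallel data processing.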

35 How to handle 3 Vs [IBM]
1. Volume:
- Massively parallel processing (e.g., Greenplum data computing)
- Distributed computing platform (e.g., Apache Hadoop)
2. Velocity:
- Processing of streaming data to keep storage requirements practical (e.g., Large Hadron Collider at CERN)
- Instantaneous response in some applications (e.g., financial trading)
3. Variety:
- Need to deal with diverse data types and sources (e.g., text from SNS, data from sensors, image data, GPS data from mobile phones, etc.)
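The Velocity point, processing streaming data so that storage stays practical, can be sketched with a one-pass summary that never stores the stream itself. The example below uses Welford's online algorithm for mean and variance; the class name is hypothetical and the choice of statistic is an assumption made for illustration, not something the slide prescribes.

```python
class RunningStats:
    """One-pass mean and sample variance (Welford's online algorithm)."""

    def __init__(self) -> None:
        self.count = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x: float) -> None:
        """Fold one new observation into the summary, then discard it."""
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self) -> float:
        """Sample variance of everything seen so far (0.0 if fewer than 2)."""
        return self._m2 / (self.count - 1) if self.count > 1 else 0.0


if __name__ == "__main__":
    stats = RunningStats()
    for reading in [2, 4, 4, 4, 5, 5, 7, 9]:  # stands in for a sensor stream
        stats.update(reading)
    print(stats.count, stats.mean, stats.variance)
```

The memory used is constant regardless of how many readings arrive, which is the trade the slide describes: keep a compact summary and let the raw stream go.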

36 Big Data Platform
Data Warehousing (DWH): Store large volumes of information from multiple sources.
Hadoop-based Analytics: Reduce the cost of analyzing massive data.
Unstructured Database (as well as RDB) and NoSQL
Stream Computing: Continuously analyze data to take action in real time.
Text Analytics (or Text Mining): Analyze textual content of unstructured information, using information retrieval, data mining, machine learning, statistics and computational linguistics.
Data Visualization Tools (or Infographics): Real-time processing and dashboard presentation, e.g., Tableau, Spotfire, etc.

37 Some Vendors of Big Data Tools
Greenplum: founded in [...], acquired by EMC in 2010
Netezza: founded in [...], acquired by IBM in 2010 for $1.7B
SPSS: founded in [...], acquired by IBM in 2009 for $1.2B
Vertica (acquired by HP)
Oracle, SAP and Microsoft also provide Big Data tools.
For Japan, see Nikkei Computer, January 10, 2013 issue.

38 Call for Better DATA SCIENCE and More DATA SCIENTISTS

39 Try to gain insights from data, instead of presenting all collected data.
Study and extend classical statistical techniques:
- Exploratory Data Analysis (EDA)
- Time Series Analysis
- Hidden Markov Models (HMMs)
- Bayesian Statistics and MCMC
- etc.
Scalable Algorithms and Analytics, e.g., the PageRank algorithm (an efficient algorithm to compute the principal eigenvector of a Markov transition matrix)
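The PageRank remark can be made concrete with power iteration on a small link graph. The following is a minimal sketch, not the production algorithm: the function name, the damping factor of 0.85, and the uniform treatment of pages without outgoing links are assumptions chosen for illustration.

```python
from typing import Dict, List


def pagerank(links: Dict[str, List[str]], damping: float = 0.85,
             tol: float = 1e-10, max_iter: int = 200) -> Dict[str, float]:
    """Approximate the stationary distribution of the damped link-following
    Markov chain by repeated multiplication (power iteration)."""
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(max_iter):
        # Every node receives the same teleportation share.
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node in nodes:
            targets = links.get(node, [])
            if targets:
                share = damping * rank[node] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling node: spread its rank uniformly over all nodes.
                share = damping * rank[node] / n
                for target in nodes:
                    new_rank[target] += share
        change = sum(abs(new_rank[v] - rank[v]) for v in nodes)
        rank = new_rank
        if change < tol:
            break
    return rank


if __name__ == "__main__":
    graph = {"a": ["c"], "b": ["c"], "c": ["a"]}
    for page, score in sorted(pagerank(graph).items()):
        print(page, round(score, 4))
```

Each iteration only walks the edge list once, so the cost per step grows with the number of links rather than with the square of the number of pages; that is why the method scales to very large graphs.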

41 Important Subfields of Data Mining
Data stream mining [Aggarwal]
- Computer network traffic
- Web searches
- Sensor data
Graph mining [Aggarwal]
- Web data
- Social network analysis
- Bioinformatics
C. C. Aggarwal (Ed.), Data Streams: Models and Algorithms, Kluwer Academic Publisher
C. C. Aggarwal and H. Wang (Eds.), Managing and Mining Graph Data, Springer

43 Japan's Serious Shortage of Data Scientists
Number of new graduates with knowledge of data analysis (statistics, machine learning, etc.) in 2008: United States 24,730; China 17,410; India 13,270; Japan 3,400. (Annual change: +10.4% in China, -5.3% in Japan.)
Source:
Number of SAS (Statistical Analysis System) certified professionals: United States 10,544; India 5,907; South Korea 1,381; United Kingdom 1,242; Japan 800.
Number of SAS certified professionals per unit of GDP (United States = 100): United States 100; India 458; South Korea 177; United Kingdom 73; Japan 20.
Source: Diamond Harvard Business Review, Feb

44 [McKinsey] Big data: The next frontier for innovation, competition and productivity, McKinsey & Co., May

45 BIG DATA and NETWORKS

46 Source: What happens in an Internet Minute? (by Intel)

47 Big Data vs. Networks
Networks to cope with Big Data.
- Sufficient storage, bandwidth and processing
Big Data to help design and manage Networks.
- Better performance, reliability and security
Big Data and Networks for a better world.
- Transparent government, law enforcement
- Risk management
- Innovative applications for value creation, e.g., user behavior tracking and marketing (privacy and security are critical).

48 Cloud Computing & Networking: A Platform for Big Data
Cloud computing offers on-demand access to a shared pool of configurable resources.
Big Data requires a novel approach to meet the storage and processing requirements.
The Cloud can make big data (analytics) accessible to those who could not otherwise use it.
Disk storage performance can be a problem when storage is shared by various users.

49 OpenFlow and FLARE will help Data Centers handle Big Data
Help control connectivity among Data Centers for big data analytics via virtualization.
Especially useful in a multi-tenant Data Center environment.
Facilitate load balancing among Data Centers.
FLARE: Deeply Programmable Network (DPN) Architecture by Aki Nakao

50 ID/Locator Separation and Context-oriented Service for Big Data
Here, context means data attributes, e.g., identity, group association, time, location, etc.
Data Centric Networking (also called Named Data Networking or NDN) appears to be a suitable approach to Big Data, but its performance implications are unclear.
GUID (Globally Unique ID) of MobilityFirst also facilitates context-oriented service.
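The idea of separating a stable identifier from a changeable network location, and of finding data by its context, can be sketched as a small lookup table. Everything here is hypothetical: the `NameResolver` class, its methods, and the example addresses are illustrative only and do not reproduce the MobilityFirst or NDN interfaces.

```python
import uuid
from typing import Dict, List


class NameResolver:
    """Toy resolver: a GUID stays fixed while its locator may change."""

    def __init__(self) -> None:
        self._locators: Dict[str, str] = {}
        self._attributes: Dict[str, Dict[str, str]] = {}

    def register(self, attributes: Dict[str, str], locator: str) -> str:
        """Assign a new GUID to an object described by its context attributes."""
        guid = str(uuid.uuid4())
        self._locators[guid] = locator
        self._attributes[guid] = dict(attributes)
        return guid

    def move(self, guid: str, new_locator: str) -> None:
        """Update where the object lives; its identity is unchanged."""
        self._locators[guid] = new_locator

    def resolve(self, guid: str) -> str:
        """Return the current locator for a GUID."""
        return self._locators[guid]

    def find(self, **context: str) -> List[str]:
        """Return GUIDs whose attributes match every given context value."""
        return sorted(
            guid for guid, attrs in self._attributes.items()
            if all(attrs.get(key) == value for key, value in context.items())
        )


if __name__ == "__main__":
    resolver = NameResolver()
    guid = resolver.register({"group": "sensors", "location": "Tokyo"}, "10.0.0.5")
    resolver.move(guid, "192.168.1.9")  # the object moved, the GUID did not
    print(resolver.resolve(guid), resolver.find(group="sensors") == [guid])
```

A consumer that holds only the GUID, or only a context query, keeps working after the data moves, which is the property that makes this style of naming attractive for large, distributed data sets.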

51 Optical Technologies: Fast Transport and Processing of Big Data
Integrated Optical Path and Optical Packets of the AKARI Architecture.
Silicon Nanophotonics Technology
- Integrates optical and electrical circuits on a single silicon chip, using a 90 nm CMOS fabrication line.
cf. IBM press release, Dec 10

52 Additional Issues that Future Network Architectures should Address:
Interface to Database
- Increasingly unstructured and heterogeneous
- Requires fast processing and transportation
The Database community and the Networking community should interact.
- No FIA project addresses database issues
Service Layer for Big Data applications

53 References
[Kobayashi et al. 2005] H. Kobayashi, Francois Dolivo, and E. Eleftheriou, "35 Years of Progress in Digital Magnetic Recording," 2005 Eduard Rhein Technology Award Lecture.
[IBM]
[UCB]
[McKinsey] "Big data: The next frontier for innovation, competition and productivity," McKinsey & Co., May 2011, and_innovation/big_data_the_next_frontier_for_innovation
[IBM] "IBM Lights Up Silicon Chips to Tackle Big Data," press release, Dec 10, 2012.

54 Appendix
Big Data across the Federal Government (4)
NITRD's Focus (2)
NSF-NIH Initiative (2)
McKinsey Global Institute's Report (2)
2012 Summer Olympic Games Big Numbers
Data Never Sleeps (Fortune Magazine, 7/2012)
Twitter 2012
Big Data for Healthcare

55 Big Data Across the Federal Government (March 29, 2012)
Department of Defense (DOD)
Defense Advanced Research Projects Agency (DARPA)
- Anomaly Detection at Multiple Scales (ADAMS) program
- Cyber-Insider Threat (CINDER) program
Department of Homeland Security (DHS)
- Center of Excellence on Visualization and Data Analytics
Department of Energy (DOE)
- Advanced Scientific Computing Research (ASCR)
- High Performance Storage System (HPSS)

56 Department of Veterans Affairs (VA)
- Consortium for Healthcare Informatics Research (CHIR)
- Corporate Data Warehouse (CDW)
- Genomic Information System for Integrated Science (GenISIS)
Department of Health and Human Services (HHS)
Centers for Disease Control and Prevention (CDC)
- BioSense 2.0 program
Centers for Medicare & Medicaid Services (CMS)
- A data warehouse based on Hadoop is being developed.
- Use of XML database technologies is being evaluated.
Food and Drug Administration (FDA)
- Virtual Laboratory Environment (VLE)
National Archives and Records Administration (NARA)
- Cyberinfrastructure for a Billion Electronic Records (CI-BER)

57 National Aeronautics and Space Administration (NASA)
- Earth Science Data and Information System (ESDIS)
- Global Earth Observation System of Systems (GEOSS)
- Planetary Data System (PDS)
- Multimission Archive at Space Telescope Science Institute (MAST)
National Endowment for the Humanities (NEH)
- Digging into Data Challenge
National Institutes of Health (NIH)
- The Cancer Imaging Archive (TCIA)
- Neuroimaging Informatics Tools and Resources Clearinghouse (NITRC)
- Neuroscience Information Framework (NIF)
- Structural Genomics Initiative
- Worldwide Protein Data Bank (wwPDB)
- Biomedical Informatics Research Network (BIRN)
- Collaborative Research in Computational Neuroscience (CRCNS)

58 National Science Foundation (NSF)
- Core Techniques and Technologies for Advancing Big Data Science & Engineering
- Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)
- Data and Software Preservation for Open Science (DASPOS)
- Computational and Data-enabled Science and Engineering (CDS&E) in Mathematical and Statistical Science (CDS&E-MSS)
- Open Science Grid (OSG)
- Theoretical and Computational Astrophysics Networks (TCAN)
National Security Agency (NSA)
- Vigilant Net: A Competition to Foster and Test Cyber Defense Situational Awareness at Scale
- NSA/CSS Commercial Solutions Center (NCSC)
United States Geological Survey (USGS)
- John Wesley Powell Center for Analysis and Synthesis

59 The NITRD Program's focus:
Big Data (BD)
Cyber Physical Systems (CPS)
Cyber Security and Information Assurance (CSIA)
Health Information Technology R&D (Health IT R&D)
Human Computer Interaction and Information Management (HCI&IM)

60 The NITRD Program's focus, cont'd:
High Confidence Software and Systems (HCSS)
High End Computing (HEC)
Large Scale Networking (LSN)
Software Design and Productivity (SDP)
Social, Economic, and Workforce Implications of IT and IT Workforce Development (SEW)
Wireless Spectrum Research and Development (WSRD)

61 NSF-NIH Big Data Initiative
Eight (8) fundamental research projects on Big Data were announced on October 3, 2012.
Typically, one to three investigators per project.
Total of $15 million, so about $500k/project
1. Eliminating the Data Ingestion Bottleneck in Big-Data Application, M. Farach-Colton (Rutgers) and M. Bender (Stony Brook)
2. DataBridge - A Sociometric System for Long-Tail Science Data Collection, A. Rajasekar (Univ. of N.C.), G. King (Harvard) and Justin Zhan (NC Agricultural & Technical State Univ.)
3. A Formal Foundation for Big Data Management, D. Suciu (Univ. of Washington)

62 NSF-NIH Big Data Initiative, continued
4. Analytical Approaches to Massive Data Computation with Applications to Genomics, E. Upfal (Brown)
5. Distribution-based Machine Learning for High-dimensional Datasets, A. Singh (CMU)
6. Genomes Galore - Core Techniques, Libraries, and Domain Specific Languages for High-Throughput DNA Sequencing, S. Aluru (Iowa State), O. Olukotun (Stanford) and W. Feng (Virginia Tech)
7. Big Tensor Mining: Theory, Scalable Algorithms and Applications, C. Faloutsos (CMU) and N. Sidiropoulos (U. of Minnesota)
8. Discovery and Social Analytics for Large-Scale Scientific Literature, P. Kantor, T. Joachims (Cornell) and D. Blei (Princeton)

65 Source: Big Data at London Summer Games 2012

66 Source: How much data is generated Every Minute

67 Source: Facts about Twitter

68 Source: Info Graphic Healthcare IT


More information

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

BIG DATA-AS-A-SERVICE

BIG DATA-AS-A-SERVICE White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers

More information

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014 Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014 Defining Big Not Just Massive Data Big data refers to data sets whose size is beyond the ability of typical database software tools

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

Data Refinery with Big Data Aspects

Data Refinery with Big Data Aspects International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data

More information

Big Data Across the Federal Government

Big Data Across the Federal Government Big Data Across the Federal Government March 29, 2012 Here are highlights of ongoing Federal programs that address the challenges of, and tap the opportunities afforded by, the big data revolution to advance

More information

Hadoop. http://hadoop.apache.org/ Sunday, November 25, 12

Hadoop. http://hadoop.apache.org/ Sunday, November 25, 12 Hadoop http://hadoop.apache.org/ What Is Apache Hadoop? The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using

More information

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem: Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:

More information
