BIG DATA. Value 8/14/2014 WHAT IS BIG DATA? THE 5 V'S OF BIG DATA WHAT IS BIG DATA?



Similar documents
Analytics Industry Trends Survey. Research conducted and written by:

Analytics A survey on analytic usage, trends, and future initiatives. Research conducted and written by:

IBM Big Data in Government

A New Era Of Analytic

The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg

Big Data Executive Survey

UNIFY YOUR (BIG) DATA

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

High-Performance Analytics

BIG DATA STRATEGY. Rama Kattunga Chair at American institute of Big Data Professionals. Building Big Data Strategy For Your Organization

Parallel Data Warehouse

COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY WITH PRACTICAL OUTCOMES

Big Data Effects on Weather and Climate

Extend your analytic capabilities with SAP Predictive Analysis

Solve your toughest challenges with data mining

An interdisciplinary model for analytics education

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Big Data and Healthcare Payers WHITE PAPER

BIG Data Analytics Move to Competitive Advantage

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

INVESTOR PRESENTATION. First Quarter 2014

Hexaware E-book on Predictive Analytics

Traditional Analytics and Beyond:

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

This Symposium brought to you by

HDP Enabling the Modern Data Architecture

redesigning the data landscape to deliver true business intelligence Your business technologists. Powering progress

Solve Your Toughest Challenges with Data Mining

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Disrupting The Market: Predictive Analytics As A Service

Using Big Data Analytics to

TOP 8 TRENDS FOR 2016 BIG DATA

Demonstration of SAP Predictive Analysis 1.0, consumption from SAP BI clients and best practices

CIO Roundtable - Big Data

Big Data in the Nordics 2012

Questionnaire about the skills necessary for people. working with Big Data in the Statistical Organisations

Architecting for the Internet of Things & Big Data

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

How To Learn To Use Big Data

2015 Ironside Group, Inc. 2

Airline Applications of Business Intelligence Systems

Internet of Things. Opportunity Challenges Solutions

Big Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich

Modernizing Your Data Warehouse for Hadoop

Modern Data Warehouse

Outline. What is Big data and where they come from? How we deal with Big data?

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Sunnie Chung. Cleveland State University

The Big Data Deluge: Creating Serious Business Problems. Analytics: Harnessing Big Data Deluge to Acquire Business Power

Big Data Integration: A Buyer's Guide

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

The 4 Pillars of Technosoft s Big Data Practice

VIEWPOINT. High Performance Analytics. Industry Context and Trends

ANALYTICS CENTER LEARNING PROGRAM

Armanino McKenna LLP Welcomes You To Today s Webinar:

Why include analytics as part of the School of Information Technology curriculum?

Cost-Effective Business Intelligence with Red Hat and Open Source

Big Analytics: A Next Generation Roadmap

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

The 3 questions to ask yourself about BIG DATA

Big Data Analytics in Facilities Management

Big Data and Trusted Information

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

Big Impacts from Big Data UNION SQUARE ADVISORS LLC

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

Transforming the Telecoms Business using Big Data and Analytics

Maximizing Return and Minimizing Cost with the Decision Management Systems

Data Search. Searching and Finding information in Unstructured and Structured Data Sources

EMC ADVERTISING ANALYTICS SERVICE FOR MEDIA & ENTERTAINMENT

Data Mining Solutions for the Business Environment

Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014

Oracle Big Data SQL Technical Update

SAS and Teradata Partnership

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

HDP Hadoop From concept to deployment.

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Introducing Oracle Exalytics In-Memory Machine

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

MapR: Best Solution for Customer Success

Big Data overview. Livio Ventura. SICS Software week, Sept Cloud and Big Data Day

Data Warehousing in the Age of Big Data

The Intersection of Big Data and Analytics. Philip Russom TDWI Research Director for Data Management May 5, 2011

Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate

How To Make Data Streaming A Real Time Intelligence

Transcription:

WHAT IS BIG DATA? BIG DATA DR. KLARA NELSON THE UNIVERSITY OF TAMPA "Volumes of data that are unusually large, or types of data that are unstructured" Thomas Davenport, Keeping Up with the Quants, 2013, p. 6 The emerging technologies and practices that enable the collection, processing, discovery, analysis, and storage of large volumes and disparate types of data, quickly and cost effectively. SAS Best Practices Team Definition http://tamaradull.com/2013/02/20/the-5-ws-what-is-big-data/ TBTLA PRESENTATION AUGUST 14, 2014 WHAT IS BIG DATA? Big data Traditional analytics Type of data Unstructured formats Formatted in rows and columns Volume of 100 TB to PB Tens of TB or less data Flow of data Constant flow of data Static pool of data Analysis methods Machine learning Hypothesis-based Primary purpose Data-based products Internal decision support and services Source: Thomas Davenport, Big Data @ Work, 2014, Table 1-1, p. 4 THE 5 V'S OF BIG DATA Volume Data size Velocity High-velocity capture, discovery, and/or analysis Value Variety Many different types Veracity Quality / Trustworthiness http://www-01.ibm.com/software/data/bigdata/ http://www- 05.ibm.com/fr/events/netezzaDM_2012/Solutions_Big_Data.pdf 1

TYPICAL DATA SET SIZE CUSTOMER TRANSACTIONS: #1 SOURCE OF LARGE DATA Rexer Analytics (2013), "2013 Data Miner Survey - Summary Report, p. 31. Rexer Analytics (2013), "2013 Data Miner Survey - Summary Report, p. 9. THE 5 V'S OF BIG DATA: VALUE Integrating V doing something valuable with the data, turning data into dollars Being able to translate massive amounts of data into real insights and realizing value from that insight Big Data at UPS to shave ONE MILE off each DRIVER's ROUTE a day would save the firm $50 MILLION a year. BIG DATA = BIG ROI Healthcare 20% decrease in patient mortality by analyzing streaming patient data Telco 92% decrease in processing time by analyzing networking and call data Utilities 99% improved accuracy in placing power generation resources by analyzing 2.8 petabytes of untapped data Healthcare, Telco, Utilities: http://www-01.ibm.com/software/data/bigdata/industry.html UPS: Christian Science Monitor, Aug 12, 2013, p. 32 THE 8 MOST IN-DEMAND BIG DATA ROLES Role Average Annual Salary ($) Visualization Tool Developers (Expert Level) 150,000 175,000 Hadoop Developers 150,000 175,000 Data Scientists 125,000 140,000 Information Architects 113,750 135,350 ETL Developers 110,000 130,000 Predictive Analytics Developers 103,700 129,000 Data Warehouse Appliance Specialist 97,950 123,600 OLAP Developers 97,900 115,550 http://www.computerworld.com/slideshow/detail/138836/the-8-most-in-demandbig-data-roles-#slide7, February 17, 2014 2

THE BIG DATA LANDSCAPE WHAT IS BIG DATA TECHNOLOGY? "Big data technology is capable of handling a lot of data. Big data handles data cheaply. Big data handles data in the form of unstructured strings of data. Big data does its searches independently. Big data is used to store and manage large amounts of data. That s what big data is." Bill Inmon http://blogs-images.forbes.com/davefeinleib/files/2014/06/big-data-landscape-jul-4-2012-00111.png Source: "Big Data Technology Does Not Replace a Data Warehouse", http://www.b-eye-network.com/view/16714, January 10, 2013 TECHNOLOGIES: DATA WAREHOUSE VS. BIG DATA Use the best tool for the job depending on the business requirements: Discovery of unexplored business questions Clean, consistent, high quality data Low latency, interactive reports, OLAP Raw unstructured data Analysis of preliminary data WHICH DATA MINING/ ANALYTIC TOOLS ARE USED? The average data miner reports using 5 tools, but conducts 76% of their work in their primary tool. Source: http://tamaradull.com/2013/03/20/the-5-ws-when-should-we-use-big-data-vs-data-warehousingtechnologies/ Rexer Analytics (2013), "2013 Data Miner Survey - Summary Report, p. 31. 3

PREPARING STUDENTS TO WORK WITH BIG DATA Analytics courses ITM 466 Business Intelligence and Analytics (Elective) ITM 615 Business Analytics (MBA Decision Analysis Elective) Course topics Assessing analytics competencies of organizations (e.g., Davenport's DELTA) Analytical thinking stages Ethics of analytics / big data Data quality Data warehouses & other technologies Data mining methods TECHNOLOGIES USED IN THE BUSINESS ANALYTICS COURSES SAP Business Objects Microsoft Excel Tableau Software SQL Server Data Tools for building analysis databases and data mining IBM SPSS Statistics Suite for research and analysis IBM SPSS Modeler for predicting future behavior (data mining) IBM SPSS Text Analytics for mining unstructured data sources IBM Digital Analytics (formerly Coremetrics Web Analytics) DATA MINING ALGORITHMS DATA MINERS & ITM 466/615 STUDENTS ARE USING denotes algorithms covered hands-on in ITM 466/615 Rexer Analytics (2013), "2013 Data Miner Survey - Summary Report, p. 36. THE CHALLENGES OF BIG DATA & BIG DATA ANALYTICS Delivering Value "Through 2015, 85% of Fortune 500 organizations will be unable to exploit big data for competitive advantage." (Gartner) Data Silos Quality Storage Enterprise strategy Talent Lack of IT/technical skills Lack of domain knowledge Lack of analytical thinking skills Organizational culture Technologies and tools Big data as IT-driven projects Gartner quote: http://www.gartner.com/technology/topics/big-data.jsp 4

THE CHALLENGES OF BIG DATA AND BIG DATA ANALYTICS Ethics "A code of conduct to refer to in judging what is right and what is wrong" regarding the ways we gather data and use data and guide individual and organizational conduct through use of data and Frank Buytendijk quotes on Analytics and Ethics from the TDWI Las Vegas 2012 World Conference "Are there things you shouldn't do?" "It seems like we are doing things because we can." "The key thing is that technology is answering questions that weren't even asked." "Tools are creating ethical issues, and we don't even have the mechanism to do something about it." THANK YOU! 5