CSC590: Selected Topics BIG DATA & DATA MINING. Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait



Similar documents
Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Statistical Challenges with Big Data in Management Science

WHAT IS BIG DATA? David Bechtold

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

Now, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith /

BIG DATA FUNDAMENTALS

Exploiting Data at Rest and Data in Motion with a Big Data Platform

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP

Introduction to Engineering Using Robotics Experiments Lecture 17 Big Data

Are You Ready for Big Data?

Industry Impact of Big Data in the Cloud: An IBM Perspective

How To Understand The Benefits Of Big Data

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

BIG DATA I N B A N K I N G

How Big Is Big Data Adoption? Survey Results. Survey Results Big Data Company Strategy... 6

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science

Big Data Introduction, Importance and Current Perspective of Challenges

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

BIG DATA. - How big data transforms our world. Kim Escherich Executive Innovation Architect, IBM Global Business Services

Are You Ready for Big Data?

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI

A New Era Of Analytic

Turning Big Data into Big Decisions Delivering on the High Demand for Data

Big Data / FDAAWARE. Rafi Maslaton President, cresults the maker of Smart-QC/QA/QD & FDAAWARE 30-SEP-2015

We are Big Data A Sonian Whitepaper

Problems to store, transfer and process the Big Data 6/2/2016 GIANG TRAN - TTTGIANG2510@GMAIL.COM 1

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 12

A Survey on Big Data Concepts and Tools

BUY BIG DATA IN RETAIL

Big Data. Fast Forward. Putting data to productive use

Beyond Watson: The Business Implications of Big Data

Deploying Big Data to the Cloud: Roadmap for Success

Modern (Computational) Approaches to Big Data Analytics. CSC 576 Computer Science, University of Rochester Instructor: Ji Liu

Big Data Spatial Analytics An Introduction

BIG DATA & ANALYTICS

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Analyzing Big Data: The Path to Competitive Advantage

IJRCS - International Journal of Research in Computer Science ISSN:

Big Data-Challenges and Opportunities

Introduction to Analytics and Big Data - Hadoop. Rob Peglar EMC Isilon

Big Data, Data Analytics and Actuaries. Adam Driussi, Quantium

The Next Wave of Data Management. Is Big Data The New Normal?

How To Get More Data From Your Computer

Big Data Analytics: 14 November 2013

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

What happens when Big Data and Master Data come together?

Exploiting the power of Big Data

How To Use Big Data Effectively

The New World of Data. Don Strickland President, Strickland & Associates

Big Data & Intelligence Driven Security. EMELIA Yamson My ewyamson@nosmay.com

Doing Multidisciplinary Research in Data Science

Big Data, Official Statistics and Social Science Research: Emerging Data Challenges

A Strategic Approach to Unlock the Opportunities from Big Data

Big Analytics: A Next Generation Roadmap

Big Data Er Big Data bare en døgnflue? Lasse Bache-Mathiesen CTO BIM Norway

Big Data Mining: Challenges and Opportunities to Forecast Future Scenario

Information Optimization

Big Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich

Introduction to the Mathematics of Big Data. Philippe B. Laval

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Keyword: YARN, HDFS, RAM

Big Data Strategy. Use Case Study. Amy O Connor // Field Sales Evangelist

The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg

Game On: How Information is Changing the Rules of Insurance

Changing the face of Business Intelligence & Information Management

SUCCESSES AND CHALLENGES DEVELOPING AN ENTERPRISE CLINICAL ANALYTICS SOLUTION Ambulatory Information Management (AIM)

Big data The three-minute guide

Data Analytics in Organisations and Business

Discover How a 360-Degree View of the Customer Boosts Productivity and Profits. eguide

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Introduction to Predictive Analytics. Dr. Ronen Meiri

BIRT in the World of Big Data

Big Data in Telco & Banking Analytics. Benjamin Sznajder IBM Research Haifa

BIG DATA TECHNOLOGY. Hadoop Ecosystem

DEVELOP INSIGHT DRIVEN CUSTOMER EXPERIENCES USING BIG DATA AND ADAVANCED ANALYTICS

Perspectives on Big Data and Big Data Analytics

Two Recent LE Use Cases

Ali Eghlima Ph.D Director of Bioinformatics. A Bioinformatics Research & Consulting Group

Big Data Solutions. Portal Development with MongoDB and Liferay. Solutions

Annex: Concept Note. Big Data for Policy, Development and Official Statistics New York, 22 February 2013

Transcription:

CSC590: Selected Topics BIG DATA & DATA MINING Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait

Agenda Introduction What is Big Data Why Big Data? Characteristics of Big Data Applications of Big Data Problems of Big Data

Introduction March, 2012, the Obama Administration announced $200 million in R&D investments for Big Data. http://www.cccblog.org/20 12/03/29/obama- administration-unveils- 200m-big-data-rd-initiative/ 640K ought to be enough for anybody.

What is Big Data? Big Data usually includes data sets with sizes beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time "Big Data" is what happens when the cost of storing data falls below the cost of deciding to throw it away. -- George Dyson

What is Big Data? Every day, we create 2.5 quintillion bytes of data so much that 90% of the data in the world today has been created in the last two years alone. This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals to name a few. This data is big data.

What is Big Data? There are huge volumes of data in the world: From the beginning of recorded time until 2003,We created 5 billion gigabytes (exabytes) of data. In 2011, the same amount was created every two days In 2013, the same amount of data is created every 10 minutes.

Why Big Data? Why now? What enabled Big Data? Storage became cheaper Processing power got Faster AND cheaper Data became available What does that mean? It is easy to get data.. But you can t decide whether you want to throw it away or not!?!

Characteristics of Big Data WWW VVV Volume Velocity Variety Veracity?

Volume Enterprises are awash with ever-growing data of all types, easily amassing terabytes even petabytes of information. Turn 12 terabytes of Tweets created each day into improved product sentiment analysis Convert 350 billion annual meter readings to better predict power consumption

Volume Cont d

Volume Cont d Twitter generates more than 7 Terabytes (TB) a day; Facebook more than 10 TBs, and some enterprises already store data in the petabyte range.

Velocity Sometimes 2 minutes is too late. For timesensitive processes such as catching fraud, big data must be used as it streams into your enterprise in order to maximize its value. Scrutinize 5 million trade events created each day to identify potential fraud Analyze 500 million daily call detail records in realtime to predict customer churn faster TPS Tweets Per Second Record high of 150,000 TPS

Variety Big data is any type of data - structured and unstructured data such as text, sensor data, audio, video, click streams, log files and more. New insights are found when analyzing these data types together. Monitor 100 s of live video feeds from surveillance cameras to target points of interest Exploit the 80% data growth in images, video and documents to improve customer satisfaction

Variety Cont d Web data, e-commerce purchases at department/ grocery stores Bank/Credit Card transactions Social Network

Veracity Amazon: Book recommendations People who bought this, also bought.. IMDB Similar movies Google Did you mean? Translations.. Twitter & Facebook Do you know this person? Wallmart Here is a 10% coupon to

Applications of Big Data Government Data.gov Health 7 big data solutions for Healthcare Education www.coursera.org Banking Loans, Stock market..etc. Commercial Amazon, Wallmart etc!

Problems Data Silos collections of data that have grown across the organization or within specific departments which are not connected in a cohesive plan.

Problems Cont d Experts & Technology Still there are not enough Data Scientists to accommodate for the demand Technology is not yet mature to help obtain the wisdom needs

Problems Cont d Privacy You are the product! Changing Infrastructure