Big Data. What is Big Data? Over the past years. Big Data. Big Data: Introduction and Applications



Similar documents
Chapter 6: Big Data and Analytics

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

How Big Is Big Data Adoption? Survey Results. Survey Results Big Data Company Strategy... 6

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Hadoop Market - Global Industry Analysis, Size, Share, Growth, Trends, and Forecast,

Big Data Explained. An introduction to Big Data Science.

Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada

Big Data in Retail Big Data Analytics Central to Customer Acquisition and Retention Strategies in Retail

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

BIG DATA: IT MAY BE BIG BUT IS IT SMART?

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014

BIG DATA FUNDAMENTALS

International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May ISSN BIG DATA: A New Technology

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

Big Data a threat or a chance?

How To Understand The Business Case For Big Data

Data Centric Computing Revisited

Software Engineering for Big Data. CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

Mind Commerce. Commerce Publishing v3122/ Publisher Sample

How To Use Big Data Effectively

BIRT in the World of Big Data

What happens when Big Data and Master Data come together?

Big Data Technologies Compared June 2014

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Text Analytics Beginner s Guide. Extracting Meaning from Unstructured Data

Business Analytics Research and Teaching Perspectives

What s Trending in Analytics for the Consumer Packaged Goods Industry?

Are You Ready for Big Data?

Big Analytics: A Next Generation Roadmap

OnX Big Data Reference Architecture

Sources: Summary Data is exploding in volume, variety and velocity timely

How To Make Sense Of Data With Altilia

Mohan Sawhney Robert R. McCormick Tribune Foundation Clinical Professor of Technology Kellogg School of Management

Are You Ready for Big Data?

Big Data Processing: Past, Present and Future

Mind Commerce. Commerce Publishing v3122/ Publisher Sample

Hadoop Market - Global Industry Analysis, Size, Share, Growth, Trends, And Forecast,

Data Refinery with Big Data Aspects

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Big Data Challenges and Success Factors. Deloitte Analytics Your data, inside out

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

Transforming the Telecoms Business using Big Data and Analytics

CSC590: Selected Topics BIG DATA & DATA MINING. Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait

Big Data Er Big Data bare en døgnflue? Lasse Bache-Mathiesen CTO BIM Norway

Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Big Data Use Cases Update

A New Era Of Analytic

Mind Commerce. Commerce Publishing v3122/ Publisher Sample

Social Media Influencer Survey 2014

BIG DATA USING HADOOP

Customized Report- Big Data

Annex: Concept Note. Big Data for Policy, Development and Official Statistics New York, 22 February 2013

Big Data in Telecom value chain. Presented by: Gurjot S Sandhu Director Sales Xalted Information Systems Pvt. Ltd.

WHAT IS BIG DATA? David Bechtold

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Big Data / FDAAWARE. Rafi Maslaton President, cresults the maker of Smart-QC/QA/QD & FDAAWARE 30-SEP-2015

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

Solve your toughest challenges with data mining

Big Data for Marketing:

3 rd Asia Pacific Pharmaceutical Compliance Congress And Best Practices Forum Sept , Kuala Lumpur, Malaysia

The Big Data Market: Business Case, Market Analysis & Forecasts

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

A U T H O R S : G a n e s h S r i n i v a s a n a n d S a n d e e p W a g h Social Media Analytics

Big Data Comes of Age: Shifting to a Real-time Data Platform

IJRCS - International Journal of Research in Computer Science ISSN:

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

At a recent industry conference, global

Big Data: Are You Ready? Kevin Lancaster

Microsoft Big Data. Solution Brief

Beyond listening Driving better decisions with business intelligence from social sources

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Pascal Clement Head of Travel Intelligence June 2013

Offload Enterprise Data Warehouse (EDW) to Big Data Lake. Ample White Paper

Il mondo dei DB Cambia : Tecnologie e opportunita`

TABLE OF CONTENTS 1 Chapter 1: Introduction 2 Chapter 2: Big Data Technology & Business Case 3 Chapter 3: Key Investment Sectors for Big Data

Getting Started Practical Input For Your Roadmap

Big Data Big Deal? Salford Systems

OCR LEVEL 2 CAMBRIDGE TECHNICAL

Big Data. Fast Forward. Putting data to productive use

COULD VS. SHOULD: BALANCING BIG DATA AND ANALYTICS TECHNOLOGY WITH PRACTICAL OUTCOMES

How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW

TUT NoSQL Seminar (Oracle) Big Data

Big Data Are You Ready? Thomas Kyte

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

Transcription:

Big Data Big Data: Introduction and Applications August 20, 2015 HKU-HKJC ExCEL3 Seminar Michael Chau, Associate Professor School of Business, The University of Hong Kong Ample opportunities for business organizations and governments to provide better services and gain managerial and strategic insights by gathering, cleaning, and analyzing these Big Data. 4 Over the past years What is Big Data? Data storage has grown exponentially It is not just big! Four Vs Computation capacity has risen sharply Network bandwidth has increased greatly Volume Velocity Variety Veracity 2 5 Big Data Massive amount data are being generated at an unprecedented speed from various sources: online transactions mobile applications sensors images, audio, video social media including blogs, weibos, facebook, and forums 3 6 Source: IBM

The Data Size Is Getting Big, Bigger Hadron Collider - 1 PB/sec Boeing jet - 20 TB/hr Facebook - 500 TB/day. YouTube 1 TB/4 min. The proposed Square Kilometer Array telescope (the world s proposed biggest telescope) 1 EB/day Names for Big Data Sizes 7 10 Source: IBM Google Analytics 8 Source: IBM 11 Mobile Devices 9 Source: IBM 12

Social Media Big Data Investment by Industry An important component of Big Data Social networking sites Blogs, microblogs Online reviews, discussion forums Online news aggregator People reveal themselves on social media Demographics, preferences, habits, family ties, social ties Images, videos Unstructured but rich in content Increasingly used in marketing research 13 16 McKinsey estimates. Big Data Investment by Region $300 billion potential annual value to US health care 250 billion potential annual value to Europe s public sector administration $600 billion potential annual consumer surplus from using personal location data globally 60% potential increase in retailers operating margins possible with big data 14 17 How governments see big data? Source: Accenture Industrial Internet Insights Report For 2015 USA: The White House Big Data R&D Initiative was launched in 2012 Extract knowledge and insights from large and complex collections of digital data UK: Formed the Alan Turing Institute for big data research South Korea: The Big Data Initiative was launched in 2011 15 18

How governments see big data? United Nations Global Pulse project Analyze social media and big data for sustainable development and humanitarian action 19 Big Data Considerations You can t process the amount of data that you want to because of the limitations of your current platform. You can t include new/contemporary data sources (e.g., social media, RFID, Sensory, Web, GPS, textual data) because it does not comply with the data schema. You need to (or want to) integrate data as quickly as possible to be current on your analysis. You want to work with a schema-on-demand data storage paradigm because the variety of data types. The data is arriving so fast at your organization s doorstep that your analytics platform cannot handle it. 22 Source: The Storage Alchemist20 Critical Success Factors for Big Data Analytics A clear business need (alignment with the vision and the strategy) Strong, committed sponsorship (executive champion) Alignment between the business and IT strategy A fact-based decision-making culture A strong data infrastructure The right analytics tools Right people with right skills 23 Fundamentals of Big Data Analytics Big Data by itself, regardless of the size, type, or speed, is worthless Big Data + big analytics = value (the 5 th V) With the value proposition, Big Data also brought about big challenges Effectively and efficiently capturing, storing, and analyzing Big Data New breed of technologies needed (developed or purchased or hired or outsourced ) Critical Success Factors for Big Data Analytics 21 24

Models and Technologies Traditional Data mining, data warehouse Big Data Analytics Unstructured data from multiple sources Real-time analytics Complex statistical analysis Distributed computing 25 Big Data Vendors Big Data vendor landscape is developing very rapidly A representative list would include Cloudera - cloudera.com Software, MapR mapr.com Hardware, Hortonworks - hortonworks.com Service, Also, IBM (Netezza, InfoSphere), Oracle (Exadata, Exalogic), Microsoft, Amazon, Google, 28 Models and Technologies Social media analytics Text and sentiment analysis Social network analysis Predictive modeling Statistical method Artificial intelligence/data mining Visualization Top 10 Big Data Vendors with Primary Focus on Hadoop $70 $60 $50 $40 $30 $20 $10 $0 26 29 27 30 15

Example: Amazon Example: Hewlett-Packard Predicts what customers will buy and manage their inventory Analyzes data of 330,000 employees to predict who have a high risk of leaving the job Customers who bought this item, also bought these Results in an estimated saving of $300 million Anticipatory Shipping: shipping an item to a customer in anticipation that this customer will order that product will it work? 31 34 Example: Google Flu Trend Example: Germany Soccer Google Flu Trend: Predicts influenza breakout based on the occurrences of relevant search terms in search data Match Insights: Collects and analyzes massive amounts of player performance data (including video data) Successfully predicts regional outbreaks of flu up to 10 days before they were reported by the CDC (Centers for Disease Control and Prevention). 32 35 Example: Macy s Analyzes a vast amount of customer data ranging from visit frequencies and sales to style preferences and personal motivations. Adjusts pricing in near-real time for millions of items, based on demand and inventory Sends targeted and customized direct mailings to customers Example: US Law Enforcement Challenges Isolated databases Lack of analytics capability COPLINK system Implemented in many police departments in the US Database linking Association mining 33 36

37 40 Social Media Analytics Social Media Social interactions among people People freely reveal their feelings Highly dynamic 38 41 Example: US Law Enforcement Different Types of Social Media Predictive policing Predict time and location with high-probability of criminal activities Prevent crimes before they happen Crime mapping RAIDS Online Connects law enforcement with the community to reduce crime and improve public safety Collaborative projects (e.g., Wikipedia, Wiktionary) Blogs and microblogs (e.g., Twitter, Weibo) Content communities (e.g., YouTube) Social networking sites (e.g., Facebook) Virtual game worlds (e.g., World of Warcraft) Virtual social worlds (e.g., Second Life) 39 42

2015 Facebook usage Social Media Analytics 43 46 Social Media Analytics Tools and Vendors 44 47 Social Media Analytics 45 48

Application Case: HK Online Public Opinion Case Study: National education debate Controversy on the implementation of national education in secondary and primary schools Study period lasted from January 1, 2012 to May, 31, 2012. 49 52 Social Network Analysis Social Network - social structure composed of individuals linked to each other Analysis of social dynamics Identify Opinion leaders Bridges Clusters and social circles 50 53 Application Case: HK Online Public Opinion Challenges Online platforms become a important venue to understand public opinions Too much information for analysis and monitoring Data collection from major Hong Kong based online opinion platforms Discussion forums Uwants, Discuss HK, Golden, HK Reporter Twitter Sina weibo Blogs Facebook pages, groups, and events 51 54

55 58 Prediction of public sentiment 56 59 Application Case: Singapore Social Media Analytics Analyze and visualize social media http://research.larc.smu.edu.sg/palanteer/ http://research.larc.smu.edu.sg/palanteert/ 57 60

61 64 So, what big data can do? Reduce cost Improve services Improve design and planning 62 65 Questions and Discussions 63