Introduction to Predictive Analytics. Dr. Ronen Meiri ronen@dmway.com

Similar documents
Introduction to the Mathematics of Big Data. Philippe B. Laval

Doing Multidisciplinary Research in Data Science

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP

Age of Big data. Presented by: Mohammad Iqbal BCM -2014

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Applications for Business Intelligence, Predictive Analytics and Big Data

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013

A Survey on Big Data Concepts and Tools

Peter Rakers De Verstoring van Big Data

BIG DATA CHALLENGES AND PERSPECTIVES

So What s the Big Deal?

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

BIG DATA What it is and how to use?

Oracle Big Data for Dummies

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science

Intro to Big Data and Business Intelligence

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI

Changing the face of Business Intelligence & Information Management

Big Data a threat or a chance?

How To Use Big Data In Healthcare

Digital Earth: Big Data, Heritage and Social Science

Big Analytics: A Next Generation Roadmap

Oracle Big Data for Dummies

Big Data Explained. An introduction to Big Data Science.

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS

Real Time Big Data Processing

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Algorithms and Methods for Distributed Storage Networks 7 File Systems Christian Schindelhauer

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis. Opportunities in Power Distribution Network System Optimization (from EDA Perspective)

BIG DATA TRENDS AND TECHNOLOGIES

Big Data The next big thing

Predictive Analytics

UNDERSTANDING THE BIG DATA PROBLEMS AND THEIR SOLUTIONS USING HADOOP AND MAP-REDUCE

Big Data. Sonovate QuickView Series #3

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

Texas Digital Government Summit. Data Analysis Structured vs. Unstructured Data. Presented By: Dave Larson

Big Data: Public Sector Opportunities, Challenges, and Implications

Big Data. Lyle Ungar, University of Pennsylvania

CSC590: Selected Topics BIG DATA & DATA MINING. Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait

Keywords Big Data Analytic Tools, Data Mining, Hadoop and MapReduce, HBase and Hive tools, User-Friendly tools.

The 3 questions to ask yourself about BIG DATA

2015 Analyst and Advisor Summit. Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist

Big Data Streams. Analytics Challenges, Analysis, and Applications. Adel M. Alimi

Using Ultra-Large Data Sets in Healthcare New Questions-New Answers

Moving From Hadoop to Spark

WHAT IS BIG DATA? David Bechtold

THE AGE OF BIG DATA. Chula DataScience

Sunnie Chung. Cleveland State University

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

How To Handle Big Data With A Data Scientist

Copyright 2014, Neudesic. All rights reserved.

Hadoop Big Data for Processing Data and Performing Workload

Now, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith /

SECURITY MEETS BIG DATA. Achieve Effectiveness And Efficiency. Copyright 2012 EMC Corporation. All rights reserved.

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Pervasive Location Analytics and A Billion Dollar Opportunity. Jitender Aswani, Portfolio Strategist, Business Analytics, SAP

Analytical Tools: What Auditors Need to Know About Big Data

The HP IT Transformation Story

Big Data Analytics Nokia

Data Warehouse design

Predictive Analytics Certificate Program

A New Era Of Analytic

Chapter 1. Contrasting traditional and visual analytics approaches

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012

Taming the Beast of Big Data

Advanced Big Data Analytics with R and Hadoop

Big Data and Marketing

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends

Statistics for BIG data

Analytics Data Discovery QlikView

Big Data: Study in Structured and Unstructured Data

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

Big data and its transformational effects

What Is Big Data? Craig C. Douglas University of Wyoming

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Transcription:

Introduction to Predictive Analytics Dr. Ronen Meiri

Outline From big data to predictive analytics Predictive Analytics vs. BI Intelligent platforms What can we do with it. The modeling process. Example Life time value. How DMWay makes predictive analytics easy.

The digital revolution Printing revolution (Gutenberg's press ~ 1450) Scientific revolution (~1550), mechanics, medicine, chemistry, optics, electricity Industrial revolution (~1800), textile, chemicals, agriculture, transportation, muss production Digital revolution (~1950) Accounting, a man on the moon, Signal processing, information retrieval, wearable Computing

Albert Einstein Computers are incredibly fast, accurate and stupid; Humans are incredibly slow, inaccurate and brilliant; Together they are powerful beyond imagination.

Big Data is all over. ~4 Zetta bytes of data (2013) http://en.wikipedia.org/wiki/zettabyte Major players: Facebook - 1,150 million users Gmail 425 million users Skype 300 million users Tweeter 500 million users (M200 active) WhatsApp 300+ million users Youtube 1,000 million users (4 billion views a day) Instagram - 150 million users Many others - Google, Waze, Amazon, Ebay, Paypal, Value Symbol Name 1000 kb kilobyte 1000 2 MB megabyte 1000 3 GB gigabyte 1000 4 TB terabyte 1000 5 PB petabyte 1000 6 EB exabyte 1000 7 ZB zettabyte 1000 8 YB yottabyte Sources: http://www.calcalist.co.il/local/articles/1,7340,l-3602417,00.html (Calcalist, May 2013) http://expandedramblings.com/index.php/resource-how-many-people-use-the-top-social-media/ September 15, 2013

Emphasis So Far How to store and manage big data? Should one host the data on premises or on the cloud? Multiple technologies: Hadoop Hbase MongoBD nosql databases Others How to gain benefits from big data?

From big data to data science Data is a strategic asset (competitive advantage) Extract the value buried in the data for decision making Hottest buzzword these days is Data Science NY Times declared data science as the sexiest job of this century The MCkinsey group estimates a shortage of 150,000-190,000 data scientists by the end of 2018 http://www.mckinsey.com/insights/business_technology/big_data_the_next_frontier_for_innovation

Data Science - Venn diagram

Business Analytics Business Analytics Descriptive Analytics Predictive Analytics Prescriptive Analytics

Descriptive Analytics (BI) Good for reporting Interactive analytics Measuring the business performance (KPI) Ad-hoc reporting Works well with huge amount of data (Big data). Relatively easy to use Value?

Prescriptive Analytics Proactive - Cannot do without it Maximize business performance Combines business rules with modeling (descriptive, predictive) to drive actions

Predictive Analytics Looks on past events to predict future outcomes (targeting, churn ) Complex modeling techniques (statistics, math, ML, computer science ) Proactive Value

Predictive Analytics vs. Descriptive (Gartner)

Predictive Analytics vs. BI Sorry business intelligence gurus, but BI is no longer good enough business intelligence reports and dashboards describe what has already happened they are not proactive Ian A. Bertram, in Gartner Business Intelligence & Analytics Summit March 18, 2013 http://data-informed.com/gartner-researchers-predictiveanalytics-to-gain-traction-in-business/

Intelligent Platform Cycle Collect Data Analyze Evaluate Deploy

PA what can we do with it? Prediction Estimate the Life Time Value of a new customer Estimate the expected losses or the number of claims in insurance policy. Expected deposits Classification Who is likely to churn. Who is likely respond to an offer Who is likely to default on a loan in the next period of time

PA what can we do with it? Forecasting Stock price Forecast KPI (expected sales in next month, quarter, year ) Seasonality Collaborative Analytics (Wisdom of the crowd) What product to offer to what user

Modeling Process Model the business problem Data/ETL Analyze Deploy evaluate

Modeling the business problem What is the business problem What is a churn? What is the definition of the LTV How to define success in a process? Relevant and actionable Churn for example What are the data component User activities? How the solution can be integrated within the organization s operational system Batch process Near real time

Data Modeling Map the data sources Map the relations between the data sources Create the analysis dataset (structure the data in 2D)

Analyze Relevant modeling algorithms (linear regression, trees, logistic regressions ) Transformations Feature selection Validate

Deployment Write pseudo code Code the predictive model in SQL, Java,. Imbed the code in the operational systems Add business rules (Prescriptive Analytics).

Evaluate Collect performance data Measure the model performance Recalibrate/Update model Set alerts when model performance degrade