Big Data and Big Analy-cs Trends: The Promise and the Hype. Gregory Piatetsky KDnuggets

Size: px
Start display at page:

Download "Big Data and Big Analy-cs Trends: The Promise and the Hype. Gregory Piatetsky KDnuggets"

Transcription

1 Big Data and Big Analy-cs Trends: The Promise and the Hype Gregory Piatetsky KDnuggets KDnuggets

2 My Data PhD in applying Machine Learning to databases Researcher at GTE Labs started the first project on Knowledge Discovery in Databases in 1989 Organized first 3 Knowledge Discovery and Data Mining (KDD) workshops ( ), cofounded Knowledge Discovery and Data Mining (KDD) conferences (1995) Chief ScienSst at 2 analyscs startups Co- founder SIGKDD (1998), Chair, AnalyScs/Data Mining Consultant, KDnuggets

3 KDnuggets Stands for Knowledge Discovery Nuggets started KDnuggets News newsleyer (~ 12,000 subscribers now) early website in 1994, in 1997, blog- style in best year: 50,000 unique visitors/month twiyer.com/kdnuggets ~6,000 followers facebook.com/kdnuggets page group: KDnuggets AnalyScs & Data Mining KDnuggets

4 What do we call what we do? StaSsScs Data mining Knowledge Discovery in Data (KDD) PredicSve AnalyScs Data Science Big AnalyScs? Core Idea: Finding Useful PaBerns in Data KDnuggets

5 Pre- history ( ): StaSsScs sta-s-cs is the biggest term in 20 th century, Analy-cs is used increasingly thru 20 th century data mining appears in late 1990s From Google Ngram viewer English language books Search case sensisve used most popular version. Other languages, especially Chinese, need to be considered for full picture KDnuggets

6 20 th Century AnalyScs vs Data Mining data mining AnalyScs Data Mining analy-cs?? Google N- grams search is case sensisve; Note: data mining > Data Mining usage While analyscs < AnalyScs KDnuggets

7 Data Mining Surges in 1996 data mining AnalyScs analy-cs Data Mining KDD- 95, 1 st Conference on Knowledge Discovery and Data Mining, Montreal Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, 1996, Eds: U. Fayyad, G. Piatetsky- Shapiro, P. Smyth, and R. Uthurusamy KDnuggets

8 Recent History: data mining AnalyScs analy-cs Knowledge Discovery analy-cs has been used since 1980, but started to rise in 2005 data mining surges around 1996 (soon amer first KDD conference) but slowly declines amer 2003 (TIA controversy, associated with Govt invasion of privacy). Knowledge Discovery appears in 1989, jumps in 1996, and plateaus amer 2000 (Google N- grams, smoothing =1) KDnuggets

9 Earliest use of data mining 1962? Amer eliminasng many following data. Mining cost is examples which refer to Mining of minerals, and books from 1958 that have a CD ayached (errors in book year) The earliest data mining reference I found is Source: Google Books (c) KDnuggets

10 Google Trends: Amer 2006, AnalyScs > Data Mining Global all regions (c) KDnuggets

11 >50% of AnalyScs searches are for Google AnalyScs Google AnalyScs introduced, Dec 2005 (c) KDnuggets

12 Google Trends observasons (as of Sep 2012) Decline in analyscs in 2012? Compe9ng on Analy9cs book, Apr 2007 December vacason drops (c) KDnuggets 2012

13 Global View: searches for data mining, analyscs - google Google Insights (c) KDnuggets

14 AnalyScs: Business > Data> PredicSve > Text Google Insights, Jan Sep 2012, Global (c) KDnuggets

15 Data Mining >> Business/Data/PredicSve AnalyScs Google Insights, Jan Sep 2012, Global (c) KDnuggets

16 Data Mining > Big Data >> PredicSve AnalyScs > Data Science Big Data Surge Google Insights, Jan Sep 2012, Global (c) KDnuggets

17 What will replace Big Data buzzword? Poll: will- replace- big- data.html KDnuggets

18 History StaSsScs 1960s Data Mining = bad ac9vity, data dredging Data Mining is good, surges in Data Mining plateaus (bad, invasion of privacy?) Google AnalyScs Business/Data/PredicSve AnalyScs Big Data ?? KDnuggets

19 AnalyScs, Big Data, Data Mining Today KDnuggets Polls Findings (c) KDnuggets

20 KDnuggets

21 Where did you apply Analy-cs/Data Mining? Avg. Number of Industries 2.8 Most Popular: - CRM - Banking - Health Care - EducaSon - Fraud DetecSon Highest growth in: Travel / Hospitality Social Networks EducaSon Biotech/Genomics Credit Scoring applied- anayscs- data- mining.html KDnuggets

22 Data Types Analyzed/Mined Most popular: - Table data - Time series - Text - - itemsets/transacsons Most growing: - XML data - text (free- form) - social network data - JSON types- analyzed- data- mined.html KDnuggets

23 Largest Dataset Analyzed? Big Data Miners elite group 2012 median dataset size ~20-40 GB, vs GB in dataset- analyzed- data- mined.html KDnuggets

24 Largest Dataset Analyzed by Region Big Data Miners: TeraBytes and Petabytes 18-24% KDnuggets

25 Which methods/algorithms did you use for data analysis Most popular: - Decision Trees - Regression - Clustering - StaSsScs - VisualizaSon analyscs- data- mining.html (c) KDnuggets

26 Algorithms with highest Industry Affinity Industry Affinity = How much this algorithm is more used among industry data miners = analyscs- data- mining.html (c) KDnuggets

27 Academic algorithms lowest Industry affinity analyscs- data- mining.html (c) KDnuggets

28 Cloud Analy-cs is not common (yet) Big data tools use grew 5- fold, from about 3% in 2011 to about 15% of respondents in 2012 analyscs somware poll (c) KDnuggets

29 JOBS AND SKILLS (c) KDnuggets

30 Shortage of Skills McKinsey: shortage by 2018 in the US of ,000 people with deep analyscal skills 1.5 M managers/analysts with the know- how to use the analysis of big data to make effecsve decisions. Source: big_data/ (c) KDnuggets

31 Indeed.com fastest growing jobs Top 10 skills: HTML5 MongoDB ios Android Mobile app Puppet Hadoop jquery PaaS Social Media Hadoop MongoDB KDnuggets

32 Big Data grows faster than MongoDB Hadoop Big Data MongoDB KDnuggets

33 Data Mining >> Hadoop (c) KDnuggets

34 Demand for Data Scien-sts surging Data ScienSst Fastest growing term on 1% of jobs in % of jobs in % of jobs 2012, Jan- Sep Data ScienSst sexiest job of the 21 st Century (???) say Thomas H. Davenport and D.J. PaSl, (HBR, Oct 2010) KDnuggets

35 Rebranding from Data Mining to Big Data Data Mining Big Data Data Scien-st Data mining jobs are much more common, but Big Data jobs are surging much faster than Data ScienSst (c) KDnuggets

36 LinkedIn Skills: Data Mining ~ 105,000 members with Data Mining skill (Sep 2012) Dwarfs related skills - 1% growth (according to LinkedIn) But in Oct 2011 there were 75K members with Data Mining Skill, which gives 40% annual growth (c) KDnuggets

37 Cloud (Big Data) AnalyScs Skills (c) KDnuggets

38 LinkedIn AnalyScs/Data Mining Skills Ground analyscs skills most common Cloud analyscs skills grow fastest Text AnalyScs skills less common SenSment Analysis fastest growing (c) KDnuggets

39 LinkedIn Analy-c Tools Skills SAS- cersfied KDnuggets

40 Big Data 2 nd Industrial RevoluSon Do old acsvises beyer Create new acsvises/businesses (c) KDnuggets

41 ApplicaSon areas Doing old things beyer Churn predicson Direct markesng/customer modeling RecommendaSons Fraud detecson Security/Intelligence CompeSSon will level companies (c) KDnuggets

42 Limit to PredicSng Human Behavior? There is randomness in human behavior and once we find 1- level effects, more data or beyer algorithms will give diminishing returns in most cases Example: Neylix Prize: the most advanced algorithms were only a few percentages beyer than basic algorithms (c) KDnuggets

43 Big Data Enables New Things! Google first big success of big data Social networks (facebook, TwiYer, LinkedIn, ) success depends on network size, i.e. big data LocaSon analyscs Health- care Personalized medicine SemanScs and AI? Imagine IBM Watson, Siri in 2020? (c) KDnuggets

44 Big Data Bubble? Big Data Gartner Hype Cycle 2012 KDnuggets 44

45 Gartner Hype Cycle for Big Data, 2012 Social Network Analysis, 5-10 Data ScienSst, 2-5 yrs Social AnalyScs, 2-5 PredicSve AnalyScs, <2 MapReduce & AlternaSve - Disillusionment KDnuggets

46 QuesSons? KDnuggets: Analy;cs, Big Data, Data Mining News, Jobs, Sodware, Data, ConsulSng, Courses, MeeSngs, PublicaSons, Webcasts, Subscribe to KDnuggets News at to editor1@kdnuggets.com KDnuggets

47 Data Mining in 1902?? KDnuggets

48 Research and Industry Disconnect? Uplim modeling needs more research AssociaSon rules need less papers Data Mining with Privacy research industry use? Conferences should bring researchers and industry people together (c) KDnuggets

49 Direct MarkeSng Lim: Random and Model- sorted Lists CPH: CumulaSve Pct Hits Random Model Pct list 5% of random list have 5% of hits 5% of model- score ranked list have 21% of hits. Lid(5%) = 21%/5% = 4.2

50 Most lim curves are surprising similar Study of lift curves in banking, telecom Best lift curves are similar Special point T=Target percentage Lift Actual lift(t) Est. lift(t) Lift(T) ~ sqrt (1/T) 4 2 G. Piatetsky- Shapiro, B. Masand, Es-ma-ng Campaign Benefits and Modeling Lid, in Proceedings of KDD- 99 Conference, ACM Press, *T% (c) KDnuggets

Targeted Marketing, KDD Cup and Customer Modeling

Targeted Marketing, KDD Cup and Customer Modeling Targeted Marketing, KDD Cup and Customer Modeling Outline Direct Marketing Review: Evaluation: Lift, Gains KDD Cup 1997 Lift and Benefit estimation Privacy and Data Mining 2 Direct Marketing Paradigm Find

More information

Machine Learning, Data Mining, and Knowledge Discovery: An Introduction

Machine Learning, Data Mining, and Knowledge Discovery: An Introduction Machine Learning, Data Mining, and Knowledge Discovery: An Introduction AHPCRC Workshop - 8/17/10 - Dr. Martin Based on slides by Gregory Piatetsky-Shapiro from Kdnuggets http://www.kdnuggets.com/data_mining_course/

More information

Doing Multidisciplinary Research in Data Science

Doing Multidisciplinary Research in Data Science Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University aadamov@qu.edu.az http://ce.qu.edu.az/~aadamov 16 May

More information

How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6

How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6 Survey Results Table of Contents Survey Results... 4 Big Data Company Strategy... 6 Big Data Business Drivers and Benefits Received... 8 Big Data Integration... 10 Big Data Implementation Challenges...

More information

Introduction to Predictive Analytics. Dr. Ronen Meiri ronen@dmway.com

Introduction to Predictive Analytics. Dr. Ronen Meiri ronen@dmway.com Introduction to Predictive Analytics Dr. Ronen Meiri Outline From big data to predictive analytics Predictive Analytics vs. BI Intelligent platforms What can we do with it. The modeling process. Example

More information

Increasing Marketing ROI with Optimized Prediction

Increasing Marketing ROI with Optimized Prediction Increasing Marketing ROI with Optimized Prediction Yottamine s Unique and Powerful Solution Smart marketers are using predictive analytics to make the best offer to the best customer for the least cost.

More information

Data Mining for Fun and Profit

Data Mining for Fun and Profit Data Mining for Fun and Profit Data mining is the extraction of implicit, previously unknown, and potentially useful information from data. - Ian H. Witten, Data Mining: Practical Machine Learning Tools

More information

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume

More information

Data Mining: Overview. What is Data Mining?

Data Mining: Overview. What is Data Mining? Data Mining: Overview What is Data Mining? Recently * coined term for confluence of ideas from statistics and computer science (machine learning and database methods) applied to large databases in science,

More information

The Library (Big) Data scien4st

The Library (Big) Data scien4st The Library (Big) Data scien4st IFLA/ALA webinar: Big Data: new roles and opportuni4es for new librarians June 15 th 2016 IFLA Big Data Special Interest Group (SIG) Wouter Klapwijk, Stellenbosch University,

More information

Building and Deploying Customer Behavior Models

Building and Deploying Customer Behavior Models Building and Deploying Customer Behavior Models February 20, 2014 David Smith, VP Marketing and Community, Revolution Analytics Paul Maiste, President and CEO, Lityx In Today s Webinar About Revolution

More information

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam

More information

Hadoop s Advantages for! Machine! Learning and. Predictive! Analytics. Webinar will begin shortly. Presented by Hortonworks & Zementis

Hadoop s Advantages for! Machine! Learning and. Predictive! Analytics. Webinar will begin shortly. Presented by Hortonworks & Zementis Webinar will begin shortly Hadoop s Advantages for Machine Learning and Predictive Analytics Presented by Hortonworks & Zementis September 10, 2014 Copyright 2014 Zementis, Inc. All rights reserved. 2

More information

IBM Power Systems This is Power on a Smarter Planet

IBM Power Systems This is Power on a Smarter Planet IBM Power Systems This is Power on a Smarter Planet Red Hat Enterprise Linux for IBM Power Systems! Filipe Miranda Global Lead for Linux on IBM System z and Power Systems!, #powerlinux, #bigdata, #IBMWatson,

More information

The Data Engineer. Mike Tamir Chief Science Officer Galvanize. Steven Miller Global Leader Academic Programs IBM Analytics

The Data Engineer. Mike Tamir Chief Science Officer Galvanize. Steven Miller Global Leader Academic Programs IBM Analytics The Data Engineer Mike Tamir Chief Science Officer Galvanize Steven Miller Global Leader Academic Programs IBM Analytics Alessandro Gagliardi Lead Faculty Galvanize Businesses are quickly realizing that

More information

INDEX. Introduction Page 3. Methodology Page 4. Findings. Conclusion. Page 5. Page 10

INDEX. Introduction Page 3. Methodology Page 4. Findings. Conclusion. Page 5. Page 10 FINDINGS 1 INDEX 1 2 3 4 Introduction Page 3 Methodology Page 4 Findings Page 5 Conclusion Page 10 INTRODUCTION Our 2016 Data Scientist report is a follow up to last year s effort. Our aim was to survey

More information

Gordon S. Linoff Founder Data Miners, Inc. gordon@data-miners.com

Gordon S. Linoff Founder Data Miners, Inc. gordon@data-miners.com Survival Data Mining Gordon S. Linoff Founder Data Miners, Inc. gordon@data-miners.com What to Expect from this Talk Background on survival analysis from a data miner s perspective Introduction to key

More information

Big Data. Lyle Ungar, University of Pennsylvania

Big Data. Lyle Ungar, University of Pennsylvania Big Data Big data will become a key basis of competition, underpinning new waves of productivity growth, innovation, and consumer surplus. McKinsey Data Scientist: The Sexiest Job of the 21st Century -

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania ruxandra_stefania.petre@yahoo.com Over

More information

Predictive Analytics Certificate Program

Predictive Analytics Certificate Program Information Technologies Programs Predictive Analytics Certificate Program Accelerate Your Career Offered in partnership with: University of California, Irvine Extension s professional certificate and

More information

Estimating Campaign Benefits and Modeling Lift

Estimating Campaign Benefits and Modeling Lift Estimating Campaign Benefits and Modeling Lift Gregy Piatetsky-Shapiro Knowledge Stream Partners Brij Masand GTE Labaties Boston, MA 219 Waltham, MA 2451-1128 gps at kstream.com brij at gte.com Abstract

More information

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»

More information

Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90

Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90 FREE echapter C H A P T E R1 Big Data and Analytics Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90 percent of the data in the

More information

International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 ISSN 2278-7763. BIG DATA: A New Technology

International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 ISSN 2278-7763. BIG DATA: A New Technology International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 BIG DATA: A New Technology Farah DeebaHasan Student, M.Tech.(IT) Anshul Kumar Sharma Student, M.Tech.(IT)

More information

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health Lecture 1: Data Mining Overview and Process What is data mining? Example applications Definitions Multi disciplinary Techniques Major challenges The data mining process History of data mining Data mining

More information

Data are everywhere. IBM projects that every day we generate 2.5

Data are everywhere. IBM projects that every day we generate 2.5 C HAPTER 1 Big Data and Analytics Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. 1 In relative terms, this means 90 percent of the data in the world has been

More information

Big Data a threat or a chance?

Big Data a threat or a chance? Big Data a threat or a chance? Helwig Hauser University of Bergen, Dept. of Informatics Big Data What is Big Data? well, lots of data, right? we come back to this in a moment. certainly, a buzz-word but

More information

How Big Data is Different

How Big Data is Different FALL 2012 VOL.54 NO.1 Thomas H. Davenport, Paul Barth and Randy Bean How Big Data is Different Brought to you by Please note that gray areas reflect artwork that has been intentionally removed. The substantive

More information

Machine Learning and Predictive Analytics Foster Growth [1]

Machine Learning and Predictive Analytics Foster Growth [1] Machine Learning and Predictive Analytics Foster Growth [1] Machine learning technology, which is defined in this ProgrammableWeb article [2], is starting to become a common component in many types of

More information

Beyond SEO: What to Do With Your Website Visitors

Beyond SEO: What to Do With Your Website Visitors Beyond SEO What To Do With Your Website Visitors Ron Stauffer Beyond SEO What To Do With Your Website Visitors Ron Stauffer Infront Webworks The Web in 1994 www.apple.com The Web in 1994 www.amazon.com

More information

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,

More information

No BI without Machine Learning

No BI without Machine Learning No BI without Machine Learning Francis Pieraut francis@qmining.com http://fraka6.blogspot.com/ 10 March 2011 MTI-820 ETS Too Much Data Supervised Learning (classification) Unsupervised Learning (clustering)

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER Hur hanterar vi utmaningar inom området - Big Data Jan Östling Enterprise Technologies Intel Corporation, NER Legal Disclaimers All products, computer systems, dates, and figures specified are preliminary

More information

Big Data in Finance. Alexander Grigoriev. School of Business and Economics Sharing Success

Big Data in Finance. Alexander Grigoriev. School of Business and Economics Sharing Success Big Data in Finance Alexander Grigoriev Definitions Wiki: Big Data Gartner s 3V-definition [2012]: Big data is high volume, high velocity, and/or high variety information assets that require new forms

More information

IN-DEPTH USE CASE IN-DEPTH USE CASE. drive 9x. increase in reach INFLUENCERS. modernb2b.co

IN-DEPTH USE CASE IN-DEPTH USE CASE. drive 9x. increase in reach INFLUENCERS. modernb2b.co IN-DEPTH USE CASE IN-DEPTH USE CASE drive 9x INFLUENCERS increase in reach INFLUENCER MARKETING THE STATUS QUO For most businesses, social media is about spreading your news and driving awareness. Some

More information

PREDICTIVE MARKETING, DIGITAL ATTRIBUTION, OPTIMIZATION, AND DATA-DRIVEN PERSONALIZATION

PREDICTIVE MARKETING, DIGITAL ATTRIBUTION, OPTIMIZATION, AND DATA-DRIVEN PERSONALIZATION PREDICTIVE MARKETING, DIGITAL ATTRIBUTION, OPTIMIZATION, AND DATA-DRIVEN PERSONALIZATION A m a r t y a B h a t t a c h a r j y & S u n e e l G r o v e r P r i n c i p a l S o l u t i o n A r c h i t e

More information

SEO Presentation. Asenyo Inc.

SEO Presentation. Asenyo Inc. SEO Presentation What is Search Engine Optimization? Search Engine Optimization (SEO) : PPC and Organic Results Pay Per Click Ads The means of achieving top search engine results without having to incur

More information

Grab some coffee and enjoy the pre-show banter before the top of the hour!

Grab some coffee and enjoy the pre-show banter before the top of the hour! Grab some coffee and enjoy the pre-show banter before the top of the hour! The Analytic Platform: Empowering the Business Now The Briefing Room Welcome Host: Eric Kavanagh eric.kavanagh@bloorgroup.com

More information

Big Data Big Deal? Salford Systems www.salford-systems.com

Big Data Big Deal? Salford Systems www.salford-systems.com Big Data Big Deal? Salford Systems www.salford-systems.com 2015 Copyright Salford Systems 2010-2015 Big Data Is The New In Thing Google trends as of September 24, 2015 Difficult to read trade press without

More information

High-Performance Analytics

High-Performance Analytics High-Performance Analytics David Pope January 2012 Principal Solutions Architect High Performance Analytics Practice Saturday, April 21, 2012 Agenda Who Is SAS / SAS Technology Evolution Current Trends

More information

How To Understand Data Mining In R And Rattle

How To Understand Data Mining In R And Rattle http: // togaware. com Copyright 2014, Graham.Williams@togaware.com 1/40 Data Analytics and Business Intelligence (8696/8697) Introducing Data Science with R and Rattle Graham.Williams@togaware.com Chief

More information

Open source Google-style large scale data analysis with Hadoop

Open source Google-style large scale data analysis with Hadoop Open source Google-style large scale data analysis with Hadoop Ioannis Konstantinou Email: ikons@cslab.ece.ntua.gr Web: http://www.cslab.ntua.gr/~ikons Computing Systems Laboratory School of Electrical

More information

Data Mining in CRM & Direct Marketing. Jun Du The University of Western Ontario jdu43@uwo.ca

Data Mining in CRM & Direct Marketing. Jun Du The University of Western Ontario jdu43@uwo.ca Data Mining in CRM & Direct Marketing Jun Du The University of Western Ontario jdu43@uwo.ca Outline Why CRM & Marketing Goals in CRM & Marketing Models and Methodologies Case Study: Response Model Case

More information

Cloud Computing and Big Data That s Why! Ray Walshe 14 th March 2013

Cloud Computing and Big Data That s Why! Ray Walshe 14 th March 2013 Cloud Computing and Big Data That s Why! Ray Walshe 14 th March 2013 Data Centre Growth We need more data centres??? (c) Ray Walshe 2013 2 By 2015 By 2015, about 24% of all new business software purchases

More information

Online Content Optimization Using Hadoop. Jyoti Ahuja Dec 20 2011

Online Content Optimization Using Hadoop. Jyoti Ahuja Dec 20 2011 Online Content Optimization Using Hadoop Jyoti Ahuja Dec 20 2011 What do we do? Deliver right CONTENT to the right USER at the right TIME o Effectively and pro-actively learn from user interactions with

More information

Machine Learning and Predictive Analytics Foster Growth Convert Edit Feb. 21 2014

Machine Learning and Predictive Analytics Foster Growth Convert Edit Feb. 21 2014 Machine Learning and Predictive Analytics Foster Growth Convert Edit Feb. 21 2014 By Janet Wagner, PW Staff Machine learning technology, which is defined in this ProgrammableWeb article, is starting to

More information

Cloud Computing Backgrounder

Cloud Computing Backgrounder Cloud Computing Backgrounder No surprise: information technology (IT) is huge. Huge costs, huge number of buzz words, huge amount of jargon, and a huge competitive advantage for those who can effectively

More information

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD Predictive Analytics Techniques: What to Use For Your Big Data March 26, 2014 Fern Halper, PhD Presenter Proven Performance Since 1995 TDWI helps business and IT professionals gain insight about data warehousing,

More information

Predictive Analytics for Demand Forecasting and Planning Managers A Big Data Challenge Hans Levenbach, Delphus, Inc.

Predictive Analytics for Demand Forecasting and Planning Managers A Big Data Challenge Hans Levenbach, Delphus, Inc. Predictive Analytics for Demand Forecasting and Planning Managers A Big Data Challenge Hans Levenbach, Delphus, Inc. ISF2013, KAIST College of Business, Seoul, Korea Agenda Role of Big Data in Small Predictive

More information

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence The Rise of Industrial Big Data Brian Courtney General Manager Industrial Data Intelligence Agenda Introduction Big Data for the industrial sector Case in point: Big data saves millions at GE Energy Seeking

More information

Data Mining. Knowledge Discovery, Data Warehousing and Machine Learning Final remarks. Lecturer: JERZY STEFANOWSKI

Data Mining. Knowledge Discovery, Data Warehousing and Machine Learning Final remarks. Lecturer: JERZY STEFANOWSKI Data Mining Knowledge Discovery, Data Warehousing and Machine Learning Final remarks Lecturer: JERZY STEFANOWSKI Email: Jerzy.Stefanowski@cs.put.poznan.pl Data Mining a step in A KDD Process Data mining:

More information

Building HTML5 and hybrid mobile apps using cloud services. Andrei Glazunov

Building HTML5 and hybrid mobile apps using cloud services. Andrei Glazunov Building HTML5 and hybrid mobile apps using cloud services Andrei Glazunov About Exadel Exadel is a global software engineering company. Founded in 1998, headquarters in San Francisco Bay Area 7 development

More information

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate

More information

Exploring the Efficiency of Big Data Processing with Hadoop MapReduce

Exploring the Efficiency of Big Data Processing with Hadoop MapReduce Exploring the Efficiency of Big Data Processing with Hadoop MapReduce Brian Ye, Anders Ye School of Computer Science and Communication (CSC), Royal Institute of Technology KTH, Stockholm, Sweden Abstract.

More information

David G. Belanger, PhD, Senior Research Fellow, Stevens Institute of Technology, New Jersey, USA Topic: Big Data - The Next Phase Abstract

David G. Belanger, PhD, Senior Research Fellow, Stevens Institute of Technology, New Jersey, USA Topic: Big Data - The Next Phase Abstract David G. Belanger, PhD, Senior Research Fellow, Stevens Institute of Technology, New Jersey, USA Dr. David Belanger is currently a Senior Research Fellow at Stevens Institute of Technology. In this role

More information

Big Data and the Data Warehouse

Big Data and the Data Warehouse Big Data and the Data Warehouse When the phrase big data management hit the data management and business intelligence (BI) industry, it had many IT professionals wondering if it would be the real deal

More information

Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013

Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013 Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013 Housekeeping 1. Any questions coming out of today s presentation can be discussed in the bar this evening 2. OCF is

More information

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012 Big Data Buzzwords From A to Z By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012 Big Data Buzzwords Big data is one of the, well, biggest trends in IT today, and it has spawned a whole new generation

More information

What is Customer Relationship Management? Customer Relationship Management Analytics. Customer Life Cycle. Objectives of CRM. Three Types of CRM

What is Customer Relationship Management? Customer Relationship Management Analytics. Customer Life Cycle. Objectives of CRM. Three Types of CRM Relationship Management Analytics What is Relationship Management? CRM is a strategy which utilises a combination of Week 13: Summary information technology policies processes, employees to develop profitable

More information

A Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML

A Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML www.bsc.es A Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML Josep Ll. Berral, Nicolas Poggi, David Carrera Workshop on Big Data Benchmarks Toronto, Canada 2015 1 Context ALOJA: framework

More information

Big Data and Marketing

Big Data and Marketing Big Data and Marketing Professor Venky Shankar Coleman Chair in Marketing Director, Center for Retailing Studies Mays Business School Texas A&M University http://www.venkyshankar.com venky@venkyshankar.com

More information

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Analyst @ Expedia

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Analyst @ Expedia The Impact of Big Data on Classic Machine Learning Algorithms Thomas Jensen, Senior Business Analyst @ Expedia Who am I? Senior Business Analyst @ Expedia Working within the competitive intelligence unit

More information

Open source large scale distributed data management with Google s MapReduce and Bigtable

Open source large scale distributed data management with Google s MapReduce and Bigtable Open source large scale distributed data management with Google s MapReduce and Bigtable Ioannis Konstantinou Email: ikons@cslab.ece.ntua.gr Web: http://www.cslab.ntua.gr/~ikons Computing Systems Laboratory

More information

Developing Data Analytics Skills in Japan: Status and Challenge

Developing Data Analytics Skills in Japan: Status and Challenge Developing Data Analytics Skills in Japan: Status and Challenge Hiroshi Maruyama, The Institute of Statistical Mathematics Abstract: Japan needs to develop data analytics talents quickly to catch up with

More information

Text Mining with R Twitter Data Analysis 1

Text Mining with R Twitter Data Analysis 1 Text Mining with R Twitter Data Analysis 1 Yanchang Zhao http://www.rdatamining.com R and Data Mining Workshop for the Master of Business Analytics course, Deakin University, Melbourne 28 May 2015 1 Presented

More information

TOP 8 TRENDS FOR 2016 BIG DATA

TOP 8 TRENDS FOR 2016 BIG DATA The year 2015 was an important one in the world of big data. What used to be hype became the norm as more businesses realized that data, in all forms and sizes, is critical to making the best possible

More information

What Managers Need to Know about Data Science. Annie Flippo

What Managers Need to Know about Data Science. Annie Flippo What Managers Need to Know about Data Science Annie Flippo Outline What is data science Industry trends What is data The Optimal Data Scientist The Optimal Manager Topics in Data Science Topics in Cloud

More information

BUSINESS INTELLIGENCE COMPETENCY CENTER

BUSINESS INTELLIGENCE COMPETENCY CENTER BUSINESS INTELLIGENCE COMPETENCY CENTER Last Updated: December 2012 Dr. Joseph M. Woodside Executive Director BICC, Stetson University Dr. Ted J. Surynt Executive Advisory Board, Stetson University Dr.

More information

Focus on the business, not the business of data warehousing!

Focus on the business, not the business of data warehousing! Focus on the business, not the business of data warehousing! Adam M. Ronthal Technical Product Marketing and Strategy Big Data, Cloud, and Appliances @ARonthal 1 Disclaimer Copyright IBM Corporation 2014.

More information

APPROACHABLE ANALYTICS MAKING SENSE OF DATA

APPROACHABLE ANALYTICS MAKING SENSE OF DATA APPROACHABLE ANALYTICS MAKING SENSE OF DATA AGENDA SAS DELIVERS PROVEN SOLUTIONS THAT DRIVE INNOVATION AND IMPROVE PERFORMANCE. About SAS SAS Business Analytics Framework Approachable Analytics SAS for

More information

Big Data and Data Science: Behind the Buzz Words

Big Data and Data Science: Behind the Buzz Words Big Data and Data Science: Behind the Buzz Words Peggy Brinkmann, FCAS, MAAA Actuary Milliman, Inc. April 1, 2014 Contents Big data: from hype to value Deconstructing data science Managing big data Analyzing

More information

Hexaware E-book on Predictive Analytics

Hexaware E-book on Predictive Analytics Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics What is Data mining? Data mining,

More information

BIG DATA & ANALYTICS

BIG DATA & ANALYTICS 12 th NCS International Conference Information Technology for Inclusive Development. Akure, Ondo State JULY 22-24, 2015 JOHNSON S. IYILADE, Ph.D. Researcher, University of Saskatchewan, Canada Founder

More information

INTERNATIONAL STUDENT MARKETING. Global Digital Advertising Agency for Universities and Colleges

INTERNATIONAL STUDENT MARKETING. Global Digital Advertising Agency for Universities and Colleges INTERNATIONAL STUDENT MARKETING Global Digital Advertising Agency for Universities and Colleges Net Natives work with over 200 global universities to recruit students from 150 countries. Our outcome focused

More information

Syllabus INFO-GB-3322. Design and Development of Web and Mobile Applications (Especially for Start Ups)

Syllabus INFO-GB-3322. Design and Development of Web and Mobile Applications (Especially for Start Ups) Syllabus INFO-GB-3322 Design and Development of Web and Mobile Applications (Especially for Start Ups) Spring 2015 Stern School of Business Norman White, KMEC 8-88 Email: nwhite@stern.nyu.edu Phone: 212-998

More information

DATA MINING - SELECTED TOPICS

DATA MINING - SELECTED TOPICS DATA MINING - SELECTED TOPICS Peter Brezany Institute for Software Science University of Vienna E-mail : brezany@par.univie.ac.at 1 MINING SPATIAL DATABASES 2 Spatial Database Systems SDBSs offer spatial

More information

Big Data and Open Data

Big Data and Open Data Big Data and Open Data Bebo White SLAC National Accelerator Laboratory/ Stanford University!! bebo@slac.stanford.edu dekabytes hectobytes Big Data IS a buzzword! The Data Deluge From the beginning of

More information

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the

More information

Market Assessment & Campaign SLA Calculator LOGO WE OPEN THE DOOR, SO YOU CAN CLOSE IT.

Market Assessment & Campaign SLA Calculator LOGO WE OPEN THE DOOR, SO YOU CAN CLOSE IT. Market Assessment & Campaign SLA Calculator LOGO WE OPEN THE DOOR, SO YOU CAN CLOSE IT. Your Market Assessment Overview Your Inbound Market Assessment and Campaign SLA Calculator is broken down into several

More information

Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction

Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction Data Mining and Exploration Data Mining and Exploration: Introduction Amos Storkey, School of Informatics January 10, 2006 http://www.inf.ed.ac.uk/teaching/courses/dme/ Course Introduction Welcome Administration

More information

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme Big Data Analytics Prof. Dr. Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany 33. Sitzung des Arbeitskreises Informationstechnologie,

More information

What is Data Science? Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014

What is Data Science? Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014 What is Data Science? { Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014 Let s start with: What is Data? http://upload.wikimedia.org/wikipedia/commons/f/f0/darpa

More information

Session 10 : E-business models, Big Data, Data Mining, Cloud Computing

Session 10 : E-business models, Big Data, Data Mining, Cloud Computing INFORMATION STRATEGY Session 10 : E-business models, Big Data, Data Mining, Cloud Computing Tharaka Tennekoon B.Sc (Hons) Computing, MBA (PIM - USJ) POST GRADUATE DIPLOMA IN BUSINESS AND FINANCE 2014 Internet

More information

Hadoop & SAS Data Loader for Hadoop

Hadoop & SAS Data Loader for Hadoop Turning Data into Value Hadoop & SAS Data Loader for Hadoop Sebastiaan Schaap Frederik Vandenberghe Agenda What s Hadoop SAS Data management: Traditional In-Database In-Memory The Hadoop analytics lifecycle

More information

Working with telecommunications

Working with telecommunications Working with telecommunications Minimizing churn in the telecommunications industry Contents: 1 Churn analysis using data mining 2 Customer churn analysis with IBM SPSS Modeler 3 Types of analysis 3 Feature

More information

Data Mining Methods: Applications for Institutional Research

Data Mining Methods: Applications for Institutional Research Data Mining Methods: Applications for Institutional Research Nora Galambos, PhD Office of Institutional Research, Planning & Effectiveness Stony Brook University NEAIR Annual Conference Philadelphia 2014

More information

Machine Learning for Display Advertising

Machine Learning for Display Advertising Machine Learning for Display Advertising The Idea: Social Targeting for Online Advertising Performance Current ad spending seems disproportionate Online Advertising Spending Breakdown (2009) Source: IAB

More information

The Big Data Revolution: welcome to the Cognitive Era.

The Big Data Revolution: welcome to the Cognitive Era. The Big Data Revolution: welcome to the Cognitive Era. Yves Eychenne, Cloud Advisor, IBM Email: yves.eychenne@fr.ibm.com @yeychenne 2015 INTERNATIONAL BUSINESS MACHINES CORPORATION Agenda Big Data and

More information

How To Become An Analytics Consultant

How To Become An Analytics Consultant Leading Business with Data MA Nang Laik Assistant Professor of Information Systems (Practice) Programme Director, MITB (Analytics) 1 Outline 1. Big Data Revolution 2. Why Master of IT in Business (Analytics)?

More information

B2B opportunity predictiona Big Data and Advanced. Analytics Approach. Insert

B2B opportunity predictiona Big Data and Advanced. Analytics Approach. Insert B2B opportunity predictiona Big Data and Advanced Analytics Approach Vodafone Global Enterprise Manu Kumar, Head of Targeting, Optimization & Data Science Insert Agenda Why B2B opportunities are hard to

More information

Energy Savings from Business Energy Feedback

Energy Savings from Business Energy Feedback Energy Savings from Business Energy Feedback Behavior, Energy, and Climate Change Conference 2015 October 21, 2015 Jim Stewart, Ph.D. INTRODUCTION 2 Study Background Xcel Energy runs the Business Energy

More information

Second CRM Startup Pack

Second CRM Startup Pack Second CRM Startup Pack An Introduction Making Businesses Profitable www.secondcrm.com /secondcrm CRM for Startups Early stage Startups only focus on the idea and product and customer development is usually

More information

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Spring 2015 Thomas Hill, Ph.D. VP Analytic Solutions Dell Statistica Overview and Agenda Dell Software overview Dell in

More information

Francois Ajenstat, Tableau Stephanie McReynolds, Aster Data Steve e Wooledge, Aster Data

Francois Ajenstat, Tableau Stephanie McReynolds, Aster Data Steve e Wooledge, Aster Data Deep Data Exploration: Find Patterns in Your Data Faster & Easier Curt Monash, Founder and President, Monash Research Francois Ajenstat, Tableau Stephanie McReynolds, Aster Data Steve e Wooledge, Aster

More information

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first

More information

Big Data in Enterprise challenges & opportunities. Yuanhao Sun 孙 元 浩 yuanhao.sun@intel.com Software and Service Group

Big Data in Enterprise challenges & opportunities. Yuanhao Sun 孙 元 浩 yuanhao.sun@intel.com Software and Service Group Big Data in Enterprise challenges & opportunities Yuanhao Sun 孙 元 浩 yuanhao.sun@intel.com Software and Service Group Big Data Phenomenon 1.8ZB in 2011 2 Days > the dawn of civilization to 2003 750M Photos

More information

Data Mining + Business Intelligence. Integration, Design and Implementation

Data Mining + Business Intelligence. Integration, Design and Implementation Data Mining + Business Intelligence Integration, Design and Implementation ABOUT ME Vijay Kotu Data, Business, Technology, Statistics BUSINESS INTELLIGENCE - Result Making data accessible Wider distribution

More information