Opportunities and Limitations of Big Data

Size: px
Start display at page:

Download "Opportunities and Limitations of Big Data"

Transcription

1 Opportunities and Limitations of Big Data Karl Schmedders University of Zurich and Swiss Finance Institute «Big Data: Little Ethics?» HWZ-Darden-Conference June 4, 2015 On fortune.com this morning: Apple's Tim Cook launches blistering attack on Facebook and Google Blistering speech stresses Apple as a company that doesn t want your data. Source: 2 1

2 Old Rules vs. Modern Myths Sample Size The n=all myth Causation vs. Correlation The End of Theory myth Model Fitting Machine Learning dangers 3 Tracking Influenza with Google Flu Trends Google Flu Trends: Track the spread of influenza across the US through analyzing the top 50 million search terms Centers for Disease Control and Prevention: Track the spread of influenza across the US through analyzing the reports from doctors 4 2

3 Advantages Google is much faster than the CDC Google Flu Trends: 1 day CDC: > 1 week Flu Trends based on millions of people (n=all, n=infinity) Approach us quick, accurate, cheap and theory-free 5 When Google got flu wrong (Nature, 2013) Massive overestimation of flu season 2012/13 Earlier problems in

4 What went wrong? Widespread media coverage of the severity of US flu season Reports may have triggered many flu-related searches by people who were not ill Well-known old problem: wrong sample Sample Bias 7 Classical Example: U.S. Presidential Election 1936 Forecasts of election outcomes: The Literary Digest (sample: 2.4 million people) vs. George Gallup (sample: people) Prediction Literary Digest: 43% Roosevelt Prediction George Gallup: 61% Roosevelt Actual outcome: 61% Roosevelt The Literary Digest sent out forms to people from a list of automobile registrations and telephone directories Who had a phone in 1936? 8 4

5 Still relevant today Why were the Israeli election polls so wrong? "The Internet does not represent the state of Israel and the people of Israel," he said, referring to modern statistical methods. "It represents panels, and the panels are biased strongly to the center - - Tel Aviv, better-educated, more participants in this kind of conversation. (Avi Degani, Tel Aviv University) 9 Why Sampling is Still Very Important Self-reported user data is often a biased sample Growth in noise is swamping the signal that businesses hope to find in the data n = all (or n = infinity ) is wishful thinking 10 5

6 Available Data Sample Objective: Unbiased analysis Problem: User data likely biased Solution: Do NOT use all your user data (may be too much anyway) Sample from available data Check for representativeness Target Population 11 Old Rules vs. Modern Myths Sample Size The n=all myth Causation vs. Correlation The End of Theory myth Model Fitting Machine Learning dangers 12 6

7 The End of Theory? The End of Theory: The Data Deluge Makes the Scientific Method Obsolete (Chris Anderson, Wired Magazine, 2008) All models are wrong, and increasingly you can succeed without them. (Peter Norvig, Google s research director) [F]aced with massive data, this approach to science hypothesize, model, test is becoming obsolete. 13 Did you really mean that? There is now a better way. Petabytes allow us to say: Correlation is enough. We can stop looking for models. We can analyze the data without hypotheses about what it might show. We can throw the numbers into the biggest computing clusters the world has ever seen and let statistical algorithms find patterns where science cannot. Referring to The End of Theory : This revolutionary notion has now entered not just the popular imagination, but also the research practices of corporations, states, journalists and academics. (Mark Graham, The Guardian, 2012) 14 7

8 Spurious Correlation Source: Correlation or Causation (Business Week, 2011) 15 And there is much more Super Bowl and Dow Jones Industrial Average and Superbowl correctly forecasted sign of annual DJIA return Hot-hand fallacy The Hot Hand in Basketball: On the Misperception of Random Sequences. Gilovich, Tversky, Vallone. Cognitive psychology

9 Lack of models Without a model we cannot distinguish between spurious and meaningful correlations Lack of models makes data less useful than it might be; big data insights will be limited Nate Silver describes The End of Theory as categorically the wrong attitude Sound theoretical understanding of statistical application is absolutely necessary 17 Old Rules vs. Modern Myths Sample Size The n=all myth Causation vs. Correlation The End of Theory myth Model Fitting Machine Learning dangers 18 9

10 Top 500 supercomputers are getting faster Source: 19 while storage of data is getting cheaper Source:

11 Big Data and Business Analytics More storage More data Faster computers New sophisticated methods 21 Aside: Reproducibility of Statistical Results Numerous cases in the biomedical field of statistical results from clinical trials that cannot be reproduced in separate studies In 2011, Bayer researchers reported that they were able to reproduce the results of only 17 of 67 published studies they examined In 2012, Amgen researchers reported that they were able to reproduce the results of only 6 of 53 published cancer studies In 2014, a review of Tamiflu found that while it made flu symptoms disappear a bit sooner, it did not stop serious complications or keep people out of the hospital 22 11

12 Fundamental Flaw Publication of only successful trials introduces bias Success in study may have been purely accidental With enough trials, at least one will sooner or later be successful 23 How to become a Guru to some Investors Financial advisor sends letters to 10,240 = 10 x 2^10 potential clients, with half (5120) predicting a particular stock will go up, and the other half predicting it will go down One month later, the advisor sends letters only to the 5120 investors who were previously sent the correct prediction, with half (2560) letters predicting a certain security will go up, and the other half predicting it will go down The advisor continues this process for 10 months

13 in a fraudulent way Ten investors will have been sent ten consecutive correct predictions! They may be so impressed by the advisor's ten consecutive spot-on predictions that will entrust to him/her all of their assets... Final ten investors are unaware of the thousands of wrong predictions 25 Statistical Overfitting I remember my friend Johnny von Neumann used to say, with four parameters I can fit an elephant, and with five I can make him wiggle his trunk. (Enrico Fermi) If you torture the data long enough, it will confess. (Ronald H. Coase) 26 13

14 Backtest Overfitting in Finance Backtesting of an investment strategy: use historical market data to assess performance Backtest overfitting: develop a trading strategy (``model ) that is sufficiently complex and has many degrees of freedom to fit the data almost perfectly Computers can analyze millions or billions of variations of a strategy, so sooner or later you will find a great match 27 Strategy vs. Underlying Asset (Pseudo random) Source: (Thanks to David H. Bailey) 28 14

15 After Iteration After Iteration

16 After Iteration After convergence 32 16

17 and on a new data set Source: (Thanks to David H. Bailey) 33 Too much of a good thing? David H. Bailey (LBNL & UC Davis) [C]omputers operating on big data can generate nonsense faster than ever before! 34 17

18 Old Rules vs. Modern Myths Sample Size The n=all myth Causation vs. Correlation The End of Theory myth Model Fitting Machine Learning dangers 35 18

Healthcare data analytics. Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw

Healthcare data analytics. Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw Healthcare data analytics Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw Outline Data Science Enabling technologies Grand goals Issues Google flu trend Privacy Conclusion Analytics

More information

Statistics, Big Data and Data Science!?

Statistics, Big Data and Data Science!? Statistics, Big Data and Data Science!? Prof. Dr. Göran Kauermann Ludwig-Maximilians-Universität Munich, Germany Statistics, Big Data and Data Science Statistics Founded around 1900 with the seminal work

More information

Sustaining Mind and the brand machine relevance with the connected consumer

Sustaining Mind and the brand machine relevance with the connected consumer Research excellence Sustaining Mind and the brand machine relevance with the connected consumer It may be immense, fast and mind-bendingly varied. But researchers must remember: Big Data can no more speak

More information

Why Big Data is not Big Hype in Economics and Finance?

Why Big Data is not Big Hype in Economics and Finance? Why Big Data is not Big Hype in Economics and Finance? Ariel M. Viale Marshall E. Rinker School of Business Palm Beach Atlantic University West Palm Beach, April 2015 1 The Big Data Hype 2 Big Data as

More information

Collaborations between Official Statistics and Academia in the Era of Big Data

Collaborations between Official Statistics and Academia in the Era of Big Data Collaborations between Official Statistics and Academia in the Era of Big Data World Statistics Day October 20-21, 2015 Budapest Vijay Nair University of Michigan Past-President of ISI vnn@umich.edu What

More information

Big Data Hope or Hype?

Big Data Hope or Hype? Big Data Hope or Hype? David J. Hand Imperial College, London and Winton Capital Management Big data science, September 2013 1 Google trends on big data Google search 1 Sept 2013: 1.6 billion hits on big

More information

Introduction to Big Data! with Apache Spark" UC#BERKELEY#

Introduction to Big Data! with Apache Spark UC#BERKELEY# Introduction to Big Data! with Apache Spark" UC#BERKELEY# Course Goals" This Lecture" Brief History of Data Analysis" Big Data and Data Science Why All the Excitement?" Where Big Data Comes From" Course

More information

Extreme Computing. Big Data. Stratis Viglas. School of Informatics University of Edinburgh sviglas@inf.ed.ac.uk. Stratis Viglas Extreme Computing 1

Extreme Computing. Big Data. Stratis Viglas. School of Informatics University of Edinburgh sviglas@inf.ed.ac.uk. Stratis Viglas Extreme Computing 1 Extreme Computing Big Data Stratis Viglas School of Informatics University of Edinburgh sviglas@inf.ed.ac.uk Stratis Viglas Extreme Computing 1 Petabyte Age Big Data Challenges Stratis Viglas Extreme Computing

More information

!!! The Fallacy of Big Data! Brian Fine and Con Menictas!

!!! The Fallacy of Big Data! Brian Fine and Con Menictas! !!! The Fallacy of Big Data! Brian Fine and Con Menictas! 1! What is Big Data?! Big data is a vague term for a massive phenomenon that has rapidly become an obsession with entrepreneurs, scientists, governments

More information

Statistical Challenges with Big Data in Management Science

Statistical Challenges with Big Data in Management Science Statistical Challenges with Big Data in Management Science Arnab Kumar Laha Indian Institute of Management Ahmedabad Analytics vs Reporting Competitive Advantage Reporting Prescriptive Analytics (Decision

More information

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 美 國 13 歲 學 生 用 Big Data 找 出 霸 淩 熱 點 Puri 架 設 網 站 Bullyvention, 藉 由 分 析 Twitter 上 找 出 提 到 跟 霸 凌 相 關 的 詞, 搭 配 地 理 位 置

More information

Finding Patterns the Challenge of Big Data 1

Finding Patterns the Challenge of Big Data 1 Finding Patterns The Challenge of Big Data David J. Hand Imperial College, London and Winton Capital Management November 2015 Finding Patterns the Challenge of Big Data 1 we are on the cusp of a tremendous

More information

Is Big Data Bigger than a Bread Box?

Is Big Data Bigger than a Bread Box? Is Big Data Bigger than a Bread Box? Bradley Strauss Chitika, Inc. January 14, 2014 The Basic Problem The basic problem we face is simple to state: the big in big data is not well-defined, and perhaps

More information

How can you unlock the value in real-world data? A novel approach to predictive analytics could make the difference.

How can you unlock the value in real-world data? A novel approach to predictive analytics could make the difference. How can you unlock the value in real-world data? A novel approach to predictive analytics could make the difference. What if you could diagnose patients sooner, start treatment earlier, and prevent symptoms

More information

PSYCHOLOGY PROGRAM LEARNING GOALS AND OUTCOMES BY COURSE LISTING

PSYCHOLOGY PROGRAM LEARNING GOALS AND OUTCOMES BY COURSE LISTING PSYCHOLOGY PROGRAM LEARNING GOALS AND OUTCOMES BY COURSE LISTING Psychology 1010: General Psychology Learning Goals and Outcomes LEARNING GOAL 1: KNOWLEDGE BASE OF PSYCHOLOGY Demonstrate familiarity with

More information

Getting personal: The future of communications

Getting personal: The future of communications Getting personal: The future of communications Neil Wholey LGinsight and Head of Research and Customer Insight at Westminster City Council @neilwholey Accuracy of opinion polls http://thefutureplace.type

More information

Cutting Through the Myths, Hype and Confusion Real Answers, Real Facts about SEO

Cutting Through the Myths, Hype and Confusion Real Answers, Real Facts about SEO How to Decipher Fact from Fiction Debunking SEO s Greatest Myths By: Jennifer Horowitz, Director of Marketing www.ecombuffet.com jennifer@ecombuffet.com 562-592-5347 This paper is designed to educate you

More information

Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index

Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index Rickard Nyman *, Paul Ormerod Centre for the Study of Decision Making Under Uncertainty,

More information

Big Data Big Knowledge?

Big Data Big Knowledge? EBPI Epidemiology, Biostatistics and Prevention Institute Big Data Big Knowledge? Torsten Hothorn 2015-03-06 The end of theory The End of Theory: The Data Deluge Makes the Scientific Method Obsolete (Chris

More information

BIG DATA FUNDAMENTALS

BIG DATA FUNDAMENTALS BIG DATA FUNDAMENTALS Timeframe Minimum of 30 hours Use the concepts of volume, velocity, variety, veracity and value to define big data Learning outcomes Critically evaluate the need for big data management

More information

Statistical Fallacies: Lying to Ourselves and Others

Statistical Fallacies: Lying to Ourselves and Others Statistical Fallacies: Lying to Ourselves and Others "There are three kinds of lies: lies, damned lies, and statistics. Benjamin Disraeli +/- Benjamin Disraeli Introduction Statistics, assuming they ve

More information

Analytics in Days White Paper and Business Case

Analytics in Days White Paper and Business Case Analytics in Days White Paper and Business Case Analytics Navigating the Maze Analytics is hot. It seems virtually everyone needs or wants it, but many still aren t sure what the business case is or how

More information

Preparing for Big Data for SoC/IC Design By Dean Drako, IC Manage President & CEO DAC 2014 Multi-Site Design Panel Opening Remarks (edited transcript)

Preparing for Big Data for SoC/IC Design By Dean Drako, IC Manage President & CEO DAC 2014 Multi-Site Design Panel Opening Remarks (edited transcript) Preparing for Big Data for SoC/IC Design By Dean Drako, IC Manage President & CEO DAC 2014 Multi-Site Design Panel Opening Remarks (edited transcript) One of the things we are working on at IC manage is

More information

Characterizing Task Usage Shapes in Google s Compute Clusters

Characterizing Task Usage Shapes in Google s Compute Clusters Characterizing Task Usage Shapes in Google s Compute Clusters Qi Zhang 1, Joseph L. Hellerstein 2, Raouf Boutaba 1 1 University of Waterloo, 2 Google Inc. Introduction Cloud computing is becoming a key

More information

Analyzing Big Data: The Path to Competitive Advantage

Analyzing Big Data: The Path to Competitive Advantage White Paper Analyzing Big Data: The Path to Competitive Advantage by Marcia Kaplan Contents Introduction....2 How Big is Big Data?................................................................................

More information

ARE YOUR CUSTOMER SERVICE METRICS TELLING THE TRUTH? Many rank frontline teams unfairly.

ARE YOUR CUSTOMER SERVICE METRICS TELLING THE TRUTH? Many rank frontline teams unfairly. ARE YOUR CUSTOMER SERVICE METRICS TELLING THE TRUTH? Many rank frontline teams unfairly. It s frightening how many companies are measuring and managing customer service with inaccurate metrics. Sandy Rogers,

More information

American Economic Association

American Economic Association American Economic Association Does the Basketball Market Believe in the `Hot Hand,'? Author(s): Colin F. Camerer Source: The American Economic Review, Vol. 79, No. 5 (Dec., 1989), pp. 1257-1261 Published

More information

Internet Search Activity & Crowdsourcing

Internet Search Activity & Crowdsourcing Novel Data Sources for Disease Surveillance and Epidemiology: Internet Search Activity & Crowdsourcing Sumiko Mekaru, DVM, MPVM, MLIS HealthMap, Boston Children s Hospital American College of Epidemiology

More information

How To Understand Data Science

How To Understand Data Science EBPI Epidemiology, Biostatistics and Prevention Institute Big Data Science Torsten Hothorn 2014-03-31 The end of theory The End of Theory: The Data Deluge Makes the Scientific Method Obsolete (Chris Anderson,

More information

Speed bump. Acceleration-ramp blues on the information superhighway

Speed bump. Acceleration-ramp blues on the information superhighway Speed bump Acceleration-ramp blues on the information superhighway The signs on the Infobahn say, Full Speed Ahead... but some bumps in the road might send unlucky travelers hurtling off the edge and into

More information

SOCIAL MEDIA: A NEW DATA SOURCE FOR PUBLIC HEALTH. Mark Dredze Johns Hopkins University Michael Paul, Alex Lamb, David Broniatowski

SOCIAL MEDIA: A NEW DATA SOURCE FOR PUBLIC HEALTH. Mark Dredze Johns Hopkins University Michael Paul, Alex Lamb, David Broniatowski SOCIAL MEDIA: A NEW DATA SOURCE FOR PUBLIC HEALTH Mark Dredze Johns Hopkins University Michael Paul, Alex Lamb, David Broniatowski BIG DATA: SOCIAL MEDIA AND HEALTH Tweets: ~500 million a day Health Tweets:

More information

Algorithmic Trading Session 1 Introduction. Oliver Steinki, CFA, FRM

Algorithmic Trading Session 1 Introduction. Oliver Steinki, CFA, FRM Algorithmic Trading Session 1 Introduction Oliver Steinki, CFA, FRM Outline An Introduction to Algorithmic Trading Definition, Research Areas, Relevance and Applications General Trading Overview Goals

More information

Big Data and Its Role in the Health IT Space Presented by MobileHelp

Big Data and Its Role in the Health IT Space Presented by MobileHelp Big Data and Its Role in the Health IT Space Presented by MobileHelp With excerpts from an interview with Jean Robichaud, CTO of MobileHelp Big Data and Its Role in the Health IT Space Presented by MobileHelp

More information

THE THREE "Rs" OF PREDICTIVE ANALYTICS

THE THREE Rs OF PREDICTIVE ANALYTICS THE THREE "Rs" OF PREDICTIVE As companies commit to big data and data-driven decision making, the demand for predictive analytics has never been greater. While each day seems to bring another story of

More information

T he complete guide to SaaS metrics

T he complete guide to SaaS metrics T he complete guide to SaaS metrics What are the must have metrics each SaaS company should measure? And how to calculate them? World s Simplest Analytics Tool INDEX Introduction 4-5 Acquisition Dashboard

More information

QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS

QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS Huina Mao School of Informatics and Computing Indiana University, Bloomington, USA ECB Workshop on Using Big Data for Forecasting

More information

The Social Impact of Open Data

The Social Impact of Open Data United States of America Federal Trade Commission The Social Impact of Open Data Remarks of Maureen K. Ohlhausen 1 Commissioner, Federal Trade Commission Center for Data Innovation The Social Impact of

More information

What is Big Data? The three(or four) Vs in Big Data In 2013 the total amount of stored information is estimated to be Volume.

What is Big Data? The three(or four) Vs in Big Data In 2013 the total amount of stored information is estimated to be Volume. 8/26/2014 CS581 Big Data - Fall 2014 1 8/26/2014 CS581 Big Data - Fall 2014 2 CS535/CS581A BIG DATA What is Big Data? PART 0. INTRODUCTION 1. INTRODUCTION TO BIG DATA 2. COURSE INTRODUCTION PART 0. INTRODUCTION

More information

How do you Solve a Problem like Analytics?

How do you Solve a Problem like Analytics? How do you Solve a Problem like Analytics? Stewart Robinson Professor of Management Science School of Business and Economics Loughborough University Outline My analytics story What is analytics? The Dianoetic

More information

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»

More information

Overview. Why this policy? Influenza. Vaccine or mask policies. Other approaches Conclusion. epidemiology transmission vaccine

Overview. Why this policy? Influenza. Vaccine or mask policies. Other approaches Conclusion. epidemiology transmission vaccine Overview Why this policy? Influenza epidemiology transmission vaccine Vaccine or mask policies development and implementation Other approaches Conclusion Influenza or mask policy Receive the influenza

More information

The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg

The Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg The Big Picture on Big Data Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg Objective of Talk 1. Deliver a Primer on Big Data. 2. How does this emerging topic apply to Quality? 3.

More information

Big data: are we making a big mistake?

Big data: are we making a big mistake? data science Big data: are we making a big mistake? Economist, journalist and broadcaster Tim Harford delivered the 2014 Significance lecture at the Royal Statistical Society International Conference.

More information

Automated Text Analytics. Testing Manual Processing against Automated Listening

Automated Text Analytics. Testing Manual Processing against Automated Listening Automated Text Analytics Testing Manual Processing against Automated Listening Contents Executive Summary... 3 Why Is Manual Analysis Inaccurate and Automated Text Analysis on Target?... 3 Text Analytics

More information

Introduction to Pattern Recognition

Introduction to Pattern Recognition Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)

More information

CPE 462 VHDL: Simulation and Synthesis

CPE 462 VHDL: Simulation and Synthesis CPE 462 VHDL: Simulation and Synthesis Topic #09 - a) Introduction to random numbers in hardware Fortuna was the goddess of fortune and personification of luck in Roman religion. She might bring good luck

More information

Understanding ETFs. An ETF (exchange traded fund) is a way for you to invest/trade in the market.

Understanding ETFs. An ETF (exchange traded fund) is a way for you to invest/trade in the market. Let s start with the basics. An ETF (exchange traded fund) is a way for you to invest/trade in the market. ETFs are easy to invest in; no broker/financial advisor needed. You can do it yourself. All you

More information

Putting IBM Watson to Work In Healthcare

Putting IBM Watson to Work In Healthcare Martin S. Kohn, MD, MS, FACEP, FACPE Chief Medical Scientist, Care Delivery Systems IBM Research marty.kohn@us.ibm.com Putting IBM Watson to Work In Healthcare 2 SB 1275 Medical data in an electronic or

More information

Cross Validation. Dr. Thomas Jensen Expedia.com

Cross Validation. Dr. Thomas Jensen Expedia.com Cross Validation Dr. Thomas Jensen Expedia.com About Me PhD from ETH Used to be a statistician at Link, now Senior Business Analyst at Expedia Manage a database with 720,000 Hotels that are not on contract

More information

Software for data analysis and accurate forecasting. Forecasts for Guaranteed Profits. The Predictive Analytics Software for Insurance Companies

Software for data analysis and accurate forecasting. Forecasts for Guaranteed Profits. The Predictive Analytics Software for Insurance Companies Software for data analysis and accurate forecasting Forecasts for Guaranteed Profits The Predictive Analytics Software for Insurance Companies About Blue Yonder Blue Yonder, established in 2008, is the

More information

Colleen s Interview With Ivan Kolev

Colleen s Interview With Ivan Kolev Colleen s Interview With Ivan Kolev COLLEEN: [TO MY READERS] Hello, everyone, today I d like to welcome you to my interview with Ivan Kolev (affectionately known as Coolice). Hi there, Ivan, and thank

More information

Lecture 23: Pairs Trading Steven Skiena. http://www.cs.sunysb.edu/ skiena

Lecture 23: Pairs Trading Steven Skiena. http://www.cs.sunysb.edu/ skiena Lecture 23: Pairs Trading Steven Skiena Department of Computer Science State University of New York Stony Brook, NY 11794 4400 http://www.cs.sunysb.edu/ skiena Pairs Trading This strategy was pioneered

More information

What to consider before investing in Recruitment Software

What to consider before investing in Recruitment Software What to consider before investing in Recruitment Software Finding the best recruitment software for your company is one of the biggest decisions a recruitment organisation will make and is one that is

More information

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data

More information

Web content provided for Blue Square Design see www.blue-square.com.au. Home Page

Web content provided for Blue Square Design see www.blue-square.com.au. Home Page Web content provided for Blue Square Design see www.blue-square.com.au Home Page We help your business make new friends When you harness the skills of a graphic and web design studio, there s a certain

More information

How To Predict Stock Price With Mood Based Models

How To Predict Stock Price With Mood Based Models Twitter Mood Predicts the Stock Market Xiao-Jun Zeng School of Computer Science University of Manchester x.zeng@manchester.ac.uk Outline Introduction and Motivation Approach Framework Twitter mood model

More information

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI Disclosures James E. Tcheng, MD, FACC, FSCAI Affiliations / Financial Relationships / Other RWI ACC Chair, Informatics and Health IT Task Force

More information

White Paper. Benefits and Challenges for Today s Online B- to- B Research Methodology. By Pete Cape, Director, Global Knowledge Management.

White Paper. Benefits and Challenges for Today s Online B- to- B Research Methodology. By Pete Cape, Director, Global Knowledge Management. White Paper Benefits and Challenges for Today s Online B- to- B Research Methodology By Pete Cape, Director, Global Knowledge Management March 2015 Survey Sampling International, 2015 ABOUT THE AUTHOR

More information

Stock Market Trends...P1. What Is Automated Trading...P2. Advantages & Disadvantages...P3. Automated Trading Conclusion...P4

Stock Market Trends...P1. What Is Automated Trading...P2. Advantages & Disadvantages...P3. Automated Trading Conclusion...P4 I S SUE 1 VOLUME 1 YE AR 2013 Stock Market Trends...P1 What Is Automated Trading...P2 Advantages & Disadvantages...P3 Automated Trading Conclusion...P4 IS S UE 1 V O L UME 1 YEAR 2013 Automated stock trading

More information

Website Promotion for Voice Actors: How to get the Search Engines to give you Top Billing! By Jodi Krangle http://www.voiceoversandvocals.

Website Promotion for Voice Actors: How to get the Search Engines to give you Top Billing! By Jodi Krangle http://www.voiceoversandvocals. Website Promotion for Voice Actors: How to get the Search Engines to give you Top Billing! By Jodi Krangle http://www.voiceoversandvocals.com Why have a website? If you re busier than you d like to be

More information

How To Improve Data Quality

How To Improve Data Quality Big Data Promises and Pitfalls David J. Hand Imperial College, London and Winton Capital Management July 2015 Policy making in the Big Data Era 1 The world of data is changing Not something which happens

More information

A Pharmacometrician s Perspective for Utilization of Big Data

A Pharmacometrician s Perspective for Utilization of Big Data Is There a Role of Big Data in Drug Development Decisions? ACoP6 Oct. 5, 2015 Crystal City, VA A Pharmacometrician s Perspective for Utilization of Big Data Marc R. Gastonguay, Ph.D. President & CEO Metrum

More information

A/B TESTING. Comparing Data. October 25, 2007 Version 4.0

A/B TESTING. Comparing Data. October 25, 2007 Version 4.0 A/B TESTING Comparing Data October 25, 2007 Version 4.0 CHAPTER 1 1 What is A/B Testing? A/B testing is a means by which you can compare data sets from two different pages, banners, products, etc. A/B

More information

Statistics for BIG data

Statistics for BIG data Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before

More information

SURVEY REPORT DATA SCIENCE SOCIETY 2014

SURVEY REPORT DATA SCIENCE SOCIETY 2014 SURVEY REPORT DATA SCIENCE SOCIETY 2014 TABLE OF CONTENTS Contents About the Initiative 1 Report Summary 2 Participants Info 3 Participants Expertise 6 Suggested Discussion Topics 7 Selected Responses

More information

Final Report by Charlotte Burney, Ananda Costa, Breyuana Smith. Social Media Engagement & Evaluation

Final Report by Charlotte Burney, Ananda Costa, Breyuana Smith. Social Media Engagement & Evaluation Final Report by Charlotte Burney, Ananda Costa, Breyuana Smith Social Media Engagement & Evaluation Table of Contents Executive Summary Insights Questions Explored and Review of Key Actionable Insights

More information

The Adwords Companion

The Adwords Companion The Adwords Companion 5 Essential Insights Google Don t Teach You About Adwords By Steve Gibson www.ppc-services-uk.co.uk Copyright: Steve Gibson, ppc-services-uk.co.uk, 2008 1 Table Of Contents Introduction

More information

Table of Contents 11-step plan on how to get the most out of the strategies backtesting... 2 Step #1... 2 Pick any strategy you like from the "10

Table of Contents 11-step plan on how to get the most out of the strategies backtesting... 2 Step #1... 2 Pick any strategy you like from the 10 Table of Contents 11-step plan on how to get the most out of the strategies backtesting... 2 Step #1... 2 Pick any strategy you like from the "10 simple free strategies" file... 2 Step #2... 2 Get a strict

More information

Quantifying human behaviour using online data. Suzy Moat Data Science Lab Behavioural Science, WBS Suzy.Moat@wbs.ac.uk

Quantifying human behaviour using online data. Suzy Moat Data Science Lab Behavioural Science, WBS Suzy.Moat@wbs.ac.uk Quantifying human behaviour using online data Suzy Moat Data Science Lab Behavioural Science, WBS Suzy.Moat@wbs.ac.uk Data Science Lab Behavioural Science, WBS 1 The advantage of looking forward more Google

More information

Big Data how it changes the way you treat data

Big Data how it changes the way you treat data Big Data how it changes the way you treat data Oct. 2013 Chung-Min Chen Chief Scientist Info. Analysis Research & Services The views and opinions expressed in this presentation are those of the author

More information

PUBLIC HEALTH MEETS SOCIAL MEDIA: MINING HEALTH INFO FROM TWITTER

PUBLIC HEALTH MEETS SOCIAL MEDIA: MINING HEALTH INFO FROM TWITTER PUBLIC HEALTH MEETS SOCIAL MEDIA: MINING HEALTH INFO FROM TWITTER Michael Paul (@mjp39) Johns Hopkins University Crowdsourcing and Human Computation Lecture 18 Learning about the real world through Twitter

More information

Lead Generation in Emerging Markets

Lead Generation in Emerging Markets Lead Generation in Emerging Markets White paper Summary I II III IV V VI VII Which are the emerging markets? Why emerging markets? How does online help? Seasonality Do we know when to profit on what we

More information

Data Mining for Model Creation. Presentation by Paul Below, EDS 2500 NE Plunkett Lane Poulsbo, WA USA 98370 paul.below@eds.

Data Mining for Model Creation. Presentation by Paul Below, EDS 2500 NE Plunkett Lane Poulsbo, WA USA 98370 paul.below@eds. Sept 03-23-05 22 2005 Data Mining for Model Creation Presentation by Paul Below, EDS 2500 NE Plunkett Lane Poulsbo, WA USA 98370 paul.below@eds.com page 1 Agenda Data Mining and Estimating Model Creation

More information

Consultant Led Workshops Instant consulting on major changes or new skills sets. e-learning Self paced, learner driven development

Consultant Led Workshops Instant consulting on major changes or new skills sets. e-learning Self paced, learner driven development j s@ THE SALES MANAGER S COACHING ROLE Whilst most sales managers recognise their role in developing and training their sales people, very few approach it rigorously. Yet, without this critical activity,

More information

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU PITFALLS IN TIME SERIES ANALYSIS Cliff Hurvich Stern School, NYU The t -Test If x 1,..., x n are independent and identically distributed with mean 0, and n is not too small, then t = x 0 s n has a standard

More information

Cloud Computing with Microsoft Azure

Cloud Computing with Microsoft Azure Cloud Computing with Microsoft Azure Michael Stiefel www.reliablesoftware.com development@reliablesoftware.com http://www.reliablesoftware.com/dasblog/default.aspx Azure's Three Flavors Azure Operating

More information

Uncovering Value in Healthcare Data with Cognitive Analytics. Christine Livingston, Perficient Ken Dugan, IBM

Uncovering Value in Healthcare Data with Cognitive Analytics. Christine Livingston, Perficient Ken Dugan, IBM Uncovering Value in Healthcare Data with Cognitive Analytics Christine Livingston, Perficient Ken Dugan, IBM Conflict of Interest Christine Livingston Ken Dugan Has no real or apparent conflicts of interest

More information

Sales Lead Brokerage Profit Plan Bonus Document

Sales Lead Brokerage Profit Plan Bonus Document Sales Lead Brokerage Profit Plan Bonus Document Introduction Hello and thanks for ordering the Sales Lead Brokerage Profit Plan through the Money Makers Reviewed website. As you ll know if you read my

More information

Employee Surveys: Four Do s and Don ts. Alec Levenson

Employee Surveys: Four Do s and Don ts. Alec Levenson Employee Surveys: Four Do s and Don ts Alec Levenson Center for Effective Organizations University of Southern California 3415 S. Figueroa Street, DCC 200 Los Angeles, CA 90089 USA Phone: 1-213-740-9814

More information

International Equities: Another Turn of the Wheel

International Equities: Another Turn of the Wheel International Equities: Another Turn of the Wheel May 6, 2015 by David Ruff of Forward Investing Non-U.S. equities have recently surged after a long spell of underperformance. Does this signal a new market

More information

Your Questions from Chapter 1. General Psychology PSYC 200. Your Questions from Chapter 1. Your Questions from Chapter 1. Science is a Method.

Your Questions from Chapter 1. General Psychology PSYC 200. Your Questions from Chapter 1. Your Questions from Chapter 1. Science is a Method. General Psychology PSYC 200 Methods of Psychology Your Questions from Chapter 1 Which names of people are important to remember? In what way are we going to be tested on this material? What exactly did

More information

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1

Basic research methods. Basic research methods. Question: BRM.2. Question: BRM.1 BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects

More information

Proceedings of the 9th WSEAS International Conference on APPLIED COMPUTER SCIENCE

Proceedings of the 9th WSEAS International Conference on APPLIED COMPUTER SCIENCE Automated Futures Trading Environment Effect on the Decision Making PETR TUCNIK Department of Information Technologies University of Hradec Kralove Rokitanskeho 62, 500 02 Hradec Kralove CZECH REPUBLIC

More information

Currency Trading and Forex 100 Success Secrets 100 Most Asked Questions on becoming a Successful Currency Trader

Currency Trading and Forex 100 Success Secrets 100 Most Asked Questions on becoming a Successful Currency Trader Currency Trading and Forex 100 Success Secrets 100 Most Asked Questions on becoming a Successful Currency Trader Copyright 2008 Currency Trading and Forex 100 Success Secrets Notice of rights All rights

More information

Top tips for online campaign optimisation

Top tips for online campaign optimisation Contents 1. The best laid plans... 3 2. Playing to the right audience... 3 3. Quality not quantity... 4 4. You only get out what you put in.. 4 5. Less is more... 5 6. Did you get my message?... 6 7. If

More information

Ridiculously Good Outsourcing. The Monetization of Big Data: Made Possible By Humans. www.taskus.com info@taskus.com (888) 400 - TASK

Ridiculously Good Outsourcing. The Monetization of Big Data: Made Possible By Humans. www.taskus.com info@taskus.com (888) 400 - TASK From The TaskUs Library The Monetization of Big Data: Made Possible By Humans Ridiculously Good Outsourcing www.taskus.com info@taskus.com (888) 400 - TASK The Monetization of Big Data: Made Possible by

More information

Seven Things You Must Know Before Hiring a DUI Lawyer

Seven Things You Must Know Before Hiring a DUI Lawyer Seven Things You Must Know Before Hiring a DUI Lawyer 1 Introduction Some people don t quite understand the severity of getting a DUI. In many cases, your license is instantly taken away and you won t

More information

Holiday Fraud Myths. How They Leave Retailers Vulnerable

Holiday Fraud Myths. How They Leave Retailers Vulnerable Holiday Fraud Myths How They Leave Retailers Vulnerable Table of Contents 03 04 06 08 10 12 14 Introduction Myth #1 Digital Gift Cards Myth #2 Distance, Dollar and Expedite Myth #3 Machine vs. Manual Review

More information

Doing Multidisciplinary Research in Data Science

Doing Multidisciplinary Research in Data Science Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University aadamov@qu.edu.az http://ce.qu.edu.az/~aadamov 16 May

More information

Our Data & Methodology. Understanding the Digital World by Turning Data into Insights

Our Data & Methodology. Understanding the Digital World by Turning Data into Insights Our Data & Methodology Understanding the Digital World by Turning Data into Insights Understanding Today s Digital World SimilarWeb provides data and insights to help businesses make better decisions,

More information

Data Centric Computing Revisited

Data Centric Computing Revisited Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data

More information

Exercises. Updated images, review exercises, and homeworks can be found on the Internet at: http://www.anderson.ucla.edu/faculty/edward.

Exercises. Updated images, review exercises, and homeworks can be found on the Internet at: http://www.anderson.ucla.edu/faculty/edward. Preface The story of this book began with my difficult transition from teaching international economics and econometrics in Economics Ph.D. programs at Harvard and UCLA to teaching in the MBA programs

More information

POLLING STANDARDS. The following standards were developed by a committee of editors and reporters and should be adhered to when using poll results.

POLLING STANDARDS. The following standards were developed by a committee of editors and reporters and should be adhered to when using poll results. ! POLLING STANDARDS June, 2006 The following standards were developed by a committee of editors and reporters and should be adhered to when using poll results. OVERVIEW Reporting on polls is no different

More information

The Data Engineer. Mike Tamir Chief Science Officer Galvanize. Steven Miller Global Leader Academic Programs IBM Analytics

The Data Engineer. Mike Tamir Chief Science Officer Galvanize. Steven Miller Global Leader Academic Programs IBM Analytics The Data Engineer Mike Tamir Chief Science Officer Galvanize Steven Miller Global Leader Academic Programs IBM Analytics Alessandro Gagliardi Lead Faculty Galvanize Businesses are quickly realizing that

More information

Google AdWords Remarketing

Google AdWords Remarketing Google AdWords Remarketing AdWords remarketing is not only great for driving visitors back to your website to convert but is also great at improving your branding which in effect increases conversion and

More information

A U T H O R S : G a n e s h S r i n i v a s a n a n d S a n d e e p W a g h Social Media Analytics

A U T H O R S : G a n e s h S r i n i v a s a n a n d S a n d e e p W a g h Social Media Analytics contents A U T H O R S : G a n e s h S r i n i v a s a n a n d S a n d e e p W a g h Social Media Analytics Abstract... 2 Need of Social Content Analytics... 3 Social Media Content Analytics... 4 Inferences

More information

Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks

Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks Imre Varga Abstract In this paper I propose a novel method to model real online social networks where the growing

More information

Counter Expertise Review on the TNO Security Analysis of the Dutch OV-Chipkaart. OV-Chipkaart Security Issues Tutorial for Non-Expert Readers

Counter Expertise Review on the TNO Security Analysis of the Dutch OV-Chipkaart. OV-Chipkaart Security Issues Tutorial for Non-Expert Readers Counter Expertise Review on the TNO Security Analysis of the Dutch OV-Chipkaart OV-Chipkaart Security Issues Tutorial for Non-Expert Readers The current debate concerning the OV-Chipkaart security was

More information

Getting to Know Big Data

Getting to Know Big Data Getting to Know Big Data Dr. Putchong Uthayopas Department of Computer Engineering, Faculty of Engineering, Kasetsart University Email: putchong@ku.th Information Tsunami Rapid expansion of Smartphone

More information