Opportunities and Limitations of Big Data
|
|
- Rosaline Price
- 8 years ago
- Views:
Transcription
1 Opportunities and Limitations of Big Data Karl Schmedders University of Zurich and Swiss Finance Institute «Big Data: Little Ethics?» HWZ-Darden-Conference June 4, 2015 On fortune.com this morning: Apple's Tim Cook launches blistering attack on Facebook and Google Blistering speech stresses Apple as a company that doesn t want your data. Source: 2 1
2 Old Rules vs. Modern Myths Sample Size The n=all myth Causation vs. Correlation The End of Theory myth Model Fitting Machine Learning dangers 3 Tracking Influenza with Google Flu Trends Google Flu Trends: Track the spread of influenza across the US through analyzing the top 50 million search terms Centers for Disease Control and Prevention: Track the spread of influenza across the US through analyzing the reports from doctors 4 2
3 Advantages Google is much faster than the CDC Google Flu Trends: 1 day CDC: > 1 week Flu Trends based on millions of people (n=all, n=infinity) Approach us quick, accurate, cheap and theory-free 5 When Google got flu wrong (Nature, 2013) Massive overestimation of flu season 2012/13 Earlier problems in
4 What went wrong? Widespread media coverage of the severity of US flu season Reports may have triggered many flu-related searches by people who were not ill Well-known old problem: wrong sample Sample Bias 7 Classical Example: U.S. Presidential Election 1936 Forecasts of election outcomes: The Literary Digest (sample: 2.4 million people) vs. George Gallup (sample: people) Prediction Literary Digest: 43% Roosevelt Prediction George Gallup: 61% Roosevelt Actual outcome: 61% Roosevelt The Literary Digest sent out forms to people from a list of automobile registrations and telephone directories Who had a phone in 1936? 8 4
5 Still relevant today Why were the Israeli election polls so wrong? "The Internet does not represent the state of Israel and the people of Israel," he said, referring to modern statistical methods. "It represents panels, and the panels are biased strongly to the center - - Tel Aviv, better-educated, more participants in this kind of conversation. (Avi Degani, Tel Aviv University) 9 Why Sampling is Still Very Important Self-reported user data is often a biased sample Growth in noise is swamping the signal that businesses hope to find in the data n = all (or n = infinity ) is wishful thinking 10 5
6 Available Data Sample Objective: Unbiased analysis Problem: User data likely biased Solution: Do NOT use all your user data (may be too much anyway) Sample from available data Check for representativeness Target Population 11 Old Rules vs. Modern Myths Sample Size The n=all myth Causation vs. Correlation The End of Theory myth Model Fitting Machine Learning dangers 12 6
7 The End of Theory? The End of Theory: The Data Deluge Makes the Scientific Method Obsolete (Chris Anderson, Wired Magazine, 2008) All models are wrong, and increasingly you can succeed without them. (Peter Norvig, Google s research director) [F]aced with massive data, this approach to science hypothesize, model, test is becoming obsolete. 13 Did you really mean that? There is now a better way. Petabytes allow us to say: Correlation is enough. We can stop looking for models. We can analyze the data without hypotheses about what it might show. We can throw the numbers into the biggest computing clusters the world has ever seen and let statistical algorithms find patterns where science cannot. Referring to The End of Theory : This revolutionary notion has now entered not just the popular imagination, but also the research practices of corporations, states, journalists and academics. (Mark Graham, The Guardian, 2012) 14 7
8 Spurious Correlation Source: Correlation or Causation (Business Week, 2011) 15 And there is much more Super Bowl and Dow Jones Industrial Average and Superbowl correctly forecasted sign of annual DJIA return Hot-hand fallacy The Hot Hand in Basketball: On the Misperception of Random Sequences. Gilovich, Tversky, Vallone. Cognitive psychology
9 Lack of models Without a model we cannot distinguish between spurious and meaningful correlations Lack of models makes data less useful than it might be; big data insights will be limited Nate Silver describes The End of Theory as categorically the wrong attitude Sound theoretical understanding of statistical application is absolutely necessary 17 Old Rules vs. Modern Myths Sample Size The n=all myth Causation vs. Correlation The End of Theory myth Model Fitting Machine Learning dangers 18 9
10 Top 500 supercomputers are getting faster Source: 19 while storage of data is getting cheaper Source:
11 Big Data and Business Analytics More storage More data Faster computers New sophisticated methods 21 Aside: Reproducibility of Statistical Results Numerous cases in the biomedical field of statistical results from clinical trials that cannot be reproduced in separate studies In 2011, Bayer researchers reported that they were able to reproduce the results of only 17 of 67 published studies they examined In 2012, Amgen researchers reported that they were able to reproduce the results of only 6 of 53 published cancer studies In 2014, a review of Tamiflu found that while it made flu symptoms disappear a bit sooner, it did not stop serious complications or keep people out of the hospital 22 11
12 Fundamental Flaw Publication of only successful trials introduces bias Success in study may have been purely accidental With enough trials, at least one will sooner or later be successful 23 How to become a Guru to some Investors Financial advisor sends letters to 10,240 = 10 x 2^10 potential clients, with half (5120) predicting a particular stock will go up, and the other half predicting it will go down One month later, the advisor sends letters only to the 5120 investors who were previously sent the correct prediction, with half (2560) letters predicting a certain security will go up, and the other half predicting it will go down The advisor continues this process for 10 months
13 in a fraudulent way Ten investors will have been sent ten consecutive correct predictions! They may be so impressed by the advisor's ten consecutive spot-on predictions that will entrust to him/her all of their assets... Final ten investors are unaware of the thousands of wrong predictions 25 Statistical Overfitting I remember my friend Johnny von Neumann used to say, with four parameters I can fit an elephant, and with five I can make him wiggle his trunk. (Enrico Fermi) If you torture the data long enough, it will confess. (Ronald H. Coase) 26 13
14 Backtest Overfitting in Finance Backtesting of an investment strategy: use historical market data to assess performance Backtest overfitting: develop a trading strategy (``model ) that is sufficiently complex and has many degrees of freedom to fit the data almost perfectly Computers can analyze millions or billions of variations of a strategy, so sooner or later you will find a great match 27 Strategy vs. Underlying Asset (Pseudo random) Source: (Thanks to David H. Bailey) 28 14
15 After Iteration After Iteration
16 After Iteration After convergence 32 16
17 and on a new data set Source: (Thanks to David H. Bailey) 33 Too much of a good thing? David H. Bailey (LBNL & UC Davis) [C]omputers operating on big data can generate nonsense faster than ever before! 34 17
18 Old Rules vs. Modern Myths Sample Size The n=all myth Causation vs. Correlation The End of Theory myth Model Fitting Machine Learning dangers 35 18
Healthcare data analytics. Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw
Healthcare data analytics Da-Wei Wang Institute of Information Science wdw@iis.sinica.edu.tw Outline Data Science Enabling technologies Grand goals Issues Google flu trend Privacy Conclusion Analytics
More informationStatistics, Big Data and Data Science!?
Statistics, Big Data and Data Science!? Prof. Dr. Göran Kauermann Ludwig-Maximilians-Universität Munich, Germany Statistics, Big Data and Data Science Statistics Founded around 1900 with the seminal work
More informationSustaining Mind and the brand machine relevance with the connected consumer
Research excellence Sustaining Mind and the brand machine relevance with the connected consumer It may be immense, fast and mind-bendingly varied. But researchers must remember: Big Data can no more speak
More informationWhy Big Data is not Big Hype in Economics and Finance?
Why Big Data is not Big Hype in Economics and Finance? Ariel M. Viale Marshall E. Rinker School of Business Palm Beach Atlantic University West Palm Beach, April 2015 1 The Big Data Hype 2 Big Data as
More informationCollaborations between Official Statistics and Academia in the Era of Big Data
Collaborations between Official Statistics and Academia in the Era of Big Data World Statistics Day October 20-21, 2015 Budapest Vijay Nair University of Michigan Past-President of ISI vnn@umich.edu What
More informationBig Data Hope or Hype?
Big Data Hope or Hype? David J. Hand Imperial College, London and Winton Capital Management Big data science, September 2013 1 Google trends on big data Google search 1 Sept 2013: 1.6 billion hits on big
More informationIntroduction to Big Data! with Apache Spark" UC#BERKELEY#
Introduction to Big Data! with Apache Spark" UC#BERKELEY# Course Goals" This Lecture" Brief History of Data Analysis" Big Data and Data Science Why All the Excitement?" Where Big Data Comes From" Course
More informationExtreme Computing. Big Data. Stratis Viglas. School of Informatics University of Edinburgh sviglas@inf.ed.ac.uk. Stratis Viglas Extreme Computing 1
Extreme Computing Big Data Stratis Viglas School of Informatics University of Edinburgh sviglas@inf.ed.ac.uk Stratis Viglas Extreme Computing 1 Petabyte Age Big Data Challenges Stratis Viglas Extreme Computing
More information!!! The Fallacy of Big Data! Brian Fine and Con Menictas!
!!! The Fallacy of Big Data! Brian Fine and Con Menictas! 1! What is Big Data?! Big data is a vague term for a massive phenomenon that has rapidly become an obsession with entrepreneurs, scientists, governments
More informationStatistical Challenges with Big Data in Management Science
Statistical Challenges with Big Data in Management Science Arnab Kumar Laha Indian Institute of Management Ahmedabad Analytics vs Reporting Competitive Advantage Reporting Prescriptive Analytics (Decision
More informationBig Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料
Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 美 國 13 歲 學 生 用 Big Data 找 出 霸 淩 熱 點 Puri 架 設 網 站 Bullyvention, 藉 由 分 析 Twitter 上 找 出 提 到 跟 霸 凌 相 關 的 詞, 搭 配 地 理 位 置
More informationFinding Patterns the Challenge of Big Data 1
Finding Patterns The Challenge of Big Data David J. Hand Imperial College, London and Winton Capital Management November 2015 Finding Patterns the Challenge of Big Data 1 we are on the cusp of a tremendous
More informationIs Big Data Bigger than a Bread Box?
Is Big Data Bigger than a Bread Box? Bradley Strauss Chitika, Inc. January 14, 2014 The Basic Problem The basic problem we face is simple to state: the big in big data is not well-defined, and perhaps
More informationHow can you unlock the value in real-world data? A novel approach to predictive analytics could make the difference.
How can you unlock the value in real-world data? A novel approach to predictive analytics could make the difference. What if you could diagnose patients sooner, start treatment earlier, and prevent symptoms
More informationPSYCHOLOGY PROGRAM LEARNING GOALS AND OUTCOMES BY COURSE LISTING
PSYCHOLOGY PROGRAM LEARNING GOALS AND OUTCOMES BY COURSE LISTING Psychology 1010: General Psychology Learning Goals and Outcomes LEARNING GOAL 1: KNOWLEDGE BASE OF PSYCHOLOGY Demonstrate familiarity with
More informationGetting personal: The future of communications
Getting personal: The future of communications Neil Wholey LGinsight and Head of Research and Customer Insight at Westminster City Council @neilwholey Accuracy of opinion polls http://thefutureplace.type
More informationCutting Through the Myths, Hype and Confusion Real Answers, Real Facts about SEO
How to Decipher Fact from Fiction Debunking SEO s Greatest Myths By: Jennifer Horowitz, Director of Marketing www.ecombuffet.com jennifer@ecombuffet.com 562-592-5347 This paper is designed to educate you
More informationBig Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index
Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index Rickard Nyman *, Paul Ormerod Centre for the Study of Decision Making Under Uncertainty,
More informationBig Data Big Knowledge?
EBPI Epidemiology, Biostatistics and Prevention Institute Big Data Big Knowledge? Torsten Hothorn 2015-03-06 The end of theory The End of Theory: The Data Deluge Makes the Scientific Method Obsolete (Chris
More informationBIG DATA FUNDAMENTALS
BIG DATA FUNDAMENTALS Timeframe Minimum of 30 hours Use the concepts of volume, velocity, variety, veracity and value to define big data Learning outcomes Critically evaluate the need for big data management
More informationStatistical Fallacies: Lying to Ourselves and Others
Statistical Fallacies: Lying to Ourselves and Others "There are three kinds of lies: lies, damned lies, and statistics. Benjamin Disraeli +/- Benjamin Disraeli Introduction Statistics, assuming they ve
More informationAnalytics in Days White Paper and Business Case
Analytics in Days White Paper and Business Case Analytics Navigating the Maze Analytics is hot. It seems virtually everyone needs or wants it, but many still aren t sure what the business case is or how
More informationPreparing for Big Data for SoC/IC Design By Dean Drako, IC Manage President & CEO DAC 2014 Multi-Site Design Panel Opening Remarks (edited transcript)
Preparing for Big Data for SoC/IC Design By Dean Drako, IC Manage President & CEO DAC 2014 Multi-Site Design Panel Opening Remarks (edited transcript) One of the things we are working on at IC manage is
More informationCharacterizing Task Usage Shapes in Google s Compute Clusters
Characterizing Task Usage Shapes in Google s Compute Clusters Qi Zhang 1, Joseph L. Hellerstein 2, Raouf Boutaba 1 1 University of Waterloo, 2 Google Inc. Introduction Cloud computing is becoming a key
More informationAnalyzing Big Data: The Path to Competitive Advantage
White Paper Analyzing Big Data: The Path to Competitive Advantage by Marcia Kaplan Contents Introduction....2 How Big is Big Data?................................................................................
More informationARE YOUR CUSTOMER SERVICE METRICS TELLING THE TRUTH? Many rank frontline teams unfairly.
ARE YOUR CUSTOMER SERVICE METRICS TELLING THE TRUTH? Many rank frontline teams unfairly. It s frightening how many companies are measuring and managing customer service with inaccurate metrics. Sandy Rogers,
More informationAmerican Economic Association
American Economic Association Does the Basketball Market Believe in the `Hot Hand,'? Author(s): Colin F. Camerer Source: The American Economic Review, Vol. 79, No. 5 (Dec., 1989), pp. 1257-1261 Published
More informationInternet Search Activity & Crowdsourcing
Novel Data Sources for Disease Surveillance and Epidemiology: Internet Search Activity & Crowdsourcing Sumiko Mekaru, DVM, MPVM, MLIS HealthMap, Boston Children s Hospital American College of Epidemiology
More informationHow To Understand Data Science
EBPI Epidemiology, Biostatistics and Prevention Institute Big Data Science Torsten Hothorn 2014-03-31 The end of theory The End of Theory: The Data Deluge Makes the Scientific Method Obsolete (Chris Anderson,
More informationSpeed bump. Acceleration-ramp blues on the information superhighway
Speed bump Acceleration-ramp blues on the information superhighway The signs on the Infobahn say, Full Speed Ahead... but some bumps in the road might send unlucky travelers hurtling off the edge and into
More informationSOCIAL MEDIA: A NEW DATA SOURCE FOR PUBLIC HEALTH. Mark Dredze Johns Hopkins University Michael Paul, Alex Lamb, David Broniatowski
SOCIAL MEDIA: A NEW DATA SOURCE FOR PUBLIC HEALTH Mark Dredze Johns Hopkins University Michael Paul, Alex Lamb, David Broniatowski BIG DATA: SOCIAL MEDIA AND HEALTH Tweets: ~500 million a day Health Tweets:
More informationAlgorithmic Trading Session 1 Introduction. Oliver Steinki, CFA, FRM
Algorithmic Trading Session 1 Introduction Oliver Steinki, CFA, FRM Outline An Introduction to Algorithmic Trading Definition, Research Areas, Relevance and Applications General Trading Overview Goals
More informationBig Data and Its Role in the Health IT Space Presented by MobileHelp
Big Data and Its Role in the Health IT Space Presented by MobileHelp With excerpts from an interview with Jean Robichaud, CTO of MobileHelp Big Data and Its Role in the Health IT Space Presented by MobileHelp
More informationTHE THREE "Rs" OF PREDICTIVE ANALYTICS
THE THREE "Rs" OF PREDICTIVE As companies commit to big data and data-driven decision making, the demand for predictive analytics has never been greater. While each day seems to bring another story of
More informationT he complete guide to SaaS metrics
T he complete guide to SaaS metrics What are the must have metrics each SaaS company should measure? And how to calculate them? World s Simplest Analytics Tool INDEX Introduction 4-5 Acquisition Dashboard
More informationQUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS
QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS Huina Mao School of Informatics and Computing Indiana University, Bloomington, USA ECB Workshop on Using Big Data for Forecasting
More informationThe Social Impact of Open Data
United States of America Federal Trade Commission The Social Impact of Open Data Remarks of Maureen K. Ohlhausen 1 Commissioner, Federal Trade Commission Center for Data Innovation The Social Impact of
More informationWhat is Big Data? The three(or four) Vs in Big Data In 2013 the total amount of stored information is estimated to be Volume.
8/26/2014 CS581 Big Data - Fall 2014 1 8/26/2014 CS581 Big Data - Fall 2014 2 CS535/CS581A BIG DATA What is Big Data? PART 0. INTRODUCTION 1. INTRODUCTION TO BIG DATA 2. COURSE INTRODUCTION PART 0. INTRODUCTION
More informationHow do you Solve a Problem like Analytics?
How do you Solve a Problem like Analytics? Stewart Robinson Professor of Management Science School of Business and Economics Loughborough University Outline My analytics story What is analytics? The Dianoetic
More informationDanny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank
Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»
More informationOverview. Why this policy? Influenza. Vaccine or mask policies. Other approaches Conclusion. epidemiology transmission vaccine
Overview Why this policy? Influenza epidemiology transmission vaccine Vaccine or mask policies development and implementation Other approaches Conclusion Influenza or mask policy Receive the influenza
More informationThe Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg
The Big Picture on Big Data Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg Objective of Talk 1. Deliver a Primer on Big Data. 2. How does this emerging topic apply to Quality? 3.
More informationBig data: are we making a big mistake?
data science Big data: are we making a big mistake? Economist, journalist and broadcaster Tim Harford delivered the 2014 Significance lecture at the Royal Statistical Society International Conference.
More informationAutomated Text Analytics. Testing Manual Processing against Automated Listening
Automated Text Analytics Testing Manual Processing against Automated Listening Contents Executive Summary... 3 Why Is Manual Analysis Inaccurate and Automated Text Analysis on Target?... 3 Text Analytics
More informationIntroduction to Pattern Recognition
Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)
More informationCPE 462 VHDL: Simulation and Synthesis
CPE 462 VHDL: Simulation and Synthesis Topic #09 - a) Introduction to random numbers in hardware Fortuna was the goddess of fortune and personification of luck in Roman religion. She might bring good luck
More informationUnderstanding ETFs. An ETF (exchange traded fund) is a way for you to invest/trade in the market.
Let s start with the basics. An ETF (exchange traded fund) is a way for you to invest/trade in the market. ETFs are easy to invest in; no broker/financial advisor needed. You can do it yourself. All you
More informationPutting IBM Watson to Work In Healthcare
Martin S. Kohn, MD, MS, FACEP, FACPE Chief Medical Scientist, Care Delivery Systems IBM Research marty.kohn@us.ibm.com Putting IBM Watson to Work In Healthcare 2 SB 1275 Medical data in an electronic or
More informationCross Validation. Dr. Thomas Jensen Expedia.com
Cross Validation Dr. Thomas Jensen Expedia.com About Me PhD from ETH Used to be a statistician at Link, now Senior Business Analyst at Expedia Manage a database with 720,000 Hotels that are not on contract
More informationSoftware for data analysis and accurate forecasting. Forecasts for Guaranteed Profits. The Predictive Analytics Software for Insurance Companies
Software for data analysis and accurate forecasting Forecasts for Guaranteed Profits The Predictive Analytics Software for Insurance Companies About Blue Yonder Blue Yonder, established in 2008, is the
More informationColleen s Interview With Ivan Kolev
Colleen s Interview With Ivan Kolev COLLEEN: [TO MY READERS] Hello, everyone, today I d like to welcome you to my interview with Ivan Kolev (affectionately known as Coolice). Hi there, Ivan, and thank
More informationLecture 23: Pairs Trading Steven Skiena. http://www.cs.sunysb.edu/ skiena
Lecture 23: Pairs Trading Steven Skiena Department of Computer Science State University of New York Stony Brook, NY 11794 4400 http://www.cs.sunysb.edu/ skiena Pairs Trading This strategy was pioneered
More informationWhat to consider before investing in Recruitment Software
What to consider before investing in Recruitment Software Finding the best recruitment software for your company is one of the biggest decisions a recruitment organisation will make and is one that is
More informationTutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA
Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data
More informationWeb content provided for Blue Square Design see www.blue-square.com.au. Home Page
Web content provided for Blue Square Design see www.blue-square.com.au Home Page We help your business make new friends When you harness the skills of a graphic and web design studio, there s a certain
More informationHow To Predict Stock Price With Mood Based Models
Twitter Mood Predicts the Stock Market Xiao-Jun Zeng School of Computer Science University of Manchester x.zeng@manchester.ac.uk Outline Introduction and Motivation Approach Framework Twitter mood model
More informationSo Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI
So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI Disclosures James E. Tcheng, MD, FACC, FSCAI Affiliations / Financial Relationships / Other RWI ACC Chair, Informatics and Health IT Task Force
More informationWhite Paper. Benefits and Challenges for Today s Online B- to- B Research Methodology. By Pete Cape, Director, Global Knowledge Management.
White Paper Benefits and Challenges for Today s Online B- to- B Research Methodology By Pete Cape, Director, Global Knowledge Management March 2015 Survey Sampling International, 2015 ABOUT THE AUTHOR
More informationStock Market Trends...P1. What Is Automated Trading...P2. Advantages & Disadvantages...P3. Automated Trading Conclusion...P4
I S SUE 1 VOLUME 1 YE AR 2013 Stock Market Trends...P1 What Is Automated Trading...P2 Advantages & Disadvantages...P3 Automated Trading Conclusion...P4 IS S UE 1 V O L UME 1 YEAR 2013 Automated stock trading
More informationWebsite Promotion for Voice Actors: How to get the Search Engines to give you Top Billing! By Jodi Krangle http://www.voiceoversandvocals.
Website Promotion for Voice Actors: How to get the Search Engines to give you Top Billing! By Jodi Krangle http://www.voiceoversandvocals.com Why have a website? If you re busier than you d like to be
More informationHow To Improve Data Quality
Big Data Promises and Pitfalls David J. Hand Imperial College, London and Winton Capital Management July 2015 Policy making in the Big Data Era 1 The world of data is changing Not something which happens
More informationA Pharmacometrician s Perspective for Utilization of Big Data
Is There a Role of Big Data in Drug Development Decisions? ACoP6 Oct. 5, 2015 Crystal City, VA A Pharmacometrician s Perspective for Utilization of Big Data Marc R. Gastonguay, Ph.D. President & CEO Metrum
More informationA/B TESTING. Comparing Data. October 25, 2007 Version 4.0
A/B TESTING Comparing Data October 25, 2007 Version 4.0 CHAPTER 1 1 What is A/B Testing? A/B testing is a means by which you can compare data sets from two different pages, banners, products, etc. A/B
More informationStatistics for BIG data
Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before
More informationSURVEY REPORT DATA SCIENCE SOCIETY 2014
SURVEY REPORT DATA SCIENCE SOCIETY 2014 TABLE OF CONTENTS Contents About the Initiative 1 Report Summary 2 Participants Info 3 Participants Expertise 6 Suggested Discussion Topics 7 Selected Responses
More informationFinal Report by Charlotte Burney, Ananda Costa, Breyuana Smith. Social Media Engagement & Evaluation
Final Report by Charlotte Burney, Ananda Costa, Breyuana Smith Social Media Engagement & Evaluation Table of Contents Executive Summary Insights Questions Explored and Review of Key Actionable Insights
More informationThe Adwords Companion
The Adwords Companion 5 Essential Insights Google Don t Teach You About Adwords By Steve Gibson www.ppc-services-uk.co.uk Copyright: Steve Gibson, ppc-services-uk.co.uk, 2008 1 Table Of Contents Introduction
More informationTable of Contents 11-step plan on how to get the most out of the strategies backtesting... 2 Step #1... 2 Pick any strategy you like from the "10
Table of Contents 11-step plan on how to get the most out of the strategies backtesting... 2 Step #1... 2 Pick any strategy you like from the "10 simple free strategies" file... 2 Step #2... 2 Get a strict
More informationQuantifying human behaviour using online data. Suzy Moat Data Science Lab Behavioural Science, WBS Suzy.Moat@wbs.ac.uk
Quantifying human behaviour using online data Suzy Moat Data Science Lab Behavioural Science, WBS Suzy.Moat@wbs.ac.uk Data Science Lab Behavioural Science, WBS 1 The advantage of looking forward more Google
More informationBig Data how it changes the way you treat data
Big Data how it changes the way you treat data Oct. 2013 Chung-Min Chen Chief Scientist Info. Analysis Research & Services The views and opinions expressed in this presentation are those of the author
More informationPUBLIC HEALTH MEETS SOCIAL MEDIA: MINING HEALTH INFO FROM TWITTER
PUBLIC HEALTH MEETS SOCIAL MEDIA: MINING HEALTH INFO FROM TWITTER Michael Paul (@mjp39) Johns Hopkins University Crowdsourcing and Human Computation Lecture 18 Learning about the real world through Twitter
More informationLead Generation in Emerging Markets
Lead Generation in Emerging Markets White paper Summary I II III IV V VI VII Which are the emerging markets? Why emerging markets? How does online help? Seasonality Do we know when to profit on what we
More informationData Mining for Model Creation. Presentation by Paul Below, EDS 2500 NE Plunkett Lane Poulsbo, WA USA 98370 paul.below@eds.
Sept 03-23-05 22 2005 Data Mining for Model Creation Presentation by Paul Below, EDS 2500 NE Plunkett Lane Poulsbo, WA USA 98370 paul.below@eds.com page 1 Agenda Data Mining and Estimating Model Creation
More informationConsultant Led Workshops Instant consulting on major changes or new skills sets. e-learning Self paced, learner driven development
j s@ THE SALES MANAGER S COACHING ROLE Whilst most sales managers recognise their role in developing and training their sales people, very few approach it rigorously. Yet, without this critical activity,
More informationPITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU
PITFALLS IN TIME SERIES ANALYSIS Cliff Hurvich Stern School, NYU The t -Test If x 1,..., x n are independent and identically distributed with mean 0, and n is not too small, then t = x 0 s n has a standard
More informationCloud Computing with Microsoft Azure
Cloud Computing with Microsoft Azure Michael Stiefel www.reliablesoftware.com development@reliablesoftware.com http://www.reliablesoftware.com/dasblog/default.aspx Azure's Three Flavors Azure Operating
More informationUncovering Value in Healthcare Data with Cognitive Analytics. Christine Livingston, Perficient Ken Dugan, IBM
Uncovering Value in Healthcare Data with Cognitive Analytics Christine Livingston, Perficient Ken Dugan, IBM Conflict of Interest Christine Livingston Ken Dugan Has no real or apparent conflicts of interest
More informationSales Lead Brokerage Profit Plan Bonus Document
Sales Lead Brokerage Profit Plan Bonus Document Introduction Hello and thanks for ordering the Sales Lead Brokerage Profit Plan through the Money Makers Reviewed website. As you ll know if you read my
More informationEmployee Surveys: Four Do s and Don ts. Alec Levenson
Employee Surveys: Four Do s and Don ts Alec Levenson Center for Effective Organizations University of Southern California 3415 S. Figueroa Street, DCC 200 Los Angeles, CA 90089 USA Phone: 1-213-740-9814
More informationInternational Equities: Another Turn of the Wheel
International Equities: Another Turn of the Wheel May 6, 2015 by David Ruff of Forward Investing Non-U.S. equities have recently surged after a long spell of underperformance. Does this signal a new market
More informationYour Questions from Chapter 1. General Psychology PSYC 200. Your Questions from Chapter 1. Your Questions from Chapter 1. Science is a Method.
General Psychology PSYC 200 Methods of Psychology Your Questions from Chapter 1 Which names of people are important to remember? In what way are we going to be tested on this material? What exactly did
More informationBasic research methods. Basic research methods. Question: BRM.2. Question: BRM.1
BRM.1 The proportion of individuals with a particular disease who die from that condition is called... BRM.2 This study design examines factors that may contribute to a condition by comparing subjects
More informationProceedings of the 9th WSEAS International Conference on APPLIED COMPUTER SCIENCE
Automated Futures Trading Environment Effect on the Decision Making PETR TUCNIK Department of Information Technologies University of Hradec Kralove Rokitanskeho 62, 500 02 Hradec Kralove CZECH REPUBLIC
More informationCurrency Trading and Forex 100 Success Secrets 100 Most Asked Questions on becoming a Successful Currency Trader
Currency Trading and Forex 100 Success Secrets 100 Most Asked Questions on becoming a Successful Currency Trader Copyright 2008 Currency Trading and Forex 100 Success Secrets Notice of rights All rights
More informationTop tips for online campaign optimisation
Contents 1. The best laid plans... 3 2. Playing to the right audience... 3 3. Quality not quantity... 4 4. You only get out what you put in.. 4 5. Less is more... 5 6. Did you get my message?... 6 7. If
More informationRidiculously Good Outsourcing. The Monetization of Big Data: Made Possible By Humans. www.taskus.com info@taskus.com (888) 400 - TASK
From The TaskUs Library The Monetization of Big Data: Made Possible By Humans Ridiculously Good Outsourcing www.taskus.com info@taskus.com (888) 400 - TASK The Monetization of Big Data: Made Possible by
More informationSeven Things You Must Know Before Hiring a DUI Lawyer
Seven Things You Must Know Before Hiring a DUI Lawyer 1 Introduction Some people don t quite understand the severity of getting a DUI. In many cases, your license is instantly taken away and you won t
More informationHoliday Fraud Myths. How They Leave Retailers Vulnerable
Holiday Fraud Myths How They Leave Retailers Vulnerable Table of Contents 03 04 06 08 10 12 14 Introduction Myth #1 Digital Gift Cards Myth #2 Distance, Dollar and Expedite Myth #3 Machine vs. Manual Review
More informationDoing Multidisciplinary Research in Data Science
Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University aadamov@qu.edu.az http://ce.qu.edu.az/~aadamov 16 May
More informationOur Data & Methodology. Understanding the Digital World by Turning Data into Insights
Our Data & Methodology Understanding the Digital World by Turning Data into Insights Understanding Today s Digital World SimilarWeb provides data and insights to help businesses make better decisions,
More informationData Centric Computing Revisited
Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data
More informationExercises. Updated images, review exercises, and homeworks can be found on the Internet at: http://www.anderson.ucla.edu/faculty/edward.
Preface The story of this book began with my difficult transition from teaching international economics and econometrics in Economics Ph.D. programs at Harvard and UCLA to teaching in the MBA programs
More informationPOLLING STANDARDS. The following standards were developed by a committee of editors and reporters and should be adhered to when using poll results.
! POLLING STANDARDS June, 2006 The following standards were developed by a committee of editors and reporters and should be adhered to when using poll results. OVERVIEW Reporting on polls is no different
More informationThe Data Engineer. Mike Tamir Chief Science Officer Galvanize. Steven Miller Global Leader Academic Programs IBM Analytics
The Data Engineer Mike Tamir Chief Science Officer Galvanize Steven Miller Global Leader Academic Programs IBM Analytics Alessandro Gagliardi Lead Faculty Galvanize Businesses are quickly realizing that
More informationGoogle AdWords Remarketing
Google AdWords Remarketing AdWords remarketing is not only great for driving visitors back to your website to convert but is also great at improving your branding which in effect increases conversion and
More informationA U T H O R S : G a n e s h S r i n i v a s a n a n d S a n d e e p W a g h Social Media Analytics
contents A U T H O R S : G a n e s h S r i n i v a s a n a n d S a n d e e p W a g h Social Media Analytics Abstract... 2 Need of Social Content Analytics... 3 Social Media Content Analytics... 4 Inferences
More informationChapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks
Chapter 29 Scale-Free Network Topologies with Clustering Similar to Online Social Networks Imre Varga Abstract In this paper I propose a novel method to model real online social networks where the growing
More informationCounter Expertise Review on the TNO Security Analysis of the Dutch OV-Chipkaart. OV-Chipkaart Security Issues Tutorial for Non-Expert Readers
Counter Expertise Review on the TNO Security Analysis of the Dutch OV-Chipkaart OV-Chipkaart Security Issues Tutorial for Non-Expert Readers The current debate concerning the OV-Chipkaart security was
More informationGetting to Know Big Data
Getting to Know Big Data Dr. Putchong Uthayopas Department of Computer Engineering, Faculty of Engineering, Kasetsart University Email: putchong@ku.th Information Tsunami Rapid expansion of Smartphone
More information