Data Science and Prediction*
|
|
|
- Poppy Atkins
- 10 years ago
- Views:
Transcription
1 Data Science and Prediction* Vasant Dhar Professor Editor-in-Chief, Big Data Co-Director, Center for Business Analytics, NYU Stern Faculty, Center for Data Science, NYU *Article in Communications of the ACM, Vol. 56 No. 12, December
2 Themes: What Makes Prediction Hard? Noise (physical versus social systems) Physics has 3 theories that explain 99% of observed phenomena, whereas Psychology has 99 theories that explain 3% of observed phenomena. Not knowing the right question to ask (formulation) If only you knew what to ask, I d show you something really interesting! Not having the right/enough data: observational versus experimental Am I looking under the lamppost for the key because? Combining machine and human intelligence Surely human and machine intelligence can augment each other? Believing in the analysis! I don t believe the result. Go find 2
3 The Data Landscape and Applications Financial Markets What will the market do tomorrow? Will the retail sector pull back within a month? Healthcare Who will become sick in the near future? How will some respond to a medication? Marketing Who will respond to what offer? Is a customer likely to attrit shortly? Social/Product Networks Will demand for XXX go up next week given the activity of its neighbors? How should I craft my message so that it spreads through the network? i.e. where should I seed it? What is the sentiment in a collection of textual data? Does the sentiment have any predictive power?
4 Data Science and Prediction Data Science is the study of the generalizable extraction of knowledge from data * A key epistemic requirement for new knowledge (and its actionability ) is its ability to predict and not just explain *Dhar, V., Data Science and Prediction, Communications of the ACM, Vol. 56 No. 12, December
5 1. Noise Physical Systems: theory is expected to be "complete" Social/Health Systems: incomplete models intended to be partial approximations of reality, often based on assumptions of human behavior known to be simplistic. 5
6 What is Noise Anyway? How would you order these on the continuum?
7 Skill and Luck in Baseball: Batting Avg, Singles, and Strikeout YoY Correlation
8 Baseball Metrics By Skill and Luck
9 Is This Ordering Credible? Reversion to the mean exists in activities that combine skill and luck It is useful to know where the problem lies on the continuum above Knowing where we lie in the continuum allows us to anticipate outcomes Illusion of control is a factor in luck situations!
10 Disentangling Skill and Luck: The Formula Variance(observed) = Variance (skill) + Variance (luck) Variance(skill) = Variance (observed) Variance (luck) Variance of winning percentages of teams For win/loss outcomes, stdev = p(1-p)/sqrt(n) where p=prob of outcome (i.e. win), and n=number of cases (i.e. games) This is observable from the data Depends on sample size
11 Prediction in Noisy Domains (Markets)* The more important predictions Actuals Predictions from a model The more important predictions *From: Dhar, V., Prediction in financial markets: The case for small disjuncts, ACM transactions on Intelligent Systems and Technology, volume 2, No 3, April 2011
12 2. Asking the Right Question Patterns Emerge Before Reasons for Them Become Apparent Asking the right question is therefore critical: If only you knew what question to ask me, I d give you very interesting answers from the data. Keep moving on? Dig for causality?
13 What is the Right Question Here?* Clean Period Diagnosis Outcome period T I M E Are complications associated with the yellow meds? Or with the gray meds? Or the yellows in the absence of the blues? Or is it more than three yellows or three blues? Or is it the greens in quick succession? Or does it have to do with lifestyle choices?! (i.e. Bias? Gather modata? *Dhar, V., Data Science and Prediction, Communications of the ACM, Vol. 56 No. 12, December
14 High Level View of Model Discovery Decision Rule or Trading Strategy (i.e. the question ) Better part Better part Better part Better part Better part Breeds Breeds Breeds Breeds Worse part Worse part Worse part Worse part Worse part Drops Out Drops Out Drops Out Drops Out Drops Out Solution Quality Best Average Worst Iterations
15 Solutions Can Represent Arbitrary Data Structures Arrays and sequences a + - c b Trees 0 < 0.8 AND Boolean Expressions I.e. X1 between 0 and 0.8
16 An Interesting Pattern?
17 3. Observational Data and Acquiring New Data Observational data may answer questions without explicitly asking anyone anything! It may also require an understanding of how the data are being generated Are there natural experiments that are reflected in the data or are the data somehow biased through self selection? Is it possible to runnatural experiments to getadditional data to answer the question?
18 Have We Collected The Right Data?
19 Is More Always Better?* Is more data always better? How much you should pay for external data?** *See Provost et.al article in Big Data journal (volume 2, issue 1, 2014) ** See Dalessandroet.al article in Big Data journal (volume 2, issue 2, 2014)
20 4. Prediction as Epistemic Criterion In assessing whether new knowledge is actionable for decision making: predictive power, not just ability to explain the past
21 Validation Framework for Prediction Across Time NO YES Across Universe NO A A Out of sample A A Out of sample Out of time YES B B Out of sample Out of universe Out of sample Out of time Out of universe
22 Prediction Vs Explanation: Varying Model Complexity Future Performance Versus Complexity Zone with (small) correlation between in-sample and out of sample 0.6 Overfitting Future Performance Complexity Note: Complexity measure is illustrative only, based on the learning method used
23 Prediction Errors and Big Data(*) Sources of Error in Predictive Models 1. Mis-specification of the model Big data admits a larger space of functional forms 2. Using a sample to estimate the model With big data, sample is a good estimate of the population 3. Randomness Predictive modeling attempts to minimize the combination of these two errors *Adapted from Shmueli, G. To explain or to predict? Statistical Science 25, 3 (Aug. 2010), )
24 Making Predictions Usable in Noisy Domains* why bother making predictions? i.e. with so many false positives? 1 ROC Curve For Diabetes Prediction 0.8 True Positive Rate (TPR) False Positive Rate (FPR) It all depends on the costs of being wrong and the aggregate pickup* *From Big Data and Predictive Analytics in Healthcare, Big Data Journal, volume 2, Number 3, Sep
25 If Patterns Emerge Before Reasons Are Apparent does it imply you re better off rebuilding the model periodically? when is it necessary to understand the reasons for the patterns? (i.e. causality)
26 If Prediction is Important is problem formulation different than if the purpose is explanation? is it about the right tradeoff between the structure of the problem and the complexity of the model? how do you ensure against common mistakes such as data snooping or leakage in building predictive models?
27 Transparency in Complex Systems How can you describe the learned model? Is it possible to envision when the model will and won t perform well?
28 What s the Explanation? What happened here and why?
29 Current Areas of Inquiry with Big Data Unstructured data WATSON: leveraging human curated data and AI to synthesize answers Deciding data what to keep, aggregate, etc New risk factors based on text and other data at higher frequency News and social media Digital IQ of companies Understanding the relationship between noise, system behavior, expected predictive accuracy, and performance bounds Understanding how actionable results will change future data making future insights harder and less actionable!
30 30
The Rise of Machines and Liquidity Patterns in Markets
The Rise of Machines and Liquidity Patterns in Markets Vasant Dhar Professor and Co-Director, Center for Business Analytics Stern School of Business Editor-in-Chief, Big Data Journal April 25, 2014 Key
Business Analytics Syllabus
B6101 Business Analytics Fall 2014 Business Analytics Syllabus Course Description Business analytics refers to the ways in which enterprises such as businesses, non-profits, and governments can use data
Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer
Machine Learning Chapter 18, 21 Some material adopted from notes by Chuck Dyer What is learning? Learning denotes changes in a system that... enable a system to do the same task more efficiently the next
Supervised Learning (Big Data Analytics)
Supervised Learning (Big Data Analytics) Vibhav Gogate Department of Computer Science The University of Texas at Dallas Practical advice Goal of Big Data Analytics Uncover patterns in Data. Can be used
Applying Data Science to Sales Pipelines for Fun and Profit
Applying Data Science to Sales Pipelines for Fun and Profit Andy Twigg, CTO, C9 @lambdatwigg Abstract Machine learning is now routinely applied to many areas of industry. At C9, we apply machine learning
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Evaluating the Accuracy of a Classifier Holdout, random subsampling, crossvalidation, and the bootstrap are common techniques for
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Evaluation & Validation: Credibility: Evaluating what has been learned
Evaluation & Validation: Credibility: Evaluating what has been learned How predictive is a learned model? How can we evaluate a model Test the model Statistical tests Considerations in evaluating a Model
Data Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
Data Mining Analytics for Business Intelligence and Decision Support
Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing
Behavioral Segmentation
Behavioral Segmentation TM Contents 1. The Importance of Segmentation in Contemporary Marketing... 2 2. Traditional Methods of Segmentation and their Limitations... 2 2.1 Lack of Homogeneity... 3 2.2 Determining
Preliminary Syllabus for the course of Data Science for Business Analytics
Preliminary Syllabus for the course of Data Science for Business Analytics Miguel Godinho de Matos 1,2 and Pedro Ferreira 2,3 1 Cato_lica-Lisbon, School of Business and Economics 2 Heinz College, Carnegie
A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier
A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier G.T. Prasanna Kumari Associate Professor, Dept of Computer Science and Engineering, Gokula Krishna College of Engg, Sullurpet-524121,
Data Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
Performance Measures in Data Mining
Performance Measures in Data Mining Common Performance Measures used in Data Mining and Machine Learning Approaches L. Richter J.M. Cejuela Department of Computer Science Technische Universität München
How I won the Chess Ratings: Elo vs the rest of the world Competition
How I won the Chess Ratings: Elo vs the rest of the world Competition Yannis Sismanis November 2010 Abstract This article discusses in detail the rating system that won the kaggle competition Chess Ratings:
TEXT ANALYTICS INTEGRATION
TEXT ANALYTICS INTEGRATION A TELECOMMUNICATIONS BEST PRACTICES CASE STUDY VISION COMMON ANALYTICAL ENVIRONMENT Structured Unstructured Analytical Mining Text Discovery Text Categorization Text Sentiment
Maximize Social Media Effectiveness with Data Science. An Insurance Industry White Paper from Saama Technologies, Inc.
Maximize Social Media Effectiveness with Data Science An Insurance Industry White Paper from Saama Technologies, Inc. February 2014 Table of Contents Executive Summary 1 Social Media for Insurance 2 Effective
Chapter 6: Episode discovery process
Chapter 6: Episode discovery process Algorithmic Methods of Data Mining, Fall 2005, Chapter 6: Episode discovery process 1 6. Episode discovery process The knowledge discovery process KDD process of analyzing
Hexaware E-book on Predictive Analytics
Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics What is Data mining? Data mining,
WHITEPAPER. Text Analytics Beginner s Guide
WHITEPAPER Text Analytics Beginner s Guide What is Text Analytics? Text Analytics describes a set of linguistic, statistical, and machine learning techniques that model and structure the information content
What? So what? NOW WHAT? Presenting metrics to get results
What? So what? NOW WHAT? What? So what? Visualization is like photography. Impact is a function of focus, illumination, and perspective. What? NOW WHAT? Don t Launch! Prevent your own disastrous decisions
Text Analytics Beginner s Guide. Extracting Meaning from Unstructured Data
Text Analytics Beginner s Guide Extracting Meaning from Unstructured Data Contents Text Analytics 3 Use Cases 7 Terms 9 Trends 14 Scenario 15 Resources 24 2 2013 Angoss Software Corporation. All rights
A Capability Model for Business Analytics: Part 3 Using the Capability Assessment
A Capability Model for Business Analytics: Part 3 Using the Capability Assessment The first article of this series presents a capability model for business analytics, and the second article describes a
Maschinelles Lernen mit MATLAB
Maschinelles Lernen mit MATLAB Jérémy Huard Applikationsingenieur The MathWorks GmbH 2015 The MathWorks, Inc. 1 Machine Learning is Everywhere Image Recognition Speech Recognition Stock Prediction Medical
Data Mining: An Introduction
Data Mining: An Introduction Michael J. A. Berry and Gordon A. Linoff. Data Mining Techniques for Marketing, Sales and Customer Support, 2nd Edition, 2004 Data mining What promotions should be targeted
Why do statisticians "hate" us?
Why do statisticians "hate" us? David Hand, Heikki Mannila, Padhraic Smyth "Data mining is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data
WHY THE WATERFALL MODEL DOESN T WORK
Chapter 2 WHY THE WATERFALL MODEL DOESN T WORK M oving an enterprise to agile methods is a serious undertaking because most assumptions about method, organization, best practices, and even company culture
PSYCHOLOGY PROGRAM LEARNING GOALS AND OUTCOMES BY COURSE LISTING
PSYCHOLOGY PROGRAM LEARNING GOALS AND OUTCOMES BY COURSE LISTING Psychology 1010: General Psychology Learning Goals and Outcomes LEARNING GOAL 1: KNOWLEDGE BASE OF PSYCHOLOGY Demonstrate familiarity with
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 11 Sajjad Haider Fall 2013 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
DEMYSTIFYING BIG DATA. What it is, what it isn t, and what it can do for you.
DEMYSTIFYING BIG DATA What it is, what it isn t, and what it can do for you. JAMES LUCK BIO James Luck is a Data Scientist with AT&T Consulting. He has 25+ years of experience in data analytics, in addition
Optimizing Enrollment Management with Predictive Modeling
Optimizing Enrollment Management with Predictive Modeling Tips and Strategies for Getting Started with Predictive Analytics in Higher Education an ebook presented by Optimizing Enrollment with Predictive
What is Data Science? Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014
What is Data Science? { Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014 Let s start with: What is Data? http://upload.wikimedia.org/wikipedia/commons/f/f0/darpa
Big Data Analytics in Mobile Environments
1 Big Data Analytics in Mobile Environments 熊 辉 教 授 罗 格 斯 - 新 泽 西 州 立 大 学 2012-10-2 Rutgers, the State University of New Jersey Why big data: historical view? Productivity versus Complexity (interrelatedness,
I N T E L L I G E N T S O L U T I O N S, I N C. DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD
I N T E L L I G E N T S O L U T I O N S, I N C. OILFIELD DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD 5 5 T A R A P L A C E M O R G A N T O W N, W V 2 6 0 5 0 USA
White Paper: Impact of Inventory on Network Design
White Paper: Impact of Inventory on Network Design Written as Research Project at Georgia Tech with support from people at IBM, Northwestern, and Opex Analytics Released: January 2014 Georgia Tech Team
B2B opportunity predictiona Big Data and Advanced. Analytics Approach. Insert
B2B opportunity predictiona Big Data and Advanced Analytics Approach Vodafone Global Enterprise Manu Kumar, Head of Targeting, Optimization & Data Science Insert Agenda Why B2B opportunities are hard to
Advancing Analytics in Your Organization
Advancing Analytics in Your Organization Sarah Shillington Leidos Health, EVP Annette Savage Leidos Health, Senior Solutions Manager Bryan Fiekers HIMSS Analytics, Director leidoshealth.com Uniting 25
Social Media and Digital Marketing Analytics ( INFO-UB.0038.01) Professor Anindya Ghose Monday Friday 6-9:10 pm from 7/15/13 to 7/30/13
Social Media and Digital Marketing Analytics ( INFO-UB.0038.01) Professor Anindya Ghose Monday Friday 6-9:10 pm from 7/15/13 to 7/30/13 [email protected] twitter: aghose pages.stern.nyu.edu/~aghose
Course Description This course will change the way you think about data and its role in business.
INFO-GB.3336 Data Mining for Business Analytics Section 32 (Tentative version) Spring 2014 Faculty Class Time Class Location Yilu Zhou, Ph.D. Associate Professor, School of Business, Fordham University
Social Media Intelligence
Social Media Intelligence In the world of Facebook, Twitter, and Yelp, water-cooler conversations with co- workers and backyard small talk with neighbors have moved from the physical world to the digital
The Decision Management Manifesto
An Introduction Decision Management is a powerful approach, increasingly used to adopt business rules and advanced analytic technology. The Manifesto lays out key principles of the approach. James Taylor
IBM Content Analytics with Enterprise Search, Version 3.0
IBM Content Analytics with Enterprise Search, Version 3.0 Highlights Enables greater accuracy and control over information with sophisticated natural language processing capabilities to deliver the right
TECHNIQUES FOR OPTIMIZING THE RELATIONSHIP BETWEEN DATA STORAGE SPACE AND DATA RETRIEVAL TIME FOR LARGE DATABASES
Techniques For Optimizing The Relationship Between Data Storage Space And Data Retrieval Time For Large Databases TECHNIQUES FOR OPTIMIZING THE RELATIONSHIP BETWEEN DATA STORAGE SPACE AND DATA RETRIEVAL
Big Data: Key Concepts The three Vs
Big Data: Key Concepts The three Vs Big data in general has context in three Vs: Sheer quantity of data Speed with which data is produced, processed, and digested Diversity of sources inside and outside.
PREDICTIVE ANALYTICS FOR THE HEALTHCARE INDUSTRY
PREDICTIVE ANALYTICS FOR THE HEALTHCARE INDUSTRY By Andrew Pearson Qualex Asia Today, healthcare companies are drowning in data. According to IBM, most healthcare organizations have terabytes and terabytes
Health Data Analytics. Data to Value For Small and Medium Healthcare organizations
Health Data Analytics Data to Value For Small and Medium Healthcare organizations HEALTH DATA ANALYTICS WHITE PAPER JULY 2013 GREENCASTLE CONSULTING Abstract This paper is targeted toward small and medium
Big Data & Analytics. The. Deal. About. Jacob Büchler [email protected] Cand. Polit. IBM Denmark, Solution Exec. 2013 IBM Corporation
The Big Data & Analytics Deal About Jacob Büchler [email protected] Cand. Polit. IBM Denmark, Solution Exec. 1 Big Data is All Data from Everywhere Big Data Is Becoming The Next Natural Resource We
9 Keys to Effectively Managing Software Projects
9 Keys to Effectively Managing Software Projects Introduction Can managing software development be as simple as reading a brief to-do/not-to-do list? No. All evidence indicates that software development
Credibility and Pooling Applications to Group Life and Group Disability Insurance
Credibility and Pooling Applications to Group Life and Group Disability Insurance Presented by Paul L. Correia Consulting Actuary [email protected] (207) 771-1204 May 20, 2014 What I plan to cover
Big Data & Analytics. Counterparty Credit Risk Management. Big Data in Risk Analytics
Deniz Kural, Senior Risk Expert BeLux Patrick Billens, Big Data Solutions Leader Big Data & Analytics Counterparty Credit Risk Management Challenges for the Counterparty Credit Risk Manager Regulatory
A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH
205 A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH ABSTRACT MR. HEMANT KUMAR*; DR. SARMISTHA SARMA** *Assistant Professor, Department of Information Technology (IT), Institute of Innovation in Technology
Five Ways Retailers Can Profit from Customer Intelligence
Five Ways Retailers Can Profit from Customer Intelligence Use predictive analytics to reach your best customers. An Apption Whitepaper Tel: 1-888-655-6875 Email: [email protected] www.apption.com/customer-intelligence
Cutting Through The Hype: What You Need To Know About Big Data
Cutting Through The Hype: What You Need To Know About Big Data Bill Franks Chief Analytics Officer, Global Alliances, Teradata [email protected] @BillFranksGA www.bill-franks.com www.linkedin.com/in/billfranksga
Planning sample size for randomized evaluations
TRANSLATING RESEARCH INTO ACTION Planning sample size for randomized evaluations Simone Schaner Dartmouth College povertyactionlab.org 1 Course Overview Why evaluate? What is evaluation? Outcomes, indicators
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 10 Sajjad Haider Fall 2012 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
Busting 7 Myths about Master Data Management
Knowledge Integrity Incorporated Busting 7 Myths about Master Data Management Prepared by: David Loshin Knowledge Integrity, Inc. August, 2011 Sponsored by: 2011 Knowledge Integrity, Inc. 1 (301) 754-6350
Summary of GAO Cost Estimate Development Best Practices and GAO Cost Estimate Audit Criteria
Characteristic Best Practice Estimate Package Component / GAO Audit Criteria Comprehensive Step 2: Develop the estimating plan Documented in BOE or Separate Appendix to BOE. An analytic approach to cost
TDWI Best Practice BI & DW Predictive Analytics & Data Mining
TDWI Best Practice BI & DW Predictive Analytics & Data Mining Course Length : 9am to 5pm, 2 consecutive days 2012 Dates : Sydney: July 30 & 31 Melbourne: August 2 & 3 Canberra: August 6 & 7 Venue & Cost
Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05
Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 2015-03-05 Roman Kern (KTI, TU Graz) Ensemble Methods 2015-03-05 1 / 38 Outline 1 Introduction 2 Classification
Making Sense of the Mayhem: Machine Learning and March Madness
Making Sense of the Mayhem: Machine Learning and March Madness Alex Tran and Adam Ginzberg Stanford University [email protected] [email protected] I. Introduction III. Model The goal of our research
Big Data for Marketing & Sales: Data Accuracy to Business Impact
Needs Strategy Big Data for Marketing & Sales: Data Accuracy to Business Impact An IDG Connect survey of marketing, sales and research personnel in 300 US enterprise organizations. Decisions Usage Planning
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification
The Truth About Credit Repair
The Truth About Credit Repair Discover The Insider Secrets Of How The Credit System Really Works and How To Beat The Credit Bureaus At Their Own Game. David Shapiro Esq. Applied Credit Repair Solutions
Intelligent Heuristic Construction with Active Learning
Intelligent Heuristic Construction with Active Learning William F. Ogilvie, Pavlos Petoumenos, Zheng Wang, Hugh Leather E H U N I V E R S I T Y T O H F G R E D I N B U Space is BIG! Hubble Ultra-Deep Field
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
Building a Data Quality Scorecard for Operational Data Governance
Building a Data Quality Scorecard for Operational Data Governance A White Paper by David Loshin WHITE PAPER Table of Contents Introduction.... 1 Establishing Business Objectives.... 1 Business Drivers...
CHAPTER THREE: METHODOLOGY. 3.1. Introduction. emerging markets can successfully organize activities related to event marketing.
Event Marketing in IMC 44 CHAPTER THREE: METHODOLOGY 3.1. Introduction The overall purpose of this project was to demonstrate how companies operating in emerging markets can successfully organize activities
Measuring the value of social software
IBM Software Services for Lotus White Paper June 2010 Defining a measurement approach that maps activity to business value 1 Contents Introduction Why measure? Defining objectives Types of measurement
Diaspark White Paper July 2010 Microsoft SharePoint 2010 INSIGHT Business Intelligence on a technically enriched platform
Diaspark White Paper July 2010 Microsoft SharePoint 2010 INSIGHT Business Intelligence on a technically enriched platform Contents Executive Summary... 3 Who should read this paper?... 3 SharePoint 2010
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
InvGen: An Efficient Invariant Generator
InvGen: An Efficient Invariant Generator Ashutosh Gupta and Andrey Rybalchenko Max Planck Institute for Software Systems (MPI-SWS) Abstract. In this paper we present InvGen, an automatic linear arithmetic
Measurement and measures. Professor Brian Oldenburg
Measurement and measures Professor Brian Oldenburg Learning objectives 1. To identify similarities/differences between qualitative & quantitative measures 2. To identify steps involved in choosing and/or
Using Earned Value, Part 2: Tracking Software Projects. Using Earned Value Part 2: Tracking Software Projects
Using Earned Value Part 2: Tracking Software Projects Abstract We ve all experienced it too often. The first 90% of the project takes 90% of the time, and the last 10% of the project takes 90% of the time
large-scale machine learning revisited Léon Bottou Microsoft Research (NYC)
large-scale machine learning revisited Léon Bottou Microsoft Research (NYC) 1 three frequent ideas in machine learning. independent and identically distributed data This experimental paradigm has driven
Performance Measures for Machine Learning
Performance Measures for Machine Learning 1 Performance Measures Accuracy Weighted (Cost-Sensitive) Accuracy Lift Precision/Recall F Break Even Point ROC ROC Area 2 Accuracy Target: 0/1, -1/+1, True/False,
PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU
PITFALLS IN TIME SERIES ANALYSIS Cliff Hurvich Stern School, NYU The t -Test If x 1,..., x n are independent and identically distributed with mean 0, and n is not too small, then t = x 0 s n has a standard
Digital vs. Analog Volume Controls
Digital vs. Analog Volume Controls October 2011 AMM ESS 10/11 Summary of this Presentation In a Digital Audio System what is the trade-off between using a digital or an analog volume control? To answer
The Basics of Building Credit
The Basics of Building Credit This program was developed to help middle school students learn the basics of building credit. At the end of this lesson, you should know about all of the Key Topics below:»
Lesson 3 - Processing a Multi-Layer Yield History. Exercise 3-4
Lesson 3 - Processing a Multi-Layer Yield History Exercise 3-4 Objective: Develop yield-based management zones. 1. File-Open Project_3-3.map. 2. Double click the Average Yield surface component in the
How To Bet On An Nfl Football Game With A Machine Learning Program
Beating the NFL Football Point Spread Kevin Gimpel [email protected] 1 Introduction Sports betting features a unique market structure that, while rather different from financial markets, still boasts
Artificial Intelligence Beating Human Opponents in Poker
Artificial Intelligence Beating Human Opponents in Poker Stephen Bozak University of Rochester Independent Research Project May 8, 26 Abstract In the popular Poker game, Texas Hold Em, there are never
Beating the MLB Moneyline
Beating the MLB Moneyline Leland Chen [email protected] Andrew He [email protected] 1 Abstract Sports forecasting is a challenging task that has similarities to stock market prediction, requiring time-series
MACHINE LEARNING BASICS WITH R
MACHINE LEARNING [Hands-on Introduction of Supervised Machine Learning Methods] DURATION 2 DAY The field of machine learning is concerned with the question of how to construct computer programs that automatically
