Joseph M. Juran Center for Research in Supply Chain, Operations, and Quality

Size: px
Start display at page:

Download "Joseph M. Juran Center for Research in Supply Chain, Operations, and Quality"

Transcription

1 Joseph M. Juran Center for Research in Supply Chain, Operations, and Quality Professor Kevin Linderman Academic Co-Director of Joseph M. Juran Center for Research in Supply Chain, Operations, and Quality Carlson School of Management University of Minnesota

2 Origins of the Juran Fellows Program

3 Quote from Dr. Juran This will be a place where leaders engage with scholars to shape critical questions, where new knowledge is developed, translated, and disseminated, and, above all, where Quality scholars are trained. It is my hope that, generations from now, historians will note that we helped create the Century of Quality. J.M. Juran

4 Juran Fellowship Award Application Content 1. Career information 2. Executive summary 3. Project description (up to 15 pages) 4. Short curriculum vitae (1-2 pages) 5. Letter of recommendation from a faculty advisor

5 Evaluation Criteria 1. Importance of the Problem (20% of score) 2. Connection to Existing Quality Research (20% of score) 3. Appropriate Methodology (20% of score) 4. Relevance to Scholarship and Practice (20% of score) 5. Timeline and Probability of completion (10% of score)

6 Community of Juran Fellows 6

7 Community of Juran Fellows

8 The Future From seedlings to orchards

9 Recent Juran Fellow Winners Anna Errore University of Palermo (Italy) Ujjal Kumar Mukherjee Carlson School of Management

10 Anna Errore University of Palermo (Italy) Visiting PhD student at the Carlson School of Management Definitive Screening Designs for Quality Improvement

11 Anna Errore University of Palermo, Italy (visiting PhD student at Carlson School of Management, University of Minnesota) William Li Christopher J. Nachtsheim Carlson School of Management, University of Minnesota Definitive Screening Designs and Discrete Choice Experiments for Quality Improvement

12 DOE in Business and Industry All models are wrong; some models are useful. George Box, 1978 Experiments, statistically designed or not, are a component of the learning process. How well one succeeds will be a function of adherence to the scientific method, the most rapid means for speeding the learning process. Juran, J. M., and Godfrey, A. B. (1999). Juran s quality handbook (Vol. 2). New York, McGraw Hill.

13 DOE in Business and Industry In quality applications DOE plays a fundamental role in a huge variety of situations, from design and development of new products, to product or process improvement practices. In conducting experiments one of the most challenging tasks is the design of the experimental settings. Assumptions Design Experiment Analysis

14 Screening Experiments The term `Screening Design' refers to an experimental plan that is intended to find the few significant factors from a list of many potential ones. Primary purpose is to identify significant main effects Three empirical principles Effects sparsity Effects heredity Effects hierarcy Source: Li, Xiang, Nandan Sudarsanam, and Daniel D. Frey. "Regularities in data from factorial experiments." Complexity 11.5 (2006):

15 Example Example of a Screening Experiment The reactor experiment of Box, Hunter, and Hunter (2005) 5 factors: Feed Rate (A), Catalyst (B), Stir Rate (C), Temperature (D), Concentration (E). The response is the percent reacted (y).

16 Example of a Screening Experiment The fitted equation is: y = B D 3.125E BD 5.5DE Obtained with a full factorial experimental design: 2 5 = 32 runs In practice full factorial designs are very rare, especially for screening, because the run size increases exponentially with the number of factors.

17 Purpose of this project Design efficient screening designs Linear models (quantitative response) Non-Linear models (qualitative response) Identify and refine methods for the analysis Get the right information at the minimum cost Cost Information

18 Over reliance on a single model Optimal design depends on a pre-stated model Full Factorial designs optimal for full-factorial model Screening designs assume only first-order effects active (resolution III fractional factorial, Plackett Burman designs) If higher order effects are active results may be misleading

19 Desirable design features Statistical efficiency Minimum alias (confounding) of the effects Small run sizes Cost Information

20 Definitive screening designs Jones, B., & Nachtsheim, C. J. (2011) A Class of Three-Level Designs for Definitive Screening in the Presence of Second-Order Effects, Journal of Quality Technology 43(1), ASQ Brumbaugh Award 2012 Lloyd S. Nelson

21 Definitive screening designs 2 level factors 3+2 level factors 3 level factors First paper of this project (under review) Presented at Spring Research Conference 2013 Statistics in Industry and Technology University of California Los Angeles June 20th -22nd Errore, A., Jones, B., Li, W., Nachtsheim, C. J. (under review) Two-Level Definitive Screening Designs.

22 Three-level DSDs Jones, B., & Nachtsheim, C. J. (2011) A Class of Three-Level Designs for Definitive Screening in the Presence of Second-Order Effects, Journal of Quality Technology 43(1), 1-15.

23 Three-level plus two-level DSDs Added 2-level categorical factors Two construction methods: DSD-augmented, unbiased designs Orth-augmented, orthogonal designs Jones, B., & Nachtsheim, C. J. (2013). Definitive Screening Designs with Added Two-Level Categorical Factors. Journal of quality technology, 45(2),

24 Two-level DSDs Only 2-level factors Categorical factors or too costly to pick three levels Need to correctly identify active factors Goals: Clear the estimation of the ME from the 2fi High D-efficiency in the ME model Small run size Alias = 0 Orthogonal or nearly orthogonal Min run size

25 Construction method Start random m Search exchange Judge improvements 2m m 1 m Intercept ME

26 Example Screening design with 7 factors 1. Optimize a 7x7 design 2. Foldover the 7x7 design to a 14x

27 Example comparisons with other designs 7 factors Typical design choices - Fractional Factorial Resolution III 8 runs

28 Example comparisons with other designs 7 factors Typical design choices: -PlackettBurman 12 runs

29 Example comparisons with other designs 7 factors - Margolin (1969) Weighing design D-efficiency = % Ave abs corr=

30 Example comparisons with other designs 7 factors - Two-level DSD D-efficiency = 91.65% Ave abs corr=

31 Current and future work Design Two-level DSDs (paper under review) Extension to non-linear models and discrete choice experiments (WIP) Analysis Identification and selection of the correct active terms in the analysis stage (WIP)

32 LEARNING FROM HIGH TECH INNOVATION FAILURES: APPLICATION OF BIG DATA PREDICTIVE ANALYTICS ON MEDICAL DEVICE FAILURES Ujjal Kumar Mukherjee PhD Student, Carlson School of Management, UMN Kingshuk K. Sinha Professor, Mosaic Company Professor of Corporate Responsibility Supply Chain & Operations, Carlson School of Management, UMN March 11 th, 2014

33 OVERALL RESEARCH AGENDA Understanding sources of technology failures in medical devices Technology life-cycle (age of underlying technology) Differential impact on sources: Stages of development and manufacturing Predicting device failures Failure risk estimation based on unstructured market data (user experience) Predictive analytics of Big unstructured datasets Improving precision of prediction Judgment bias in failure detection Improving usage and adoption of new technology in healthcare Context of surgical robot Field study in a large multi-specialty hospital in US Looking at doctors and surgical team learning and adoption Organizational capability building of complex and critical technology in the context of healthcare

34 INNOVATION FAILURES IN THE MEDICAL DEVICE INDUSTRY Source: FDA Database Innovation failure adds substantial cost and inconvenience to the system In the second quarter of 2012, million device units were recalled Increasing complexity of devices as well as increased speed of innovation. Failures are not desirable but are not always avoidable. Important to mitigate the impact of failures if one were to happen. Therefore, failure risk prediction is critical to ensure quality of innovation.

35 OBJECTIVES OF THE RESEARCH In this study we are using a large unstructured dataset and BIG DATA analytics to build a PREDICTIVE MODEL of failure risk of high tech innovations. Research Questions: Do field failure data provide a credible signal for early detection and prediction of innovation failure risk? (Prediction) What technology, firm or industry factors individually and in combination can improve the precision of prediction of innovation failure risk? (Precision) Do firms systematically exhibit a judgment bias in interpreting signals related to innovation failures? What factors influence the judgment bias, if present? (Consistency)

36 FAILURE PREDICTION: USING SIGNAL DETECTION THEORY Rate of adverse effects reported by users as signal of failure Under normal working conditions some adverse effect reports are likely - Noise However, systematic failures lead to abnormal adverse effect rate - Signal The idea is to monitor market reports on an ongoing basis Build predictive risk models for device failures Classify user reports into likelihood of device failures Statistically, separating the Signal from Noise is important.

37 VISUALIZING PREDICTION OUTCOME IN SIGNAL DETECTION THEORY (SDT) Predicted False True Actual False Correct Rejection Rate False Alarm Rate (FAR) True Miss Rate (MR) Hit Rate (HR) Receiver operating characteristics (ROC) curve Both are expensive to the firm Accuracy of the Prediction: Area under the curve (AUC) of the ROC curve

38 HYPOTHESES Hypothesis 1 (H1): Field failures predict innovation failure. Hypothesis 2 (H2): Field failures supplemented by factors related to design, production, supply chain, regulation and technology life cycle of the innovation reduces the prediction error of the innovation failure. Hypothesis 3A (H3A): Signal to noise ratio (in H2) is associated with systematic bias in the prediction of innovation failure. High signal to noise ratio is associated with under-reaction bias and low signal to noise ratio is associated with over-reaction bias. Hypothesis 3B (H3B): Severity of innovation failure (in H2) is associated with systematic bias in the prediction of innovation failure. High severity is associated with over-reaction bias and low severity is associated with under-reaction bias.

39 Primary Data Source DATA 1 Adverse events FDA manufacturer and user reported adverse event database (MAUDE) ~ 3 Million usable datapoints. Fairly unstructured with missing data and mis-classification (BIG DATA). ( ) 2 Recalls FDA Recall database ( ) 3 Firm level product FDA approval database, Compustat database ( ) development data 4 Industry competition FDA approval database, Compustat database ( ) 5 Manufacturing changes and Global Manufacturing at the firm level. FDA user and manufacturer facility registration database; FDA supplemental approval text files ( ). 6 Supplier information FDA user and manufacturer facility registration database; FDA supplemental approval text files ( ). 7 Device classification FDA approval database ( ) 8 Usage classification FDA approval database ( ) 9 Adverse event severity FDA / CDRH Patient reports database ~ 6 million data entries ( ). 10 Implant / non-implant FDA / CDRH device database ( ) 11 Device approval classification FDA 510(k) and PMA / PMA supplement database ( )

40 CONCLUSION FROM SEVERAL PREDICTIVE ANALYTIC MODELS ON THE DATA 1. Density of adverse effects (Measure of rate) are significant predictor of recalls. 2. Product development intensity parameters are significant predictors of recalls. 3. Supply chain changes increases recall likelihood in the short run. However, data indicates that generally firms recover from short run glitches in the long run. 4. Frequency of manufacturing process changes are significant predictors of recalls. 5. Global manufacturing significantly increases the likelihood of recalls. 6. Competitive intensity increases the likelihood of recalls. 7. Regulation type PMA has higher likelihood of recalls compared to 510K.

41 PREDICTIVE MODELING AND MODEL SELECTION Generalized Linear Mixed Models (GLMM) REGRESSION MODELS Generalized Additive Models (GAM) True Positive Rate True Positive Rate False Positive Rate False Positive Rate

42 PREDICTIVE MODELING AND MODEL SELECTION ENSEMBLE CLASS MODELS Boosting Classifier Random Forest Classifier True Positive Rate True Positive Rate False Positive Rate False Positive Rate

43 PREDICTIVE MODELING AND MODEL SELECTION Method Initial Model Sensitivity Final Model Sensitivity (AUC) (AUC) LASSO Mixed Logistic Regression 0.61 ( ) 0.81 ( ) Generalized Additive Model Cox Hazard Model Naïve Bayes Neural Network Support Vector Machine (Radial Basis Function Kernel) Boosting Random Forest 0.82 ( ) 0.95 ( ) Number of devices: 5,873, Train set size: 98,650, Test set size: 28,641 and Total sample size (n) : 127,291 Considerably high prediction accuracy.

44 FIRMS EXHIBIT SOME JUDGMENT BIAS IN DETECTING FAILURE SIGNALS Variance Mean Ratio vs. Judgment Bias Failure Judgments Severity Low High Over-reaction Over-reaction Under-reaction Variance Predictive analytic framework proposed will aid decision and judgment Aimed to mitigating some of the cognitive biases Make decision support system more data and information based Experience and human judgment can never be fully replaced but can be aided with data analytics Low High

45 CONCLUSIONS 1. We have shown that it is possible to predict innovation failures with fairly high degree of accuracy. 2. We have also been able to analyze the influence of some critical covariates in improving the accuracy of predicting innovation failure. 3. We have shown that judgment bias exists in managing innovation failure. 4. Use of data analytic decision support can help improve precision of failure detection and mitigate judgment bias to a considerable extent.

46 ACKNOWLEDGEMENTS Juran Center Dissertation Grant, 2013 Social and Business Analytic Collaboration (SOBACO), UMN Grant, 2013 American Hospital Association Grant, 2012 Dissertation Fellowship, Carlson School, UMN, 2014

47 THANK YOU

48 Juran Fellowship Award We welcome your Participation! Kevin Linderman

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

Azure Machine Learning, SQL Data Mining and R

Azure Machine Learning, SQL Data Mining and R Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:

More information

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be

More information

Ujjal Kumar Mukherjee

Ujjal Kumar Mukherjee Ujjal Kumar Mukherjee Ph.D. Candidate Phone (o): (612) 626-9762 Supply Chain and Operations Department Cell: (651) 497-8617 Carlson School of Management Fax: (612) 624-8804 321 19 th Street South Minneapolis,

More information

Virtual Site Event. Predictive Analytics: What Managers Need to Know. Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015

Virtual Site Event. Predictive Analytics: What Managers Need to Know. Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015 Virtual Site Event Predictive Analytics: What Managers Need to Know Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015 1 Ground Rules Virtual Site Ground Rules PMI Code of Conduct applies for this

More information

Study Design and Statistical Analysis

Study Design and Statistical Analysis Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing

More information

Design of Experiments (DOE)

Design of Experiments (DOE) MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Design

More information

Data Mining. Nonlinear Classification

Data Mining. Nonlinear Classification Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15

More information

Predictive Data modeling for health care: Comparative performance study of different prediction models

Predictive Data modeling for health care: Comparative performance study of different prediction models Predictive Data modeling for health care: Comparative performance study of different prediction models Shivanand Hiremath hiremat.nitie@gmail.com National Institute of Industrial Engineering (NITIE) Vihar

More information

Statistics for Experimenters

Statistics for Experimenters Statistics for Experimenters Design, Innovation, and Discovery Second Edition GEORGE E. P. BOX J. STUART HUNTER WILLIAM G. HUNTER WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION FACHGEBIETSBGCHEREI

More information

Classification Problems

Classification Problems Classification Read Chapter 4 in the text by Bishop, except omit Sections 4.1.6, 4.1.7, 4.2.4, 4.3.3, 4.3.5, 4.3.6, 4.4, and 4.5. Also, review sections 1.5.1, 1.5.2, 1.5.3, and 1.5.4. Classification Problems

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department of Mathematics and Statistics Degree Level Expectations, Learning Outcomes, Indicators of

More information

THE UNIVERSITY OF MARYLAND ROBERT H. SMITH SCHOOL OF BUSINESS. PhD

THE UNIVERSITY OF MARYLAND ROBERT H. SMITH SCHOOL OF BUSINESS. PhD PhD Program THE UNIVERSITY OF MARYLAND ROBERT H. SMITH SCHOOL OF BUSINESS PhD Shape the future of business scholarship. Welcome to the PhD program at the Robert H. Smith School of Business. We are a collaborative,

More information

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM KATE GLEASON COLLEGE OF ENGINEERING John D. Hromi Center for Quality and Applied Statistics NEW (or REVISED) COURSE (KGCOE- CQAS- 747- Principles of

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05

Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05 Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 2015-03-05 Roman Kern (KTI, TU Graz) Ensemble Methods 2015-03-05 1 / 38 Outline 1 Introduction 2 Classification

More information

BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES

BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 123 CHAPTER 7 BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 7.1 Introduction Even though using SVM presents

More information

Benchmarking of different classes of models used for credit scoring

Benchmarking of different classes of models used for credit scoring Benchmarking of different classes of models used for credit scoring We use this competition as an opportunity to compare the performance of different classes of predictive models. In particular we want

More information

School of Public Health and Health Services Department of Epidemiology and Biostatistics

School of Public Health and Health Services Department of Epidemiology and Biostatistics School of Public Health and Health Services Department of Epidemiology and Biostatistics Master of Public Health and Graduate Certificate Biostatistics 0-04 Note: All curriculum revisions will be updated

More information

Better credit models benefit us all

Better credit models benefit us all Better credit models benefit us all Agenda Credit Scoring - Overview Random Forest - Overview Random Forest outperform logistic regression for credit scoring out of the box Interaction term hypothesis

More information

CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES

CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES Claus Gwiggner, Ecole Polytechnique, LIX, Palaiseau, France Gert Lanckriet, University of Berkeley, EECS,

More information

Knowledge Discovery from patents using KMX Text Analytics

Knowledge Discovery from patents using KMX Text Analytics Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs anton.heijs@treparel.com Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers

More information

The Predictive Data Mining Revolution in Scorecards:

The Predictive Data Mining Revolution in Scorecards: January 13, 2013 StatSoft White Paper The Predictive Data Mining Revolution in Scorecards: Accurate Risk Scoring via Ensemble Models Summary Predictive modeling methods, based on machine learning algorithms

More information

Chapter 6. The stacking ensemble approach

Chapter 6. The stacking ensemble approach 82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described

More information

COLLEGE OF SCIENCE. John D. Hromi Center for Quality and Applied Statistics

COLLEGE OF SCIENCE. John D. Hromi Center for Quality and Applied Statistics ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE John D. Hromi Center for Quality and Applied Statistics NEW (or REVISED) COURSE: COS-STAT-747 Principles of Statistical Data Mining

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

Keywords Data mining, Classification Algorithm, Decision tree, J48, Random forest, Random tree, LMT, WEKA 3.7. Fig.1. Data mining techniques.

Keywords Data mining, Classification Algorithm, Decision tree, J48, Random forest, Random tree, LMT, WEKA 3.7. Fig.1. Data mining techniques. International Journal of Emerging Research in Management &Technology Research Article October 2015 Comparative Study of Various Decision Tree Classification Algorithm Using WEKA Purva Sewaiwar, Kamal Kant

More information

Predict Influencers in the Social Network

Predict Influencers in the Social Network Predict Influencers in the Social Network Ruishan Liu, Yang Zhao and Liuyu Zhou Email: rliu2, yzhao2, lyzhou@stanford.edu Department of Electrical Engineering, Stanford University Abstract Given two persons

More information

Statistics in Retail Finance. Chapter 7: Fraud Detection in Retail Credit

Statistics in Retail Finance. Chapter 7: Fraud Detection in Retail Credit Statistics in Retail Finance Chapter 7: Fraud Detection in Retail Credit 1 Overview > Detection of fraud remains an important issue in retail credit. Methods similar to scorecard development may be employed,

More information

Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification

Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati tnpatil2@gmail.com, ss_sherekar@rediffmail.com

More information

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-

More information

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence Government of Russian Federation Federal State Autonomous Educational Institution of High Professional Education National Research University «Higher School of Economics» Faculty of Computer Science School

More information

Classification of Bad Accounts in Credit Card Industry

Classification of Bad Accounts in Credit Card Industry Classification of Bad Accounts in Credit Card Industry Chengwei Yuan December 12, 2014 Introduction Risk management is critical for a credit card company to survive in such competing industry. In addition

More information

Doctor of Philosophy in Social Work

Doctor of Philosophy in Social Work GRADUATE FACULTY COUNCIL DOC. NO. 1051 Approved November 20, 2006 RECOMMENDATION OF THE GRADUATE CURRICULUM COMMITTEE AND THE FACULTY OF THE DEPARTMENT OF SOCIAL WORK FOR GRADUATE SCHOOL BULLETIN COPY

More information

BOSTON UNIVERSITY SCHOOL OF PUBLIC HEALTH PUBLIC HEALTH COMPETENCIES

BOSTON UNIVERSITY SCHOOL OF PUBLIC HEALTH PUBLIC HEALTH COMPETENCIES BOSTON UNIVERSITY SCHOOL OF PUBLIC HEALTH PUBLIC HEALTH COMPETENCIES Competency-based education focuses on what students need to know and be able to do in varying and complex situations. These competencies

More information

Data Mining - Evaluation of Classifiers

Data Mining - Evaluation of Classifiers Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010

More information

Getting insights about life cycle cost drivers: an approach based on big data inspired statistical modelling

Getting insights about life cycle cost drivers: an approach based on big data inspired statistical modelling Introduction A Big Data applied to LCC Conclusion, Getting insights about life cycle cost drivers: an approach based on big data inspired statistical modelling Instituto Superior Técnico, Universidade

More information

PROPOSAL FOR A GRADUATE CERTIFICATE IN SOCIAL SCIENCE METHODOLOGY

PROPOSAL FOR A GRADUATE CERTIFICATE IN SOCIAL SCIENCE METHODOLOGY PROPOSAL FOR A GRADUATE CERTIFICATE IN SOCIAL SCIENCE METHODOLOGY By Curtiss Cobb and Jon Krosnick May, 2006 One of Stanford s greatest strengths in the social sciences is methodology. Each social science

More information

Fraud Detection for Online Retail using Random Forests

Fraud Detection for Online Retail using Random Forests Fraud Detection for Online Retail using Random Forests Eric Altendorf, Peter Brende, Josh Daniel, Laurent Lessard Abstract As online commerce becomes more common, fraud is an increasingly important concern.

More information

Leveraging electronic health records for predictive modeling of surgical complications. Grant Weller

Leveraging electronic health records for predictive modeling of surgical complications. Grant Weller Leveraging electronic health records for predictive modeling of surgical complications Grant Weller ISCB 2015 Utrecht NL August 26, 2015 Collaborators: David W. Larson, MD; Jenna Lovely, PharmD, RPh; Berton

More information

Health Policy and Administration PhD Track in Health Services and Policy Research

Health Policy and Administration PhD Track in Health Services and Policy Research Health Policy and Administration PhD Track in Health Services and Policy INTRODUCTION The Health Policy and Administration (HPA) Division of the UIC School of Public Health offers a PhD track in Health

More information

Module 223 Major A: Concepts, methods and design in Epidemiology

Module 223 Major A: Concepts, methods and design in Epidemiology Module 223 Major A: Concepts, methods and design in Epidemiology Module : 223 UE coordinator Concepts, methods and design in Epidemiology Dates December 15 th to 19 th, 2014 Credits/ECTS UE description

More information

Ph.D. in Educational Psychology - Course Sequence

Ph.D. in Educational Psychology - Course Sequence 1 Ph.D. in Educational Psychology - Course Sequence First Semester - Summer EDUC 700: Online Orientation (0 credits) EFND 705A: Residency (2 credits) UNIV LIB: Information, Research, and Resources (0 credits)

More information

Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios

Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios By: Michael Banasiak & By: Daniel Tantum, Ph.D. What Are Statistical Based Behavior Scoring Models And How Are

More information

Prediction of Stock Performance Using Analytical Techniques

Prediction of Stock Performance Using Analytical Techniques 136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University

More information

Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)

Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos) Machine Learning Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos) What Is Machine Learning? A computer program is said to learn from experience E with respect to some class of

More information

Curriculum - Doctor of Philosophy

Curriculum - Doctor of Philosophy Curriculum - Doctor of Philosophy CORE COURSES Pharm 545-546.Pharmacoeconomics, Healthcare Systems Review. (3, 3) Exploration of the cultural foundations of pharmacy. Development of the present state of

More information

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis ElegantJ BI White Paper The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis Integrated Business Intelligence and Reporting for Performance Management, Operational

More information

Total Credits: 32 credits are required for master s program graduates and 53 credits for undergraduate program.

Total Credits: 32 credits are required for master s program graduates and 53 credits for undergraduate program. Middle East Technical University Graduate School of Social Sciences Doctor of Philosophy in Business Administration In the Field of Quantitative Methods Aim of the PhD Program: Quantitative Methods is

More information

Learning from Diversity

Learning from Diversity Learning from Diversity Epitope Prediction with Sequence and Structure Features using an Ensemble of Support Vector Machines Rob Patro and Carl Kingsford Center for Bioinformatics and Computational Biology

More information

Supervised Learning (Big Data Analytics)

Supervised Learning (Big Data Analytics) Supervised Learning (Big Data Analytics) Vibhav Gogate Department of Computer Science The University of Texas at Dallas Practical advice Goal of Big Data Analytics Uncover patterns in Data. Can be used

More information

Predictive Analytics Tools and Techniques

Predictive Analytics Tools and Techniques Global Journal of Finance and Management. ISSN 0975-6477 Volume 6, Number 1 (2014), pp. 59-66 Research India Publications http://www.ripublication.com Predictive Analytics Tools and Techniques Mr. Chandrashekar

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

MS1b Statistical Data Mining

MS1b Statistical Data Mining MS1b Statistical Data Mining Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Administrivia and Introduction Course Structure Syllabus Introduction to

More information

MASTER OF ARTS IN PSYCHOLOGY (208)

MASTER OF ARTS IN PSYCHOLOGY (208) The Graduate School MASTER OF ARTS IN PSYCHOLOGY (208) Program Coordinator: Dr. Tabitha Holmes Office: Jacobson Faculty Tower, room 210 Phone: 845-257-3955 Email: holmest@newpaltz.edu The Psychology Graduate

More information

Speaker First Plenary Session THE USE OF "BIG DATA" - WHERE ARE WE AND WHAT DOES THE FUTURE HOLD? William H. Crown, PhD

Speaker First Plenary Session THE USE OF BIG DATA - WHERE ARE WE AND WHAT DOES THE FUTURE HOLD? William H. Crown, PhD Speaker First Plenary Session THE USE OF "BIG DATA" - WHERE ARE WE AND WHAT DOES THE FUTURE HOLD? William H. Crown, PhD Optum Labs Cambridge, MA, USA Statistical Methods and Machine Learning ISPOR International

More information

Instructional Delivery Model Courses in the Ph.D. program are offered online.

Instructional Delivery Model Courses in the Ph.D. program are offered online. Doctor of Philosophy in Education Doctor of Philosophy Mission Statement The Doctor of Philosophy (Ph.D.) is designed to support the mission of the Fischler School of Education. The program prepares individuals

More information

Better decision making under uncertain conditions using Monte Carlo Simulation

Better decision making under uncertain conditions using Monte Carlo Simulation IBM Software Business Analytics IBM SPSS Statistics Better decision making under uncertain conditions using Monte Carlo Simulation Monte Carlo simulation and risk analysis techniques in IBM SPSS Statistics

More information

ERIC E. SPIRES. Ph.D. May 1987, University of Illinois (major - accounting; special area - auditing; minor - statistics and experimental design)

ERIC E. SPIRES. Ph.D. May 1987, University of Illinois (major - accounting; special area - auditing; minor - statistics and experimental design) ERIC E. SPIRES Department of Accounting & MIS Fisher College of Business The Ohio State University 2100 Neil Avenue Columbus, Ohio 43210 (614) 292-4422 (voice) (614) 292-2118 (fax) spires.1@osu.edu EDUCATION:

More information

DESCRIPTION OF COURSES

DESCRIPTION OF COURSES DESCRIPTION OF COURSES MGT600 Management, Organizational Policy and Practices The purpose of the course is to enable the students to understand and analyze the management and organizational processes and

More information

ID&T PhD Handbook 1. Appendix. Instructional Design & Technology

ID&T PhD Handbook 1. Appendix. Instructional Design & Technology ID&T PhD Handbook 1 Appendix Instructional Design & Technology ID&T PhD Handbook 2 Table of Contents Introduction to the Ph.D. Program In Instructional Design & Technology... 3 Admissions Criteria... 3

More information

Employer Health Insurance Premium Prediction Elliott Lui

Employer Health Insurance Premium Prediction Elliott Lui Employer Health Insurance Premium Prediction Elliott Lui 1 Introduction The US spends 15.2% of its GDP on health care, more than any other country, and the cost of health insurance is rising faster than

More information

2012 3 R s and Predictive Modeling Boot Camp Nov. 8-9, 2012. Session #1: Predictive Modeling: An Overview Syed Muzayan Mehmud, ASA, FCA, MAAA

2012 3 R s and Predictive Modeling Boot Camp Nov. 8-9, 2012. Session #1: Predictive Modeling: An Overview Syed Muzayan Mehmud, ASA, FCA, MAAA 2012 3 R s and Predictive Modeling Boot Camp Nov. 8-9, 2012 Session #1: Predictive Modeling: An Overview Syed Muzayan Mehmud, ASA, FCA, MAAA Predictive Modeling: An Overview November 8, 2012 Syed M. Mehmud

More information

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Spring 2015 Thomas Hill, Ph.D. VP Analytic Solutions Dell Statistica Overview and Agenda Dell Software overview Dell in

More information

Course Descriptions. Seminar in Organizational Behavior II

Course Descriptions. Seminar in Organizational Behavior II Course Descriptions B55 MKT 670 Seminar in Marketing Management This course is an advanced seminar of doctoral level standing. The course is aimed at students pursuing a degree in business, economics or

More information

MHI3000 Big Data Analytics for Health Care Final Project Report

MHI3000 Big Data Analytics for Health Care Final Project Report MHI3000 Big Data Analytics for Health Care Final Project Report Zhongtian Fred Qiu (1002274530) http://gallery.azureml.net/details/81ddb2ab137046d4925584b5095ec7aa 1. Data pre-processing The data given

More information

Lecture 8: Signal Detection and Noise Assumption

Lecture 8: Signal Detection and Noise Assumption ECE 83 Fall Statistical Signal Processing instructor: R. Nowak, scribe: Feng Ju Lecture 8: Signal Detection and Noise Assumption Signal Detection : X = W H : X = S + W where W N(, σ I n n and S = [s, s,...,

More information

Detecting Corporate Fraud: An Application of Machine Learning

Detecting Corporate Fraud: An Application of Machine Learning Detecting Corporate Fraud: An Application of Machine Learning Ophir Gottlieb, Curt Salisbury, Howard Shek, Vishal Vaidyanathan December 15, 2006 ABSTRACT This paper explores the application of several

More information

Ph.D. Biostatistics 2014-2015 Note: All curriculum revisions will be updated immediately on the website http://www.publichealth.gwu.

Ph.D. Biostatistics 2014-2015 Note: All curriculum revisions will be updated immediately on the website http://www.publichealth.gwu. Columbian College of Arts and Sciences and Milken Institute School of Public Health Ph.D. Biostatistics 2014-2015 Note: All curriculum revisions will be updated immediately on the website http://www.publichealth.gwu.edu

More information

T-61.3050 : Email Classification as Spam or Ham using Naive Bayes Classifier. Santosh Tirunagari : 245577

T-61.3050 : Email Classification as Spam or Ham using Naive Bayes Classifier. Santosh Tirunagari : 245577 T-61.3050 : Email Classification as Spam or Ham using Naive Bayes Classifier Santosh Tirunagari : 245577 January 20, 2011 Abstract This term project gives a solution how to classify an email as spam or

More information

Class #6: Non-linear classification. ML4Bio 2012 February 17 th, 2012 Quaid Morris

Class #6: Non-linear classification. ML4Bio 2012 February 17 th, 2012 Quaid Morris Class #6: Non-linear classification ML4Bio 2012 February 17 th, 2012 Quaid Morris 1 Module #: Title of Module 2 Review Overview Linear separability Non-linear classification Linear Support Vector Machines

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD Predictive Analytics Techniques: What to Use For Your Big Data March 26, 2014 Fern Halper, PhD Presenter Proven Performance Since 1995 TDWI helps business and IT professionals gain insight about data warehousing,

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

Detection Sensitivity and Response Bias

Detection Sensitivity and Response Bias Detection Sensitivity and Response Bias Lewis O. Harvey, Jr. Department of Psychology University of Colorado Boulder, Colorado The Brain (Observable) Stimulus System (Observable) Response System (Observable)

More information

School-Wide Public Health Competencies for MPH Students

School-Wide Public Health Competencies for MPH Students School-Wide Public Health Competencies for MPH Students The Louisiana State University Health Sciences Center School of Public Health (LSUHSC SPH) has embraced a competency-based curriculum. The school

More information

Data Mining Part 5. Prediction

Data Mining Part 5. Prediction Data Mining Part 5. Prediction 5.7 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Introduction Linear Regression Other Regression Models References Introduction Introduction Numerical prediction is

More information

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 )

Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) Chapter 13 Introduction to Nonlinear Regression( 非 線 性 迴 歸 ) and Neural Networks( 類 神 經 網 路 ) 許 湘 伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 35 13 Examples

More information

Ph.D. Programs. Wonder Investigate Communicate

Ph.D. Programs. Wonder Investigate Communicate Ph.D. Programs Wonder Investigate Communicate Accounting and Taxation Business Statistics Finance Management Management Information Systems Marketing Operations Management Visit us online at phd.ba.ttu.edu

More information

What is Data Science? Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014

What is Data Science? Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014 What is Data Science? { Data, Databases, and the Extraction of Knowledge Renée T., @becomingdatasci, November 2014 Let s start with: What is Data? http://upload.wikimedia.org/wikipedia/commons/f/f0/darpa

More information

Pragmatic Perspectives on the Measurement of Information Systems Service Quality

Pragmatic Perspectives on the Measurement of Information Systems Service Quality Pragmatic Perspectives on the Measurement of Information Systems Service Quality Analysis with LISREL: An Appendix to Pragmatic Perspectives on the Measurement of Information Systems Service Quality William

More information

Data Mining mit der JMSL Numerical Library for Java Applications

Data Mining mit der JMSL Numerical Library for Java Applications Data Mining mit der JMSL Numerical Library for Java Applications Stefan Sineux 8. Java Forum Stuttgart 07.07.2005 Agenda Visual Numerics JMSL TM Numerical Library Neuronale Netze (Hintergrund) Demos Neuronale

More information

List of Ph.D. Courses

List of Ph.D. Courses Research Methods Courses (5 courses/15 hours) List of Ph.D. Courses The research methods set consists of five courses (15 hours) that discuss the process of research and key methodological issues encountered

More information

PROGRAM HANDBOOK Doctor of Education Higher Education Community College Administration

PROGRAM HANDBOOK Doctor of Education Higher Education Community College Administration PROGRAM HANDBOOK Doctor of Education Higher Education Community College Administration College of Education Graduate Education and Research Texas Tech University Box 41071 Lubbock, TX 79409-1071 (806)

More information

Information and Decision Sciences (IDS)

Information and Decision Sciences (IDS) University of Illinois at Chicago 1 Information and Decision Sciences (IDS) Courses IDS 400. Advanced Business Programming Using Java. 0-4 Visual extended business language capabilities, including creating

More information

Microsoft Azure Machine learning Algorithms

Microsoft Azure Machine learning Algorithms Microsoft Azure Machine learning Algorithms Tomaž KAŠTRUN @tomaz_tsql Tomaz.kastrun@gmail.com http://tomaztsql.wordpress.com Our Sponsors Speaker info https://tomaztsql.wordpress.com Agenda Focus on explanation

More information

DOCTORAL STUDENT HANDBOOK. PhD in Business Administration Area of Marketing

DOCTORAL STUDENT HANDBOOK. PhD in Business Administration Area of Marketing DOCTORAL STUDENT HANDBOOK PhD in Business Administration Area of Marketing Excellence, Opportunity, Success Version 2.0 (September 2013) Table of Contents I. Introduction... 1 II. Program Requirements...

More information

TDWI Best Practice BI & DW Predictive Analytics & Data Mining

TDWI Best Practice BI & DW Predictive Analytics & Data Mining TDWI Best Practice BI & DW Predictive Analytics & Data Mining Course Length : 9am to 5pm, 2 consecutive days 2012 Dates : Sydney: July 30 & 31 Melbourne: August 2 & 3 Canberra: August 6 & 7 Venue & Cost

More information

Predictive Modeling and Big Data

Predictive Modeling and Big Data Predictive Modeling and Presented by Eileen Burns, FSA, MAAA Milliman Agenda Current uses of predictive modeling in the life insurance industry Potential applications of 2 1 June 16, 2014 [Enter presentation

More information

Credit Risk Models. August 24 26, 2010

Credit Risk Models. August 24 26, 2010 Credit Risk Models August 24 26, 2010 AGENDA 1 st Case Study : Credit Rating Model Borrowers and Factoring (Accounts Receivable Financing) pages 3 10 2 nd Case Study : Credit Scoring Model Automobile Leasing

More information

Performance Evaluation of Requirements Engineering Methodology for Automated Detection of Non Functional Requirements

Performance Evaluation of Requirements Engineering Methodology for Automated Detection of Non Functional Requirements Performance Evaluation of Engineering Methodology for Automated Detection of Non Functional J.Selvakumar Assistant Professor in Department of Software Engineering (PG) Sri Ramakrishna Engineering College

More information

Comparison of Data Mining Techniques used for Financial Data Analysis

Comparison of Data Mining Techniques used for Financial Data Analysis Comparison of Data Mining Techniques used for Financial Data Analysis Abhijit A. Sawant 1, P. M. Chawan 2 1 Student, 2 Associate Professor, Department of Computer Technology, VJTI, Mumbai, INDIA Abstract

More information

How To Build A Predictive Model In Insurance

How To Build A Predictive Model In Insurance The Do s & Don ts of Building A Predictive Model in Insurance University of Minnesota November 9 th, 2012 Nathan Hubbell, FCAS Katy Micek, Ph.D. Agenda Travelers Broad Overview Actuarial & Analytics Career

More information

2014-2015 The Master s Degree with Thesis Course Descriptions in Industrial Engineering

2014-2015 The Master s Degree with Thesis Course Descriptions in Industrial Engineering 2014-2015 The Master s Degree with Thesis Course Descriptions in Industrial Engineering Compulsory Courses IENG540 Optimization Models and Algorithms In the course important deterministic optimization

More information

Statistical Models in Data Mining

Statistical Models in Data Mining Statistical Models in Data Mining Sargur N. Srihari University at Buffalo The State University of New York Department of Computer Science and Engineering Department of Biostatistics 1 Srihari Flood of

More information

Business Administration (PhD)

Business Administration (PhD) Business Administration (PhD) The Doctor of Philosophy (PhD) in Business Administration at a minimum consists of 63 academic hours. Each student selects a specialization area such as finance, management,

More information

Active Learning SVM for Blogs recommendation

Active Learning SVM for Blogs recommendation Active Learning SVM for Blogs recommendation Xin Guan Computer Science, George Mason University Ⅰ.Introduction In the DH Now website, they try to review a big amount of blogs and articles and find the

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct

More information

Simple and efficient online algorithms for real world applications

Simple and efficient online algorithms for real world applications Simple and efficient online algorithms for real world applications Università degli Studi di Milano Milano, Italy Talk @ Centro de Visión por Computador Something about me PhD in Robotics at LIRA-Lab,

More information