Getting insights about life cycle cost drivers: an approach based on big data inspired statistical modelling
|
|
|
- Kellie Fisher
- 10 years ago
- Views:
Transcription
1 Introduction A Big Data applied to LCC Conclusion, Getting insights about life cycle cost drivers: an approach based on big data inspired statistical modelling Instituto Superior Técnico, Universidade de Lisboa, Portugal [email protected] EDAM Engineering Design Roundtables February 24, 2015
2 Introduction A Big Data applied to LCC Conclusion Table of Contents 1 Introduction Context Challenges in LCC 2 A Big Data applied to LCC LCC framework Manufacturing cost Maintenance cost 3 Conclusion Summary Discussion Extension
3 Introduction A Big Data applied to LCC Conclusion Context Challenges in LCC Context and motivation Industrial context of jet engine manufacturing Competitive pressure, challenging customer requirements, demanding airworthiness regulations Development of quality and profitable products Trading lifecycle cost (LCC) with performance, weight, quality, etc. Importance of LCC modeling in early design of jet engines [Haskins, 2010] Major part of the cost typically committed during this early phase [Barton, 2001] Multi-attributes decision problem requiring a Systems Engineering (SE) approach Ability to leverage large volumes of historical data to predict future LCC
4 Introduction A Big Data applied to LCC Conclusion Context Challenges in LCC Context and motivation Industrial context of jet engine manufacturing Competitive pressure, challenging customer requirements, demanding airworthiness regulations Development of quality and profitable products Trading lifecycle cost (LCC) with performance, weight, quality, etc. Importance of LCC modeling in early design of jet engines [Haskins, 2010] Major part of the cost typically committed during this early phase [Barton, 2001] Multi-attributes decision problem requiring a Systems Engineering (SE) approach Ability to leverage large volumes of historical data to predict future LCC
5 Introduction A Big Data applied to LCC Conclusion Context Challenges in LCC Jet engine 101
6 Introduction A Big Data applied to LCC Conclusion Context Challenges in LCC Definition of LCC NASA s definition of LCC «(LCC is) the total of the direct, indirect, recurring, nonrecurring, and other related expenses incurred, or estimated to be incurred, in the design, development, verification, production, operation, maintenance, support, and disposal of a project.» [NASA, 2010] Categories of (LCC) modeling approaches [Asiedu, 1998] Parametric (regression on a few variables) Analogous (extrapolation from similar items) Analytical (detailed, based on activity or physics) Challenges in LCC estimation in early design [Meisl, 1988] Insufficient details in requirements or contextual parameters Lack of quality data
7 Introduction A Big Data applied to LCC Conclusion Context Challenges in LCC Definition of LCC NASA s definition of LCC «(LCC is) the total of the direct, indirect, recurring, nonrecurring, and other related expenses incurred, or estimated to be incurred, in the design, development, verification, production, operation, maintenance, support, and disposal of a project.» [NASA, 2010] Categories of (LCC) modeling approaches [Asiedu, 1998] Parametric (regression on a few variables) Analogous (extrapolation from similar items) Analytical (detailed, based on activity or physics) Challenges in LCC estimation in early design [Meisl, 1988] Insufficient details in requirements or contextual parameters Lack of quality data
8 Introduction A Big Data applied to LCC Conclusion Context Challenges in LCC Definition of LCC NASA s definition of LCC «(LCC is) the total of the direct, indirect, recurring, nonrecurring, and other related expenses incurred, or estimated to be incurred, in the design, development, verification, production, operation, maintenance, support, and disposal of a project.» [NASA, 2010] Categories of (LCC) modeling approaches [Asiedu, 1998] Parametric (regression on a few variables) Analogous (extrapolation from similar items) Analytical (detailed, based on activity or physics) Challenges in LCC estimation in early design [Meisl, 1988] Insufficient details in requirements or contextual parameters Lack of quality data
9 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Table of Contents 1 Introduction Context Challenges in LCC 2 A Big Data applied to LCC LCC framework Manufacturing cost Maintenance cost 3 Conclusion Summary Discussion Extension
10 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Expression of LCC Two-terms (simplified) equation of LCC For a given mechanical component, at time t 0 : LCC t0 = MC t0 + i th shop visit R t i xmc ti where: MC t : Manufacturing cost at a given time t R ti : Replacement rate at the i th maintenance visit Main features of the LCC model LCC as a function of time: Manufacturing: materials rates, quantity, productivity... Maintenance: engine deterioration, fleet structure... Simplified equation of LCC No R&D (fuzzy) or disposal costs (negligible) Manufacturing cost: no labor or consumables considered Maintenance cost: repair not considered
11 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Expression of LCC Two-terms (simplified) equation of LCC For a given mechanical component, at time t 0 : LCC t0 = MC t0 + i th shop visit R t i xmc ti where: MC t : Manufacturing cost at a given time t R ti : Replacement rate at the i th maintenance visit Main features of the LCC model LCC as a function of time: Manufacturing: materials rates, quantity, productivity... Maintenance: engine deterioration, fleet structure... Simplified equation of LCC No R&D (fuzzy) or disposal costs (negligible) Manufacturing cost: no labor or consumables considered Maintenance cost: repair not considered
12 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Description of the case study Compressor blades Dozens of parts in each of the 14 stages Fleet of engines currently in service Parts manufactured according to forging processes Deterioration drivers Temperature near combustion chamber Vibration, impacts...
13 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Data availability Large maintenance dataset More than 2 millions of parts considered 100 input variables Irregular "dirty" data Small manufacturing dataset Manufacturing cost for the year of 2012 as the output variable 520 parts manufactured 6 covariates
14 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Estimation of manufacturing cost Probabilistic equation of manufacturing cost For a given mechanical component, at time t: MC t = f (L, W, P, A, M, Q t, CR t ) where: MC t : Manufacturing cost (continuous output variable) f (.): Complex nonlinear function to estimate (non time dependent) Important preliminary remarks Model at the component level Time dimension due to economic-related variables Probabilistic aspect due to variability in the input variables
15 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Estimation of manufacturing cost Probabilistic equation of manufacturing cost For a given mechanical component, at time t: MC t = f (L, W, P, A, M, Q t, CR t ) where: MC t : Manufacturing cost (continuous output variable) f (.): Complex nonlinear function to estimate (non time dependent) Important preliminary remarks Model at the component level Time dimension due to economic-related variables Probabilistic aspect due to variability in the input variables
16 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Characteristics of the manufacturing cost model Seven input variables depending on each mechanical component Technical input variables Component length L and width W Manufacturing process P Alloy type A and machinability M Economics-related input variables Accumulated production volume Q t Materials cost rate CR t Procedure and criteria for model assessment Training the model with the data on 80% of the dataset and evaluate its generalization on remaining 20% Prediction accuracy measured by percentage of good prediction on kept data
17 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Characteristics of the manufacturing cost model Seven input variables depending on each mechanical component Technical input variables Component length L and width W Manufacturing process P Alloy type A and machinability M Economics-related input variables Accumulated production volume Q t Materials cost rate CR t Procedure and criteria for model assessment Training the model with the data on 80% of the dataset and evaluate its generalization on remaining 20% Prediction accuracy measured by percentage of good prediction on kept data
18 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Results Overall predictive accuracy Goodness-of-fit (R 2 ) Accuracy (MAPE) >0.80 >90% Relative influence of each covariate on manufacturing cost
19 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Equation of maintenance cost Probabilistic equation of maintenance cost For a given mechanical component, at time t i : R ti = g(t j T, Pj T, Sj T, V j T,..., AT T, AP T, H T, C T,...) where: R ti : Average replacement rate of the part (binary output) g(.): Complex nonlinear function to estimate T = [t i 1, t i ]: Time between two maintenance visits Important remarks Hundreds of input variables extracted from complex time series (air temperature AT T or pressure AP T...) Six binary classifiers used as statistical models: logistic regression, support vector machines, decision trees, random forests, boosted trees, neural networks
20 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Equation of maintenance cost Probabilistic equation of maintenance cost For a given mechanical component, at time t i : R ti = g(t j T, Pj T, Sj T, V j T,..., AT T, AP T, H T, C T,...) where: R ti : Average replacement rate of the part (binary output) g(.): Complex nonlinear function to estimate T = [t i 1, t i ]: Time between two maintenance visits Important remarks Hundreds of input variables extracted from complex time series (air temperature AT T or pressure AP T...) Six binary classifiers used as statistical models: logistic regression, support vector machines, decision trees, random forests, boosted trees, neural networks
21 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Characteristics of the maintenance cost model Pre-processing to extract features from raw historical sensor data Min/max, mean, variance Curve trend Discontinuities Class based on patterns Signature analysis (wavelets, splines) Criteria assessing model performance % of correctly predicted outcomes Area Under the ROC curve (AUC) more robust
22 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Results Procedure and criteria for model assessment similar to the ones used for manufacturing cost Training with 80% of the data and prediction accuracy on remaining 20% Binary classifiers in lieu of regression models in manufacturing cost Goodness-of-fit measured by TPR Prediction accuracy measured by AUC Goodness-of-fit (TPR) Accuracy (AUC) >85% >85%
23 Introduction A Big Data applied to LCC Conclusion LCC framework Manufacturing cost Maintenance cost Results Procedure and criteria for model assessment similar to the ones used for manufacturing cost Training with 80% of the data and prediction accuracy on remaining 20% Binary classifiers in lieu of regression models in manufacturing cost Goodness-of-fit measured by TPR Prediction accuracy measured by AUC Goodness-of-fit (TPR) Accuracy (AUC) >85% >85%
24 Introduction A Big Data applied to LCC Conclusion Summary Discussion Extension Table of Contents 1 Introduction Context Challenges in LCC 2 A Big Data applied to LCC LCC framework Manufacturing cost Maintenance cost 3 Conclusion Summary Discussion Extension
25 Introduction A Big Data applied to LCC Conclusion Summary Discussion Extension Take-home messages A valid innovative framework Validity of the data-driven, statistics-based approach Scalable, reusable, maintainable model of LCC Contribute to structure the data within the IT system Satisfactory results Complementary insights from ensemble models Good fit (R 2 > 0.8) and prediction accuracy (MAPE > 0.8 & AUC > 85%) on new data, for manufacturing and maintenance costs Ranking of most relevant drivers LCC, challenging traditional engineering approach
26 Introduction A Big Data applied to LCC Conclusion Summary Discussion Extension Take-home messages A valid innovative framework Validity of the data-driven, statistics-based approach Scalable, reusable, maintainable model of LCC Contribute to structure the data within the IT system Satisfactory results Complementary insights from ensemble models Good fit (R 2 > 0.8) and prediction accuracy (MAPE > 0.8 & AUC > 85%) on new data, for manufacturing and maintenance costs Ranking of most relevant drivers LCC, challenging traditional engineering approach
27 Introduction A Big Data applied to LCC Conclusion Summary Discussion Extension Benefits and limits of the statistical approach Benefits Generic framework Applicable at system, subsystem and component levels Applicable to many (mechanical) products and sectors Probabilistic, data-driven approach Risk in decision-making and financial evaluation Estimation based on actual data (less dubious assumptions needed) Limits Data-driven (!) Requires sufficient amount of actual good quality data Pre-existing databases and data-rich IT infrastructure Requires statistical training amongst engineers Less applicable to families of products with low commonality
28 Introduction A Big Data applied to LCC Conclusion Summary Discussion Extension Benefits and limits of the statistical approach Benefits Generic framework Applicable at system, subsystem and component levels Applicable to many (mechanical) products and sectors Probabilistic, data-driven approach Risk in decision-making and financial evaluation Estimation based on actual data (less dubious assumptions needed) Limits Data-driven (!) Requires sufficient amount of actual good quality data Pre-existing databases and data-rich IT infrastructure Requires statistical training amongst engineers Less applicable to families of products with low commonality
29 Introduction A Big Data applied to LCC Conclusion Summary Discussion Extension Potential next steps Refine the models Add more input variables, better extracted from time series (maintenance cost) Use more sophisticated models (Dynamic Bayesian networks, Functional Data Analysis, Markov-like models...) Interactions between drivers of manufacturing and maintenance costs Benchmark and extend the results Compare with accuracy and variable importance obtained from current «manual» estimates of LCC by Rolls-Royce cost engineers Verify validity on different components or sectors
30 Introduction A Big Data applied to LCC Conclusion Summary Discussion Extension Potential next steps Refine the models Add more input variables, better extracted from time series (maintenance cost) Use more sophisticated models (Dynamic Bayesian networks, Functional Data Analysis, Markov-like models...) Interactions between drivers of manufacturing and maintenance costs Benchmark and extend the results Compare with accuracy and variable importance obtained from current «manual» estimates of LCC by Rolls-Royce cost engineers Verify validity on different components or sectors
31 Introduction A Big Data applied to LCC Conclusion Summary Discussion Extension References Asiedu Y. and P. Gu Product life cycle cost analysis: state of the art review. International Journal of Production Research. 36(4): Barton J. A., D. M. Love and G. D. Taylor Design determines 70% of cost? A review of implications for design evaluation. Journal of Engineering Design 12 (1): Haskins, C Systems Engineering Handbook: A Guide for System Life Cycle Processes and Activities. Version 3.2. Revised by M. Krueger, D. Walden, and R. D. Hamelin. San Diego, CA (US): INCOSE Liu H., V. Gopalkrishnan, W.K. Ng, B. Song and X. Li An intelligent system for estimating full product Life Cycle Cost at the early design stage. International Journal of Product Lifecycle Management 2 (2-3): Meisl C.J Techniques for cost estimating in early program phases. Engineering Costs and Production Economics 14 (2), NASA NASA Cost Estimating Handbook. Washington, DC (US): National Aeronautics and Space Administration NASA Headquarters - Cost Analysis Division. Seo K.-K., J.-H. Park, D.-S. Jang and D. Wallace Approximate Estimation of the Product Life Cycle Cost Using Artificial Neural Networks in Conceptual Design. The International Journal of Advanced Manufacturing Technology. 19 (6):
Classification of Bad Accounts in Credit Card Industry
Classification of Bad Accounts in Credit Card Industry Chengwei Yuan December 12, 2014 Introduction Risk management is critical for a credit card company to survive in such competing industry. In addition
Data Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
Random forest algorithm in big data environment
Random forest algorithm in big data environment Yingchun Liu * School of Economics and Management, Beihang University, Beijing 100191, China Received 1 September 2014, www.cmnt.lv Abstract Random forest
WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat
Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise
Data Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
How To Make A Credit Risk Model For A Bank Account
TRANSACTIONAL DATA MINING AT LLOYDS BANKING GROUP Csaba Főző [email protected] 15 October 2015 CONTENTS Introduction 04 Random Forest Methodology 06 Transactional Data Mining Project 17 Conclusions
New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Introduction
Introduction New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Predictive analytics encompasses the body of statistical knowledge supporting the analysis of massive data sets.
Software project cost estimation using AI techniques
Software project cost estimation using AI techniques Rodríguez Montequín, V.; Villanueva Balsera, J.; Alba González, C.; Martínez Huerta, G. Project Management Area University of Oviedo C/Independencia
MACHINE LEARNING IN HIGH ENERGY PHYSICS
MACHINE LEARNING IN HIGH ENERGY PHYSICS LECTURE #1 Alex Rogozhnikov, 2015 INTRO NOTES 4 days two lectures, two practice seminars every day this is introductory track to machine learning kaggle competition!
From Raw Data to. Actionable Insights with. MATLAB Analytics. Learn more. Develop predictive models. 1Access and explore data
100 001 010 111 From Raw Data to 10011100 Actionable Insights with 00100111 MATLAB Analytics 01011100 11100001 1 Access and Explore Data For scientists the problem is not a lack of available but a deluge.
Pentaho Data Mining Last Modified on January 22, 2007
Pentaho Data Mining Copyright 2007 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest information, please visit our web site at www.pentaho.org
Classification Problems
Classification Read Chapter 4 in the text by Bishop, except omit Sections 4.1.6, 4.1.7, 4.2.4, 4.3.3, 4.3.5, 4.3.6, 4.4, and 4.5. Also, review sections 1.5.1, 1.5.2, 1.5.3, and 1.5.4. Classification Problems
AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.
AUTOMATION OF ENERGY DEMAND FORECASTING by Sanzad Siddique, B.S. A Thesis submitted to the Faculty of the Graduate School, Marquette University, in Partial Fulfillment of the Requirements for the Degree
The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2
2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of
Azure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
MS1b Statistical Data Mining
MS1b Statistical Data Mining Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Administrivia and Introduction Course Structure Syllabus Introduction to
Gerard Mc Nulty Systems Optimisation Ltd [email protected]/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I
Gerard Mc Nulty Systems Optimisation Ltd [email protected]/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I Data is Important because it: Helps in Corporate Aims Basis of Business Decisions Engineering Decisions Energy
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail
DATA MINING, DIRTY DATA, AND COSTS (Research-in-Progress)
DATA MINING, DIRTY DATA, AND COSTS (Research-in-Progress) Leo Pipino University of Massachusetts Lowell [email protected] David Kopcso Babson College [email protected] Abstract: A series of simulations
Course Syllabus. Purposes of Course:
Course Syllabus Eco 5385.701 Predictive Analytics for Economists Summer 2014 TTh 6:00 8:50 pm and Sat. 12:00 2:50 pm First Day of Class: Tuesday, June 3 Last Day of Class: Tuesday, July 1 251 Maguire Building
EST.03. An Introduction to Parametric Estimating
EST.03 An Introduction to Parametric Estimating Mr. Larry R. Dysert, CCC A ACE International describes cost estimating as the predictive process used to quantify, cost, and price the resources required
An Introduction to Data Mining
An Introduction to Intel Beijing [email protected] January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
Applying Data Science to Sales Pipelines for Fun and Profit
Applying Data Science to Sales Pipelines for Fun and Profit Andy Twigg, CTO, C9 @lambdatwigg Abstract Machine learning is now routinely applied to many areas of industry. At C9, we apply machine learning
Better decision making under uncertain conditions using Monte Carlo Simulation
IBM Software Business Analytics IBM SPSS Statistics Better decision making under uncertain conditions using Monte Carlo Simulation Monte Carlo simulation and risk analysis techniques in IBM SPSS Statistics
Data Mining Techniques for Prognosis in Pancreatic Cancer
Data Mining Techniques for Prognosis in Pancreatic Cancer by Stuart Floyd A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUE In partial fulfillment of the requirements for the Degree
KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES
HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES Translating data into business value requires the right data mining and modeling techniques which uncover important patterns within
THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell
THE HYBID CAT-LOGIT MODEL IN CLASSIFICATION AND DATA MINING Introduction Dan Steinberg and N. Scott Cardell Most data-mining projects involve classification problems assigning objects to classes whether
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be
Data Mining mit der JMSL Numerical Library for Java Applications
Data Mining mit der JMSL Numerical Library for Java Applications Stefan Sineux 8. Java Forum Stuttgart 07.07.2005 Agenda Visual Numerics JMSL TM Numerical Library Neuronale Netze (Hintergrund) Demos Neuronale
Benchmarking of different classes of models used for credit scoring
Benchmarking of different classes of models used for credit scoring We use this competition as an opportunity to compare the performance of different classes of predictive models. In particular we want
Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup
Network Anomaly Detection A Machine Learning Perspective Dhruba Kumar Bhattacharyya Jugal Kumar KaKta»C) CRC Press J Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor
RAVEN: A GUI and an Artificial Intelligence Engine in a Dynamic PRA Framework
INL/CON-13-28360 PREPRINT RAVEN: A GUI and an Artificial Intelligence Engine in a Dynamic PRA Framework ANS Annual Meeting C. Rabiti D. Mandelli A. Alfonsi J. J. Cogliati R. Kinoshita D. Gaston R. Martineau
An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century
An Overview of Data Mining: Predictive Modeling for IR in the 21 st Century Nora Galambos, PhD Senior Data Scientist Office of Institutional Research, Planning & Effectiveness Stony Brook University AIRPO
Knowledge Based Descriptive Neural Networks
Knowledge Based Descriptive Neural Networks J. T. Yao Department of Computer Science, University or Regina Regina, Saskachewan, CANADA S4S 0A2 Email: [email protected] Abstract This paper presents a
Customer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD
Predictive Analytics Techniques: What to Use For Your Big Data March 26, 2014 Fern Halper, PhD Presenter Proven Performance Since 1995 TDWI helps business and IT professionals gain insight about data warehousing,
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Risk pricing for Australian Motor Insurance
Risk pricing for Australian Motor Insurance Dr Richard Brookes November 2012 Contents 1. Background Scope How many models? 2. Approach Data Variable filtering GLM Interactions Credibility overlay 3. Model
Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement
Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement Toshio Sugihara Abstract In this study, an adaptive
MSCA 31000 Introduction to Statistical Concepts
MSCA 31000 Introduction to Statistical Concepts This course provides general exposure to basic statistical concepts that are necessary for students to understand the content presented in more advanced
USING DATA MINING FOR BANK DIRECT MARKETING: AN APPLICATION OF THE CRISP-DM METHODOLOGY
USING DATA MINING FOR BANK DIRECT MARKETING: AN APPLICATION OF THE CRISP-DM METHODOLOGY Sérgio Moro and Raul M. S. Laureano Instituto Universitário de Lisboa (ISCTE IUL) Av.ª das Forças Armadas 1649-026
Dynamic Predictive Modeling in Claims Management - Is it a Game Changer?
Dynamic Predictive Modeling in Claims Management - Is it a Game Changer? Anil Joshi Alan Josefsek Bob Mattison Anil Joshi is the President and CEO of AnalyticsPlus, Inc. (www.analyticsplus.com)- a Chicago
Audit Data Analytics. Bob Dohrer, IAASB Member and Working Group Chair. Miklos Vasarhelyi Phillip McCollough
Audit Data Analytics Bob Dohrer, IAASB Member and Working Group Chair Miklos Vasarhelyi Phillip McCollough IAASB Meeting September 2015 Agenda Item 6-A Page 1 Audit Data Analytics Agenda Introductory remarks
Evaluation of Feature Selection Methods for Predictive Modeling Using Neural Networks in Credits Scoring
714 Evaluation of Feature election Methods for Predictive Modeling Using Neural Networks in Credits coring Raghavendra B. K. Dr. M.G.R. Educational and Research Institute, Chennai-95 Email: [email protected]
An Overview of Predictive Analytics for Practitioners. Dean Abbott, Abbott Analytics
An Overview of Predictive Analytics for Practitioners Dean Abbott, Abbott Analytics Thank You Sponsors Empower users with new insights through familiar tools while balancing the need for IT to monitor
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
SOA 2013 Life & Annuity Symposium May 6-7, 2013. Session 30 PD, Predictive Modeling Applications for Life and Annuity Pricing and Underwriting
SOA 2013 Life & Annuity Symposium May 6-7, 2013 Session 30 PD, Predictive Modeling Applications for Life and Annuity Pricing and Underwriting Moderator: Barry D. Senensky, FSA, FCIA, MAAA Presenters: Jonathan
TOTAL ASSET MANAGEMENT. Life Cycle Costing Guideline
TOTAL ASSET MANAGEMENT Life Cycle Costing Guideline September 2004 TAM04-10 Life Cycle Costing Guideline September 2004 TAM04-10 ISBN 0 7313 3325 X (set) ISBN 0 7313 3272 5 1. Asset management New South
The Predictive Data Mining Revolution in Scorecards:
January 13, 2013 StatSoft White Paper The Predictive Data Mining Revolution in Scorecards: Accurate Risk Scoring via Ensemble Models Summary Predictive modeling methods, based on machine learning algorithms
Prediction of Stock Performance Using Analytical Techniques
136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Mimicking human fake review detection on Trustpilot
Mimicking human fake review detection on Trustpilot [DTU Compute, special course, 2015] Ulf Aslak Jensen Master student, DTU Copenhagen, Denmark Ole Winther Associate professor, DTU Copenhagen, Denmark
BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING
BOOSTED REGRESSION TREES: A MODERN WAY TO ENHANCE ACTUARIAL MODELLING Xavier Conort [email protected] Session Number: TBR14 Insurance has always been a data business The industry has successfully
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
Data Isn't Everything
June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,
BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL
The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University
Big Data Text Mining and Visualization. Anton Heijs
Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark
MHI3000 Big Data Analytics for Health Care Final Project Report
MHI3000 Big Data Analytics for Health Care Final Project Report Zhongtian Fred Qiu (1002274530) http://gallery.azureml.net/details/81ddb2ab137046d4925584b5095ec7aa 1. Data pre-processing The data given
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
Forecasting stock markets with Twitter
Forecasting stock markets with Twitter Argimiro Arratia [email protected] Joint work with Marta Arias and Ramón Xuriguera To appear in: ACM Transactions on Intelligent Systems and Technology, 2013,
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
Introduction to Engineering System Dynamics
CHAPTER 0 Introduction to Engineering System Dynamics 0.1 INTRODUCTION The objective of an engineering analysis of a dynamic system is prediction of its behaviour or performance. Real dynamic systems are
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Erkan Er Abstract In this paper, a model for predicting students performance levels is proposed which employs three
Fuzzy Signature Neural Network
Fuzzy Signature Neural Network Final presentation for COMP8780 IHCC Project Supervisor: Professor Tom GEDEON Presented by: Outline Background Neural Network Fuzzy Logic, Fuzzy Rule Based System and Fuzzy
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
Predictive Modeling Techniques in Insurance
Predictive Modeling Techniques in Insurance Tuesday May 5, 2015 JF. Breton Application Engineer 2014 The MathWorks, Inc. 1 Opening Presenter: JF. Breton: 13 years of experience in predictive analytics
Predictive Modeling and Big Data
Predictive Modeling and Presented by Eileen Burns, FSA, MAAA Milliman Agenda Current uses of predictive modeling in the life insurance industry Potential applications of 2 1 June 16, 2014 [Enter presentation
BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics http://www.ccmb.med.umich.edu/node/1376
Course Director: Dr. Kayvan Najarian (DCM&B, [email protected]) Lectures: Labs: Mondays and Wednesdays 9:00 AM -10:30 AM Rm. 2065 Palmer Commons Bldg. Wednesdays 10:30 AM 11:30 AM (alternate weeks) Rm.
A Comparison Between Data Mining Prediction Algorithms for Fault Detection (Case study: Ahanpishegan co.)
www.ijcsi.org 425 A Comparison Between Data Mining Prediction Algorithms for Fault Detection (Case study: Ahanpishegan co.) Golriz Amooee 1*, Behrouz Minaei-Bidgoli 2, Malihe Bagheri-Dehnavi 3 1 Department
EXPLORING & MODELING USING INTERACTIVE DECISION TREES IN SAS ENTERPRISE MINER. Copyr i g ht 2013, SAS Ins titut e Inc. All rights res er ve d.
EXPLORING & MODELING USING INTERACTIVE DECISION TREES IN SAS ENTERPRISE MINER ANALYTICS LIFECYCLE Evaluate & Monitor Model Formulate Problem Data Preparation Deploy Model Data Exploration Validate Models
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM ABSTRACT Luis Alexandre Rodrigues and Nizam Omar Department of Electrical Engineering, Mackenzie Presbiterian University, Brazil, São Paulo [email protected],[email protected]
COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments
Contents List of Figures Foreword Preface xxv xxiii xv Acknowledgments xxix Chapter 1 Fraud: Detection, Prevention, and Analytics! 1 Introduction 2 Fraud! 2 Fraud Detection and Prevention 10 Big Data for
Unlocking Value from. Patanjali V, Lead Data Scientist, Tiger Analytics Anand B, Director Analytics Consulting,Tiger Analytics
Unlocking Value from Patanjali V, Lead Data Scientist, Anand B, Director Analytics Consulting, EXECUTIVE SUMMARY Today a lot of unstructured data is being generated in the form of text, images, videos
Program Life Cycle Cost Driver Model (LCCDM) Daniel W. Miles, General Physics, June 2008
Program Life Cycle Cost Driver Model (LCCDM) Daniel W. Miles, General Physics, June 28 Introduction Several years ago during the creation of the Periscope Total Ownership Cost (TOC) Program, it became
What methods are used to conduct testing?
What is testing? Testing is the practice of making objective judgments regarding the extent to which the system (device) meets, exceeds or fails to meet stated objectives What the purpose of testing? There
Spend Enrichment: Making better decisions starts with accurate data
IBM Software Industry Solutions Industry/Product Identifier Spend Enrichment: Making better decisions starts with accurate data Spend Enrichment: Making better decisions starts with accurate data Contents
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification
An innovative approach combining industrial process data analytics and operator participation to implement lean energy programs: A Case Study
An innovative approach combining industrial process data analytics and operator participation to implement lean energy programs: A Case Study Philippe Mack, Pepite SA Joanna Huddleston, Pepite SA Bernard
Learning is a very general term denoting the way in which agents:
What is learning? Learning is a very general term denoting the way in which agents: Acquire and organize knowledge (by building, modifying and organizing internal representations of some external reality);
A DIFFERENT KIND OF PROJECT MANAGEMENT
SEER for Software SEER project estimation and management solutions improve success rates on complex software projects. Based on sophisticated modeling technology and extensive knowledge bases, SEER solutions
The Operational Value of Social Media Information. Social Media and Customer Interaction
The Operational Value of Social Media Information Dennis J. Zhang (Kellogg School of Management) Ruomeng Cui (Kelley School of Business) Santiago Gallino (Tuck School of Business) Antonio Moreno-Garcia
Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer
Machine Learning Chapter 18, 21 Some material adopted from notes by Chuck Dyer What is learning? Learning denotes changes in a system that... enable a system to do the same task more efficiently the next
CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES
CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES Claus Gwiggner, Ecole Polytechnique, LIX, Palaiseau, France Gert Lanckriet, University of Berkeley, EECS,
Generating analytics impact for a leading aircraft component manufacturer
Case Study Generating ANALYTICS Impact Generating analytics impact for a leading aircraft component manufacturer Client Genpact solution Business impact A global aviation OEM and services major with a
Predict Influencers in the Social Network
Predict Influencers in the Social Network Ruishan Liu, Yang Zhao and Liuyu Zhou Email: rliu2, yzhao2, [email protected] Department of Electrical Engineering, Stanford University Abstract Given two persons
Sales Forecast for Pickup Truck Parts:
Sales Forecast for Pickup Truck Parts: A Case Study on Brake Rubber Mojtaba Kamranfard University of Semnan Semnan, Iran [email protected] Kourosh Kiani Amirkabir University of Technology Tehran,
Random Forest Based Imbalanced Data Cleaning and Classification
Random Forest Based Imbalanced Data Cleaning and Classification Jie Gu Software School of Tsinghua University, China Abstract. The given task of PAKDD 2007 data mining competition is a typical problem
Learning outcomes. Knowledge and understanding. Competence and skills
Syllabus Master s Programme in Statistics and Data Mining 120 ECTS Credits Aim The rapid growth of databases provides scientists and business people with vast new resources. This programme meets the challenges
Is a Data Scientist the New Quant? Stuart Kozola MathWorks
Is a Data Scientist the New Quant? Stuart Kozola MathWorks 2015 The MathWorks, Inc. 1 Facts or information used usually to calculate, analyze, or plan something Information that is produced or stored by
How To Get A Masters Degree In Logistics And Supply Chain Management
Industrial and Systems Engineering Master of Science Program Logistics and Supply Chain Management Department of Integrated Systems Engineering The Ohio State University Logistics is the science of design,
A Property & Casualty Insurance Predictive Modeling Process in SAS
Paper AA-02-2015 A Property & Casualty Insurance Predictive Modeling Process in SAS 1.0 ABSTRACT Mei Najim, Sedgwick Claim Management Services, Chicago, Illinois Predictive analytics has been developing
Title. Introduction to Data Mining. Dr Arulsivanathan Naidoo Statistics South Africa. OECD Conference Cape Town 8-10 December 2010.
Title Introduction to Data Mining Dr Arulsivanathan Naidoo Statistics South Africa OECD Conference Cape Town 8-10 December 2010 1 Outline Introduction Statistics vs Knowledge Discovery Predictive Modeling
