From Cognitive Science to Data Mining: The first intelligence amplifier

Size: px
Start display at page:

Download "From Cognitive Science to Data Mining: The first intelligence amplifier"

Transcription

1 From Cognitive Science to Data Mining: The first intelligence amplifier Tom Khabaza Abstract This paper gives a brief account of two hypotheses. First that data mining is a kind of intelligence amplifier, and second that machine learning algorithms inspired by ideas from cognitive science contributed significantly to the field of data mining. 1. Introduction: Intelligence Amplifiers and Data Mining Intelligence Amplification Ashby (1956); Licklider (1960); Engelbart (1962) refers the idea that the products of Artificial Intelligence will be used initially, not to create fully intelligent machines, but to amplify or increase the power of human intelligence. Data mining Berry and Linoff (1997); Helberg (2002) is one such intelligence amplifier; data mining algorithms form the core of a process which amplifies our ability to detect and act upon patterns in large quantities of data. Whether data mining is really the first intelligence amplifier is open to debate; perhaps it is the first intelligence amplifier in widespread use. The purpose of this claim is to emphasise that data mining enhances our mental abilities in a way which is much closer to the idea of intelligence amplification than most of the widespread use of IT. 2. Historical Background: Poplog, Clementine and CRISP-DM During the 1980s, the Poplog AI programming environment du Boulay et al. (1986) (developed at Sussex University under the leadership of Aaron Sloman) address: [email protected] (Tom Khabaza) From Animals to Robots and Back September 8, 2011

2 was sold in the non-academic market by Systems Designers Ltd, which later became SD-Scicon. A management buyout from SD-Scicon in 1989 created Integral Solutions Ltd (ISL), whose core business was initially Poplog. At this stage, ISLs product range included two machine learning modules based on decision trees and neural networks, and ISLs early business included a series of projects which applied machine learning to extract useful patterns from customers data that is, data mining projects Fitzsimons et al. (1993). Based on the experience of these projects, Colin Shearer invented the Clementine data mining workbench Khabaza and Shearer (1995). Despite being the first practitioner to execute ISLs commercial data mining projects, I was initially sceptical about the prospects for data mining and the Clementine workbench. Clearly the machine learning techniques used for data mining could not in themselves solve business problems of any significance; how then could data mining technology be of practical use? The answer, which emerged from successive projects, lay in the data mining process. Clementine had the then unique property of making data mining algorithms (at that time synonymous with machine learning algorithms) accessible to non-technologists. This meant that the process of understanding and preparing the data, applying the algorithms, and interpreting and using the results, could be executed by or in close collaboration with people whose primary knowledge was in the business domain Shearer and Khabaza (1995). This in turn meant that business knowledge and understanding could be closely integrated with data mining technology in the process of business problem-solving, without falling foul of the limitations of machine knowledge representation. The design of Clementine, and the business-oriented data mining process which it enabled, were highly influential, and could be said to have shaped modern data mining practice and tools. The business-oriented process was later standardised in the data mining methodology CRISP-DM Chapman et al. (1999). 3. Data Mining Data mining is the use of business knowledge to create new knowledge in natural or artificial form by discovering and interpreting patterns in data. The term business is used here to emphasise the use of data mining for practical purposes, but the definition would be equally correct if business were replaced with domain. At heart, data mining is a business process, and is used in a wide variety of applications, including customer analytics, fraud detection, risk management and law enforcement, and also in science and medicine. 174

3 Figure 1: CRISP-DM diagram. The more recent term Predictive Analytics usually refers to complete solutions in which data mining is embedded. Data mining is distinguished from other forms of data analysis by the use of data mining algorithms, also sometimes called predictive modelling algorithms. Knowledge in artificial form refers to the output of these algorithms, predictive models or data mining models, which are used to increase information locally on the basis of generalisation, and are often embedded in Predictive Analytics solutions. The industry standard data mining methodology is called CRISP-DM [CRISP- DM] (which stands for CRoss-Industry Standard Process for Data Mining), and is depicted in Figure 1. CRISP-DM was created by a research consortium, based on consultation with a wide circle of practicing data miners; during this consultation process, it was discovered that all practicing data miners had independently discovered approximately the same process for successful data mining. CRISP-DM provides an accurate picture of how data mining is carried out, but omits some key properties of the data mining process, and does Figure 1: CRISP-DM diagram not explain why the process has the form that it does Laws of Data Mining Attempting to answer some nagging questions about data mining, I have recently published the 9 laws of data mining Khabaza (2010), listed below: 175

4 1. Business objectives are the origin of every data mining solution (Business Goals Law) 2. Business knowledge is central to every step of the data mining process (Business Knowledge Law) 3. Data preparation is more than half of every data mining process (Data Preparation Law) 4. The right model for a given application can only be discovered by experiment or There is No Free Lunch for the Data Miner (NFL-DM) 5. There are always patterns (Watkins Law) 6. Data mining amplifies perception in the business domain (Insight Law) 7. Prediction increases information locally by generalisation (Prediction Law) 8. The value of data mining results is not determined by the accuracy or stability of predictive models (Value Law) 9. All patterns are subject to change (Law of Change) These laws address many aspects of the data mining process, but in this paper I will focus on the 6th law: Data mining amplifies perception in the business domain. This is also called the Insight Law because in data mining the creation of new knowledge in natural form (knowledge in the head) is often described as producing insight, this being one of the two types of result from data mining, the other being predictive models. 5. From Intelligence to Perception How and why does the data mining process produce new knowledge? The data mining process is essentially one of problem-solving; the business expert works out how to achieve an objective in the business domain. Business problems are solved by humans, not by algorithms, so how does data mining play a part in this? The key issue addressed by data mining is that there may be useful information buried in data, where the required volume of data is too large for patterns to be seen unaided. (Watkins Law indicates that such information is always present.) A conventional view of data mining would suggest that business goals are translated into data mining goals, then the algorithms are applied to the data, producing predictive models; these models are used to make predictions and help guide business decision-making in such a way as to help achieve the business goal. However, this view omits two crucial factors one is the pervasive role of business knowledge (as per the 2nd law) and the other is the production 176

5 of insight, or new knowledge. It is on this second shortcoming that I will now focus. While data mining may indeed produce predictive models to aid decisionmaking, both the models themselves and the process that produces them can also tell us new things about the business or domain. The process of understanding and preparing the data means examining the data in a great deal of detail, and new facts often emerge from this process; the data themselves have no intrinsic meaning, but when interpreted in the light of business knowledge the data often reveal important new information about the business, even before data mining algorithms are applied. When predictive models are produced, these will also often tell us important information about the business this may be revealed by the behaviour of the model, or by the model itself, such as the readable rules in a decision-tree model, or by the relative importance of different input variables in unreadable models. Again this information has no intrinsic importance, but can be seen to be important when interpreted in the light of business knowledge. It is a characteristic of these processes that they take place in the business domain; every piece of data and every action has a business meaning. The data miner works, not in the realm of bits, bytes and algorithms, but in the domain of enquiry. The data mining process enables the data miner to see things which would not be visible unaided. We know that perception is an active, knowledgebased process. The data miner sees things in the business domain by knowing what they are looking at. My first hypothesis in this paper is that data mining amplifies perception in the following way: data mining algorithms can detect patterns in data which are not visible to the naked eye, but the algorithms themselves have no domain knowledge. The business expert has the business knowledge but cannot see the patterns unaided. The data mining process (as described by CRISP-DM) enables the business expert to incorporate the pattern discovery capabilities of the algorithms into their own perceptual process. There is nothing mysterious about this the process is mostly a codification of common sense but it explains why data miners have the experience of seeing things in the data. It is because data mining is like a perceptual process. I have always wondered why machine learning algorithms (from the field of AI) seem to work better for data mining than those originating in the field of statistics. My second hypothesis in this paper is that machine learning algorithms work well for data miners because they are designed to be part of a cognitive system. Machine learning systems tend to be based on intuitively plausible models of knowledge. For the purposes of the data miner, it matters little whether these 177

6 models are correct descriptions of human cognition; what makes them helpful for data miners is the plausible nature of the knowledge they create or the patterns they discover. This makes the algorithms easier to use as an extension of ones own cognition. 6. Conclusion: The Impact of Cognitive Science A birds-eye view of the activities of data miners in organisations would not immediately reveal anything to do with cognition. A data miner appears to (and does in fact) work in the domain of application they would seem like marketeers, or fraud detection operatives, or police intelligence officers, or geneticists, or medics. They are exactly this, but augmented by having their perceptual abilities, within their domain of operation, enhanced by the ability to see meaningful patterns in data. Data mining is acting, for data miners, as an intelligence amplifier. This kind of intelligence amplifier does not provide the expanded human intellect envisioned by Ashby Asaro (2008); nevertheless, the expanded perceptual abilities of data miners can be used to make the world a better place (e.g. Van (2003); Piatetsky-Shapiro et al. (2003); Adderley and Musgrove (1999); McCue (2006); Chang and Shyue (2009)). If my second hypothesis is correct, then this ability of data mining to enhance the perception of domain workers is the result of the output of Cognitive Science research. By focussing on cognition, we have produced tools which can become part of cognition. References Adderley, R., Musgrove, P., Bcs special group expert systems. In: Data mining at the West Midlands Police: A study of bogus official burglaries. Springer-Verlag, London, pp Asaro, P., From mechanisms of adaptation to intelligence amplifiers: The philosophy of w. ross ashby. In: Husbands, P., Holland, O., Wheeler, M. (Eds.), The Mechanical Mind in History. MIT Press. Ashby, W., An Introduction to Cybernetics. Chapman and Hall. Berry, M., Linoff, G., Data Mining Techniques: For Marketing, Sales and Customer Support. Wiley. Chang, C., Shyue, S., A study on the application of data mining to disadvantaged social class in taiwan s population census. Expert Systems with Applications 36, Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., Wirth, R., Crisp-dm 1.0: Step-by-step data mining guide. du Boulay, J., Khabaza, T., Elsom-Cook, M., Taylor, J., Poplog and the learner: An artificial intelligence environment used in education. In: Directory of Computer Training Badegmore part Enterprises for Hoskyns Education. 178

7 Engelbart, D., Oct Augmenting human intellect: A conceptual framework. Tech. Rep. Summary Report AFOSR-3233, Stanford Research Institute, Menlo Park, CA. Fitzsimons, M., Khabaza, T., Shearer, C., November The application of rule induction and neural networks for television audience prediction. In: Proceedings of ESO- MAR/EMAC/AFM Symposium on Information Based Decision Making in Marketing. Paris, pp Helberg, C., Data Mining with Confidence. SPSS Inc., Chicago. Khabaza, T., Nine laws of data mining. Khabaza, T., Shearer, C., Data mining with clementine. In: IEE Colloquium on Knowledge Discovery in Databases in Digest No 1995/021(B). IEE, London. Licklider, J., Man-computer symbosis. IRE Transactions on Human Factors in Electronics HFE-1, McCue, C., Data Mining and Predictive Analysis: Intelligence Gathering and Crime Analysis. Butterworth-Heinemann. Piatetsky-Shapiro, G., Khabaza, T., Ramaswamy, S., August Capturing best practice for microarray gene expression analysis. In: SIGKDD Shearer, C., Khabaza, T., Data mining by data owners. In: Intelligent Data Analysis. Baden-Baden, Germany. Van, J., 11th January Spss tools unravel secrets of disease. Chicago Tribune. 179

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant

More information

Cost Drivers of a Parametric Cost Estimation Model for Data Mining Projects (DMCOMO)

Cost Drivers of a Parametric Cost Estimation Model for Data Mining Projects (DMCOMO) Cost Drivers of a Parametric Cost Estimation Model for Mining Projects (DMCOMO) Oscar Marbán, Antonio de Amescua, Juan J. Cuadrado, Luis García Universidad Carlos III de Madrid (UC3M) Abstract Mining is

More information

CRISP-DM: Towards a Standard Process Model for Data Mining

CRISP-DM: Towards a Standard Process Model for Data Mining CRISP-DM: Towards a Standard Process Model for Mining Rüdiger Wirth DaimlerChrysler Research & Technology FT3/KL PO BOX 2360 89013 Ulm, Germany [email protected] Jochen Hipp Wilhelm-Schickard-Institute,

More information

Tom Khabaza. Hard Hats for Data Miners: Myths and Pitfalls of Data Mining

Tom Khabaza. Hard Hats for Data Miners: Myths and Pitfalls of Data Mining Tom Khabaza Hard Hats for Data Miners: Myths and Pitfalls of Data Mining Hard Hats for Data Miners: Myths and Pitfalls of Data Mining By Tom Khabaza The intrepid data miner runs many risks, including being

More information

CRISP - DM. Data Mining Process. Process Standardization. Why Should There be a Standard Process? Cross-Industry Standard Process for Data Mining

CRISP - DM. Data Mining Process. Process Standardization. Why Should There be a Standard Process? Cross-Industry Standard Process for Data Mining Mining Process CRISP - DM Cross-Industry Standard Process for Mining (CRISP-DM) European Community funded effort to develop framework for data mining tasks Goals: Cross-Industry Standard Process for Mining

More information

Data Mining and Application in Accounting and Auditing

Data Mining and Application in Accounting and Auditing Journal of Education and Vocational Research Vol. 2, No. 6, pp. 211-215, Dec 2011 (ISSN 2221-2590) Data Mining and Application in Accounting and Auditing KeramatOllah Heydari Rostami 1, Saber Samadi 1,

More information

Database Marketing, Business Intelligence and Knowledge Discovery

Database Marketing, Business Intelligence and Knowledge Discovery Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski

More information

Start-up Companies Predictive Models Analysis. Boyan Yankov, Kaloyan Haralampiev, Petko Ruskov

Start-up Companies Predictive Models Analysis. Boyan Yankov, Kaloyan Haralampiev, Petko Ruskov Start-up Companies Predictive Models Analysis Boyan Yankov, Kaloyan Haralampiev, Petko Ruskov Abstract: A quantitative research is performed to derive a model for predicting the success of Bulgarian start-up

More information

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining Lluis Belanche + Alfredo Vellido Intelligent Data Analysis and Data Mining a.k.a. Data Mining II Office 319, Omega, BCN EET, office 107, TR 2, Terrassa [email protected] skype, gtalk: avellido Tels.:

More information

PREDICTING STOCK PRICES USING DATA MINING TECHNIQUES

PREDICTING STOCK PRICES USING DATA MINING TECHNIQUES The International Arab Conference on Information Technology (ACIT 2013) PREDICTING STOCK PRICES USING DATA MINING TECHNIQUES 1 QASEM A. AL-RADAIDEH, 2 ADEL ABU ASSAF 3 EMAN ALNAGI 1 Department of Computer

More information

Information Visualization WS 2013/14 11 Visual Analytics

Information Visualization WS 2013/14 11 Visual Analytics 1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and

More information

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control Andre BERGMANN Salzgitter Mannesmann Forschung GmbH; Duisburg, Germany Phone: +49 203 9993154, Fax: +49 203 9993234;

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over

More information

Data Mining Applications in Higher Education

Data Mining Applications in Higher Education Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2

More information

DATA MINING AND WAREHOUSING CONCEPTS

DATA MINING AND WAREHOUSING CONCEPTS CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation

More information

Big Data. Introducción. Santiago González <[email protected]>

Big Data. Introducción. Santiago González <sgonzalez@fi.upm.es> Big Data Introducción Santiago González Contenidos Por que BIG DATA? Características de Big Data Tecnologías y Herramientas Big Data Paradigmas fundamentales Big Data Data Mining

More information

Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms

Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms Y.Y. Yao, Y. Zhao, R.B. Maguire Department of Computer Science, University of Regina Regina,

More information

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

In this presentation, you will be introduced to data mining and the relationship with meaningful use. In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine

More information

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam ECLT 5810 E-Commerce Data Mining Techniques - Introduction Prof. Wai Lam Data Opportunities Business infrastructure have improved the ability to collect data Virtually every aspect of business is now open

More information

Healthcare Measurement Analysis Using Data mining Techniques

Healthcare Measurement Analysis Using Data mining Techniques www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik

More information

USING DATA MINING FOR BANK DIRECT MARKETING: AN APPLICATION OF THE CRISP-DM METHODOLOGY

USING DATA MINING FOR BANK DIRECT MARKETING: AN APPLICATION OF THE CRISP-DM METHODOLOGY USING DATA MINING FOR BANK DIRECT MARKETING: AN APPLICATION OF THE CRISP-DM METHODOLOGY Sérgio Moro and Raul M. S. Laureano Instituto Universitário de Lisboa (ISCTE IUL) Av.ª das Forças Armadas 1649-026

More information

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Chapter 12 Business Intelligence and Decision Support Systems Information Technology For Management 7 th Edition Turban & Volonino Based on lecture slides by L. Beaubien, Providence College John Wiley

More information

An Introduction to Advanced Analytics and Data Mining

An Introduction to Advanced Analytics and Data Mining An Introduction to Advanced Analytics and Data Mining Dr Barry Leventhal Henry Stewart Briefing on Marketing Analytics 19 th November 2010 Agenda What are Advanced Analytics and Data Mining? The toolkit

More information

Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept

Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept Statistics 215b 11/20/03 D.R. Brillinger Data mining A field in search of a definition a vague concept D. Hand, H. Mannila and P. Smyth (2001). Principles of Data Mining. MIT Press, Cambridge. Some definitions/descriptions

More information

Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report

Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report G. Banos 1, P.A. Mitkas 2, Z. Abas 3, A.L. Symeonidis 2, G. Milis 2 and U. Emanuelson 4 1 Faculty

More information

Working with telecommunications

Working with telecommunications Working with telecommunications Minimizing churn in the telecommunications industry Contents: 1 Churn analysis using data mining 2 Customer churn analysis with IBM SPSS Modeler 3 Types of analysis 3 Feature

More information

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support Rok Rupnik, Matjaž Kukar, Marko Bajec, Marjan Krisper University of Ljubljana, Faculty of Computer and Information

More information

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Machine Learning and Data Mining. Fundamentals, robotics, recognition Machine Learning and Data Mining Fundamentals, robotics, recognition Machine Learning, Data Mining, Knowledge Discovery in Data Bases Their mutual relations Data Mining, Knowledge Discovery in Databases,

More information

DATA MINING TECHNIQUES AND APPLICATIONS

DATA MINING TECHNIQUES AND APPLICATIONS DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,

More information

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD 72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD Paulo Gottgtroy Auckland University of Technology [email protected] Abstract This paper is

More information

DATA MINING AND CRM IN TELECOMMUNICATIONS

DATA MINING AND CRM IN TELECOMMUNICATIONS www.sjm.tf.bor.ac.yu Serbian Journal of Management 3 (1) (2008) 61-72 Serbian Journal of Management Abstract DATA MINING AND CRM IN TELECOMMUNICATIONS D. Ćamilović* BK Faculty of Management, Palmira Toljatija

More information

The Basics of Expert (Knowledge Based) Systems. Contents. 1. Introduction. 2. Scope of Definition - Formal Description

The Basics of Expert (Knowledge Based) Systems. Contents. 1. Introduction. 2. Scope of Definition - Formal Description The Basics of Expert (Knowledge Based) Systems Contents 1. Introduction 2. Scope of Definition - Formal Description 3. Expert System Component Facts 3.1 Rule Based Reasoning 3.2 Databases 3.3 Inference

More information

Requirements Elicitation in Data Mining for Business Intelligence Projects

Requirements Elicitation in Data Mining for Business Intelligence Projects Requirements Elicitation in Data Mining for Business Intelligence Projects Paola Britos 1, Oscar Dieste 2 and Ramón García-Martínez 3 1 Software and Knowledge Engineering Center. Buenos Aires Institute

More information

What is Data Mining, and How is it Useful for Power Plant Optimization? (and How is it Different from DOE, CFD, Statistical Modeling)

What is Data Mining, and How is it Useful for Power Plant Optimization? (and How is it Different from DOE, CFD, Statistical Modeling) data analysis data mining quality control web-based analytics What is Data Mining, and How is it Useful for Power Plant Optimization? (and How is it Different from DOE, CFD, Statistical Modeling) StatSoft

More information

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM. DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,

More information

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table

More information

A Framework of Context-Sensitive Visualization for User-Centered Interactive Systems

A Framework of Context-Sensitive Visualization for User-Centered Interactive Systems Proceedings of 10 th International Conference on User Modeling, pp423-427 Edinburgh, UK, July 24-29, 2005. Springer-Verlag Berlin Heidelberg 2005 A Framework of Context-Sensitive Visualization for User-Centered

More information

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanbul, Turkey, May 27-29, 2007 115 Data Mining for Knowledge Management in Technology Enhanced Learning

More information

Data Mining in Construction s Project Time Management - Kayson Case Study

Data Mining in Construction s Project Time Management - Kayson Case Study Data Mining in Construction s Project Time Management - Kayson Case Study Shahram Shadrokh (Assistant Professor) Sharif University of Technology, [email protected] Seyedbehzad Aghdashi (PhD Student)

More information

Selective Naive Bayes Regressor with Variable Construction for Predictive Web Analytics

Selective Naive Bayes Regressor with Variable Construction for Predictive Web Analytics Selective Naive Bayes Regressor with Variable Construction for Predictive Web Analytics Boullé Orange Labs avenue Pierre Marzin 3 Lannion, France [email protected] ABSTRACT We describe our submission

More information

Unit Options and Core Texts

Unit Options and Core Texts Unit Options and s BSc Health Psychology (Full-Time) Core units Year 1 Foundations to Psychology Introduction to Psychological Research and Data Analysis Psychology in Everyday Life Health and Wellbeing

More information

The Concepts of Predictive Analytics

The Concepts of Predictive Analytics International Journal of Developments in Big Data and Analytics Volume 1 No. 1, 2014, pp. 86 94 The Concepts of Predictive Analytics JAMES OGUNLEYE Middlesex University, United Kingdom ABSTRACT Predictive

More information

How To Use Data Mining For Loyalty Based Management

How To Use Data Mining For Loyalty Based Management Data Mining for Loyalty Based Management Petra Hunziker, Andreas Maier, Alex Nippe, Markus Tresch, Douglas Weers, Peter Zemp Credit Suisse P.O. Box 100, CH - 8070 Zurich, Switzerland [email protected],

More information

Using Data Mining Techniques in Customer Segmentation

Using Data Mining Techniques in Customer Segmentation RESEARCH ARTICLE OPEN ACCESS Using Data Mining Techniques in Customer Segmentation Hasan Ziafat *, Majid Shakeri ** *(Department of Computer Science, Islamic Azad University Natanz branch, Natanz, Iran)

More information

Improving tax administration with data mining

Improving tax administration with data mining Executive report Improving tax administration with data mining Daniele Micci-Barreca, PhD, and Satheesh Ramachandran, PhD Elite Analytics, LLC Table of contents Introduction..............................................................2

More information

Text mining for insurance claim cost prediction

Text mining for insurance claim cost prediction Text mining for insurance claim cost prediction Prepared by Inna Kolyshkina and Marcel van Rooyen Presented to the Institute of Actuaries of Australia XVth General Insurance Seminar 16-19 October 2005

More information

Learning is a very general term denoting the way in which agents:

Learning is a very general term denoting the way in which agents: What is learning? Learning is a very general term denoting the way in which agents: Acquire and organize knowledge (by building, modifying and organizing internal representations of some external reality);

More information

Using Data Mining to Detect Insurance Fraud

Using Data Mining to Detect Insurance Fraud IBM SPSS Modeler Using Data Mining to Detect Insurance Fraud Improve accuracy and minimize loss Highlights: combines powerful analytical techniques with existing fraud detection and prevention efforts

More information

CRISP-DM 1.0. Step-by-step data mining guide

CRISP-DM 1.0. Step-by-step data mining guide CRISP-DM 1.0 Step-by-step data mining guide Pete Chapman (NCR), Julian Clinton (SPSS), Randy Kerber (NCR), Thomas Khabaza (SPSS), Thomas Reinartz (DaimlerChrysler), Colin Shearer (SPSS) and Rüdiger Wirth

More information

Gerard Mc Nulty Systems Optimisation Ltd [email protected]/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I

Gerard Mc Nulty Systems Optimisation Ltd gmcnulty@iol.ie/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I Gerard Mc Nulty Systems Optimisation Ltd [email protected]/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I Data is Important because it: Helps in Corporate Aims Basis of Business Decisions Engineering Decisions Energy

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining José Hernández ndez-orallo Dpto.. de Systems Informáticos y Computación Universidad Politécnica de Valencia, Spain [email protected] Horsens, Denmark, 26th September 2005

More information

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM M. Mayilvaganan 1, S. Aparna 2 1 Associate

More information

Step-by-step data mining guide

Step-by-step data mining guide Step-by-step data mining guide Pete Chapman (NCR), Julian Clinton (SPSS), Randy Kerber (NCR), Thomas Khabaza (SPSS), Thomas Reinartz (DaimlerChrysler), Colin Shearer (SPSS) and Rüdiger Wirth (DaimlerChrysler)

More information

Data Mining with Microsoft SQL Server 2005

Data Mining with Microsoft SQL Server 2005 International DSI / Asia and Pacific DSI 2007 Full Paper (July, 2007) Data Mining with Microsoft SQL Server 2005 Henning Stolz 1), Peter Lehmann 1),Waranya Poonnawat 3) 1) Institute for Business Intelligence,

More information

Data Mining and Analytics in Realizeit

Data Mining and Analytics in Realizeit Data Mining and Analytics in Realizeit November 4, 2013 Dr. Colm P. Howlin Data mining is the process of discovering patterns in large data sets. It draws on a wide range of disciplines, including statistics,

More information

Master of Science in Artificial Intelligence

Master of Science in Artificial Intelligence Master of Science in Artificial Intelligence Options: Engineering and Computer Science (ECS) Speech and Language Technology (SLT) Big Data Analytics (BDA) Faculty of Engineering Science Faculty of Science

More information

APPLICATION OF DATA MINING TECHNIQUES FOR THE DEVELOPMENT OF NEW ROCK MECHANICS CONSTITUTIVE MODELS

APPLICATION OF DATA MINING TECHNIQUES FOR THE DEVELOPMENT OF NEW ROCK MECHANICS CONSTITUTIVE MODELS APPLICATION OF DATA MINING TECHNIQUES FOR THE DEVELOPMENT OF NEW ROCK MECHANICS CONSTITUTIVE MODELS T. Miranda 1, L.R. Sousa 2 *, W. Roggenthen 3, and R.L. Sousa 4 1 University of Minho, Guimarães, Portugal

More information

Design and Development of Electronic Prescription and Patient Information Systems for Developing World By

Design and Development of Electronic Prescription and Patient Information Systems for Developing World By Design and Development of Electronic Prescription and Patient Information Systems for Developing World By Dr Boniface Ekechukwu* and Chidi Obi **Dr Arinze Nweze* *Department of Computer Science, Nnamdi

More information

Comparison of K-means and Backpropagation Data Mining Algorithms

Comparison of K-means and Backpropagation Data Mining Algorithms Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and

More information

Operations Research and Knowledge Modeling in Data Mining

Operations Research and Knowledge Modeling in Data Mining Operations Research and Knowledge Modeling in Data Mining Masato KODA Graduate School of Systems and Information Engineering University of Tsukuba, Tsukuba Science City, Japan 305-8573 [email protected]

More information

IBM SPSS Modeler Professional

IBM SPSS Modeler Professional IBM SPSS Modeler Professional Make better decisions through predictive intelligence Highlights Create more effective strategies by evaluating trends and likely outcomes. Easily access, prepare and model

More information

Standardization of Components, Products and Processes with Data Mining

Standardization of Components, Products and Processes with Data Mining B. Agard and A. Kusiak, Standardization of Components, Products and Processes with Data Mining, International Conference on Production Research Americas 2004, Santiago, Chile, August 1-4, 2004. Standardization

More information

A Medical Decision Support System (DSS) for Ubiquitous Healthcare Diagnosis System

A Medical Decision Support System (DSS) for Ubiquitous Healthcare Diagnosis System , pp. 237-244 http://dx.doi.org/10.14257/ijseia.2014.8.10.22 A Medical Decision Support System (DSS) for Ubiquitous Healthcare Diagnosis System Regin Joy Conejar 1 and Haeng-Kon Kim 1* 1 School of Information

More information

Data Mining and KDD: A Shifting Mosaic. Joseph M. Firestone, Ph.D. White Paper No. Two. March 12, 1997

Data Mining and KDD: A Shifting Mosaic. Joseph M. Firestone, Ph.D. White Paper No. Two. March 12, 1997 1 of 11 5/24/02 3:50 PM Data Mining and KDD: A Shifting Mosaic By Joseph M. Firestone, Ph.D. White Paper No. Two March 12, 1997 The Idea of Data Mining Data Mining is an idea based on a simple analogy.

More information

Discovering, Not Finding. Practical Data Mining for Practitioners: Level II. Advanced Data Mining for Researchers : Level III

Discovering, Not Finding. Practical Data Mining for Practitioners: Level II. Advanced Data Mining for Researchers : Level III www.cognitro.com/training Predicitve DATA EMPOWERING DECISIONS Data Mining & Predicitve Training (DMPA) is a set of multi-level intensive courses and workshops developed by Cognitro team. it is designed

More information

Evaluating Data Mining Models: A Pattern Language

Evaluating Data Mining Models: A Pattern Language Evaluating Data Mining Models: A Pattern Language Jerffeson Souza Stan Matwin Nathalie Japkowicz School of Information Technology and Engineering University of Ottawa K1N 6N5, Canada {jsouza,stan,nat}@site.uottawa.ca

More information

Our unique perspective on brand and comms tracking

Our unique perspective on brand and comms tracking Our unique perspective on brand and comms tracking Hamish Asser Research Director Introducing BrandBox A powerful, flexible and transparent brand tracking tool that monitors brand performance and identifies

More information

1 What is Machine Learning?

1 What is Machine Learning? COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #1 Scribe: Rob Schapire February 4, 2008 1 What is Machine Learning? Machine learning studies computer algorithms for learning to do

More information

Masters in Information Technology

Masters in Information Technology Computer - Information Technology MSc & MPhil - 2015/6 - July 2015 Masters in Information Technology Programme Requirements Taught Element, and PG Diploma in Information Technology: 120 credits: IS5101

More information

Interactive Exploration of Decision Tree Results

Interactive Exploration of Decision Tree Results Interactive Exploration of Decision Tree Results 1 IRISA Campus de Beaulieu F35042 Rennes Cedex, France (email: pnguyenk,[email protected]) 2 INRIA Futurs L.R.I., University Paris-Sud F91405 ORSAY Cedex,

More information

MSc Finance & Business Analytics Programme Design. Academic Year 2014-15

MSc Finance & Business Analytics Programme Design. Academic Year 2014-15 MSc Finance & Business Analytics Programme Design Academic Year 2014-15 MSc Finance & Business Analytics The MSc Financial Management programme is divided into three distinct sections: The first semester

More information

Levels of Analysis and ACT-R

Levels of Analysis and ACT-R 1 Levels of Analysis and ACT-R LaLoCo, Fall 2013 Adrian Brasoveanu, Karl DeVries [based on slides by Sharon Goldwater & Frank Keller] 2 David Marr: levels of analysis Background Levels of Analysis John

More information

Statistics for BIG data

Statistics for BIG data Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before

More information

Facilitating Business Process Discovery using Email Analysis

Facilitating Business Process Discovery using Email Analysis Facilitating Business Process Discovery using Email Analysis Matin Mavaddat [email protected] Stewart Green Stewart.Green Ian Beeson Ian.Beeson Jin Sa Jin.Sa Abstract Extracting business process

More information

Making critical connections: predictive analytics in government

Making critical connections: predictive analytics in government Making critical connections: predictive analytics in government Improve strategic and tactical decision-making Highlights: Support data-driven decisions using IBM SPSS Modeler Reduce fraud, waste and abuse

More information

Intrusion Detection via Machine Learning for SCADA System Protection

Intrusion Detection via Machine Learning for SCADA System Protection Intrusion Detection via Machine Learning for SCADA System Protection S.L.P. Yasakethu Department of Computing, University of Surrey, Guildford, GU2 7XH, UK. [email protected] J. Jiang Department

More information

The top 10 secrets to using data mining to succeed at CRM

The top 10 secrets to using data mining to succeed at CRM The top 10 secrets to using data mining to succeed at CRM Discover proven strategies and best practices Highlights: Plan and execute successful data mining projects using IBM SPSS Modeler. Understand the

More information

Data Mining. SPSS Clementine 12.0. 1. Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

Data Mining. SPSS Clementine 12.0. 1. Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine Data Mining SPSS 12.0 1. Overview Spring 2010 Instructor: Dr. Masoud Yaghini Introduction Types of Models Interface Projects References Outline Introduction Introduction Three of the common data mining

More information

Chapter 11. Managing Knowledge

Chapter 11. Managing Knowledge Chapter 11 Managing Knowledge VIDEO CASES Video Case 1: How IBM s Watson Became a Jeopardy Champion. Video Case 2: Tour: Alfresco: Open Source Document Management System Video Case 3: L'Oréal: Knowledge

More information

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots? Class 1 Data Mining Data Mining and Artificial Intelligence We are in the 21 st century So where are the robots? Data mining is the one really successful application of artificial intelligence technology.

More information

Chapter 6 - Enhancing Business Intelligence Using Information Systems

Chapter 6 - Enhancing Business Intelligence Using Information Systems Chapter 6 - Enhancing Business Intelligence Using Information Systems Managers need high-quality and timely information to support decision making Copyright 2014 Pearson Education, Inc. 1 Chapter 6 Learning

More information

Solve Your Toughest Challenges with Data Mining

Solve Your Toughest Challenges with Data Mining IBM Software Business Analytics IBM SPSS Modeler Solve Your Toughest Challenges with Data Mining Use predictive intelligence to make good decisions faster Solve Your Toughest Challenges with Data Mining

More information

Using Data Mining to Detect Insurance Fraud

Using Data Mining to Detect Insurance Fraud IBM SPSS Modeler Using Data Mining to Detect Insurance Fraud Improve accuracy and minimize loss Highlights: Combine powerful analytical techniques with existing fraud detection and prevention efforts Build

More information