Data Mining Techniques Chapter 7: Artificial Neural Networks



Artificial Neural Networks
Neural network example
Inputs and output
Neural network process
What is a neural net?
A neuron
Feed-forward neural net example
How neural nets learn
Neural net heuristics
Choosing training sample
Preparing data
Other considerations

Artificial Neural Networks

Biological neural networks: the human brain enables people to generalize from experience.
Artificial neural networks: computers generalize and learn from experience.
Black box technique: little insight into how the model works or what the computer has done.
History: ideas developed in computer science and artificial intelligence.
Case study: real estate appraisal (pp. 213-8):
    realtors combine features of a house (size, age, neighborhood, etc.) to come up with a valuation;
    a computer can do the same thing via a neural network model (see next slide).

© Iain Pardoe, 2006

Neural network example

Inputs are well understood, but how to combine (weight) them is unknown.
Output is well understood.
Experience is available (lots of training data).
Training presents known examples (data from previous sales) to the network so that it can learn how to calculate the sales price:
    e.g., iteratively adjust weights to find the best ones for predicting price (minimize overall error).

Inputs and output

Inputs are usually standardized or rescaled to lie between -1 and +1 (to avoid inputs with large ranges dominating the network):
    quantitative: standardize (XLMiner: "normalize");
    quantitative (alternative): $2(X - m_X)/r_X$, where $m_X$ is halfway between the minimum and maximum of $X$ and $r_X$ is the range of $X$;
    ordered qualitative/categorical: map different categories to numbers between -1 and +1;
    unordered qualitative/categorical: create -1/+1 inputs for pairs of categories.
Output is usually in the range 0 to 1 (or sometimes -1 to +1) and needs to be mapped back to the original scale for deployment.
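The two rescaling options for quantitative inputs, and the mapping of a network output back to the original scale, can be sketched as follows. This is a minimal illustration, not XLMiner's implementation: the function names and the house-size values are hypothetical, the population standard deviation is an assumed choice, and a constant input (zero range or zero standard deviation) is not handled.

```python
from statistics import mean, pstdev

def standardize(values):
    """Z-score: subtract the mean, divide by the standard deviation.
    Using the population standard deviation is an assumption."""
    m = mean(values)
    s = pstdev(values)
    return [(v - m) / s for v in values]

def rescale(values):
    """Map values to [-1, +1] using 2(X - m_X) / r_X."""
    lo, hi = min(values), max(values)
    midpoint = (lo + hi) / 2    # m_X: halfway between minimum and maximum
    value_range = hi - lo       # r_X: the range
    return [2 * (v - midpoint) / value_range for v in values]

def map_output_back(y, lo, hi):
    """Map a network output in [0, 1] back to the original scale [lo, hi]."""
    return lo + y * (hi - lo)

sizes = [1000, 1500, 2000, 3000]   # hypothetical house sizes
scaled = rescale(sizes)            # smallest maps to -1, largest to +1
price = map_output_back(0.5, 100000, 300000)  # hypothetical price range
```

The minimum, maximum, mean, and standard deviation would normally be computed on the training sample and reused unchanged at deployment.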

Neural network process

Each iteration (adjusting weights) is called a generation or epoch.
The final epoch is reached when the error rate cannot be reduced further: it fits the training data best, but probably overfits.
An earlier epoch usually fits the validation data better.
Neural networks for directed data mining (classification or prediction):
    identify inputs/output;
    standardize data;
    set up network topology (see later);
    train network with training sample;
    use validation sample to select weights that minimize errors;
    evaluate final model using test sample;
    deploy model to predict unknown output for known inputs.

What is a neural net?

[Figure]
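Returning to the validation step of the neural network process: choosing the epoch whose weights minimize validation error can be sketched as below. The error values are made up for illustration, and `select_epoch` is a hypothetical helper, not part of any particular package.

```python
def select_epoch(validation_errors):
    """Return the 1-based epoch with the lowest validation error."""
    best_index = min(range(len(validation_errors)),
                     key=lambda i: validation_errors[i])
    return best_index + 1

# Hypothetical error curves, one value per epoch.
# Training error keeps falling; validation error turns back up (overfitting).
training_errors = [0.40, 0.30, 0.22, 0.17, 0.14, 0.12]
validation_errors = [0.42, 0.33, 0.27, 0.25, 0.26, 0.29]

best_epoch = select_epoch(validation_errors)
final_epoch = select_epoch(training_errors)
```

In this made-up example the final epoch fits the training data best, while an earlier epoch is the one that would be kept.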

A neuron

XLMiner has two types of transfer function:
    standard (logistic);
    symmetric (hyperbolic tangent).
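A single neuron with either transfer function might be sketched as follows. The weighted-sum-plus-bias form is the usual textbook description and is assumed here, since the slide's diagram is not part of the transcription; the weights and inputs are arbitrary illustrative values.

```python
import math

def logistic(z):
    """Standard transfer function: output lies between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

def neuron(inputs, weights, bias, transfer=logistic):
    """Combine inputs as a weighted sum plus bias, then apply the transfer function."""
    z = bias + sum(w * x for w, x in zip(weights, inputs))
    return transfer(z)

x = [0.5, -1.0]   # inputs already rescaled to [-1, +1]
w = [0.8, 0.3]    # illustrative weights
standard = neuron(x, w, bias=0.1)                        # logistic
symmetric = neuron(x, w, bias=0.1, transfer=math.tanh)   # hyperbolic tangent
```

The choice of transfer function sets the output range: 0 to 1 for the logistic, -1 to +1 for the hyperbolic tangent.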

Feed-forward neural net example

[Figure]

How neural nets learn

Learn using back propagation: feed classification/prediction errors back through the network and adjust weights to reduce errors.
Alternative methods, e.g., conjugate gradient: see p. 230 (jolly green giant).
After a number of iterations/generations/epochs, errors and weights no longer change and the algorithm stops.
Tuning parameters:
    learning rate (step size for gradient descent);
    weight change momentum.
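A minimal sketch of back propagation with a learning rate and weight change momentum is given below. It assumes one hidden layer, logistic transfer functions, squared error, and weight updates after every example; none of these choices comes from the slides, and the tiny OR data set is purely illustrative.

```python
import math
import random

def logistic(z):
    return 1.0 / (1.0 + math.exp(-z))

def train(data, n_hidden=3, learning_rate=0.5, momentum=0.5, epochs=500, seed=0):
    """Train a one-hidden-layer network; return the squared error per epoch."""
    rng = random.Random(seed)
    n_in = len(data[0][0])
    # Each unit's last weight is its bias (paired with a constant input of 1).
    w_h = [[rng.uniform(-1, 1) for _ in range(n_in + 1)] for _ in range(n_hidden)]
    w_o = [rng.uniform(-1, 1) for _ in range(n_hidden + 1)]
    dw_h = [[0.0] * (n_in + 1) for _ in range(n_hidden)]  # previous weight changes
    dw_o = [0.0] * (n_hidden + 1)
    history = []
    for _ in range(epochs):
        sse = 0.0
        for x, target in data:
            # Forward pass.
            xb = list(x) + [1.0]
            h = [logistic(sum(w * v for w, v in zip(ws, xb))) for ws in w_h]
            hb = h + [1.0]
            y = logistic(sum(w * v for w, v in zip(w_o, hb)))
            err = target - y
            sse += err * err
            # Backward pass: feed the error back through the network.
            delta_o = err * y * (1 - y)
            delta_h = [delta_o * w_o[j] * h[j] * (1 - h[j]) for j in range(n_hidden)]
            # Weight change = learning rate * gradient step + momentum * last change.
            for j in range(n_hidden + 1):
                dw_o[j] = learning_rate * delta_o * hb[j] + momentum * dw_o[j]
                w_o[j] += dw_o[j]
            for j in range(n_hidden):
                for i in range(n_in + 1):
                    dw_h[j][i] = learning_rate * delta_h[j] * xb[i] + momentum * dw_h[j][i]
                    w_h[j][i] += dw_h[j][i]
        history.append(sse)
    return history

or_data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
history = train(or_data)
```

Comparing `history[0]` with `history[-1]` shows how far the error has fallen; a stopping rule based on the error no longer changing could replace the fixed number of epochs.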

Neural net heuristics

Number of hidden layers and units/nodes per layer: trial and error;
    too low: poor fit;
    too high: can recognize more patterns but may overfit.
Size of training set: aim for at least 30 observations per weight,
    e.g., for 15 inputs, a hidden layer with 10 nodes, and 1 output, aim for 30[10(15+1)+(10+1)] = 5130 observations.
Learning rate (0.1-0.9): start high, then decrease step size for gradient descent.
Weight change momentum (0-2): allows the network to find a solution quicker.

Choosing training sample

Coverage of values for all features (inputs and output).
Number of features (variable selection), e.g., use background knowledge, intuition, or methods like decision trees first.
Size of training set (see previous slide).
Proportions of output categories (perhaps consider oversampling rare cases).

Preparing data

Features with quantitative continuous values:
    transform (standardize or map to (-1, +1));
    bin using quartiles or quintiles if highly skewed (or use log transform).
Features with ordered discrete (integer) values:
    map to (-1, +1);
    thermometer codes (see p. 238).
Features with qualitative/categorical values:
    use dummy indicator variables (1s and 0s);
    alternative: effects coding (1s, 0s, and -1s);
    with L levels/categories only need L - 1 variables.
Other types of features, e.g., dates and addresses, need special handling.
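The three coding schemes mentioned for discrete and categorical features can be sketched as follows. The function names are hypothetical, the last level is arbitrarily taken as the reference category, and the thermometer code shown (one more leading 1 per step up the ordering) is one common form that may differ in detail from the book's version.

```python
def dummy_code(value, levels):
    """L - 1 indicator variables (1s and 0s); the last level is the reference."""
    return [1 if value == level else 0 for level in levels[:-1]]

def effects_code(value, levels):
    """Like dummy coding, but the reference level is coded as all -1s."""
    if value == levels[-1]:
        return [-1] * (len(levels) - 1)
    return dummy_code(value, levels)

def thermometer_code(value, ordered_levels):
    """Ordered values: each step up the ordering switches on one more input."""
    rank = ordered_levels.index(value)
    return [1 if i < rank else 0 for i in range(len(ordered_levels) - 1)]

neighborhoods = ["north", "south", "west"]   # hypothetical categories, L = 3
bedrooms = [1, 2, 3, 4]                      # hypothetical ordered integer values

dummy = dummy_code("south", neighborhoods)
effects = effects_code("west", neighborhoods)
thermo = thermometer_code(3, bedrooms)
```

Each scheme uses L - 1 inputs for L levels; the schemes differ only in how the reference level and the ordering are represented.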

Other considerations

Interpreting results: convert output to a quantitative prediction for a prediction network or a probability for a classification network.
Neural networks for time series.
Neural networks are hard to understand (black box), but sensitivity analysis can offer limited insight into input effects on a test set.
Self-organizing maps (aka Kohonen networks) use undirected neural networks for clustering.
Examples:
    pp. 252-4 in book;
    BMW airbags: neural networks used to determine which airbags to deploy in a crash based on characteristics of the crash.
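One common form of sensitivity analysis is sketched below, as an assumption about what the slide has in mind: hold every input at its test-set average, move one input from its minimum to its maximum, and record how much the output changes. `toy_model` is a hypothetical stand-in for a trained network, and the test rows are made up.

```python
from statistics import mean

def sensitivity(predict, test_rows):
    """Output change when each input moves from its min to its max,
    with all other inputs held at their test-set averages."""
    n_inputs = len(test_rows[0])
    columns = [[row[i] for row in test_rows] for i in range(n_inputs)]
    baseline = [mean(col) for col in columns]
    swings = []
    for i, col in enumerate(columns):
        low = baseline.copy()
        low[i] = min(col)
        high = baseline.copy()
        high[i] = max(col)
        swings.append(predict(high) - predict(low))
    return swings

def toy_model(x):
    """Hypothetical stand-in for a trained network's prediction function."""
    return 0.5 + 0.3 * x[0] - 0.1 * x[1]

test_rows = [[-1.0, -1.0], [0.0, 1.0], [1.0, 0.0]]   # made-up rescaled inputs
swings = sensitivity(toy_model, test_rows)
```

A larger absolute swing suggests a more influential input, though this one-at-a-time view ignores interactions between inputs, which is part of why the insight is limited.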