Data Mining using Artificial Neural Network Rules



Similar documents
OPTIMUM LEARNING RATE FOR CLASSIFICATION PROBLEM

Study Of Hybrid Genetic Algorithm Using Artificial Neural Network In Data Mining For The Diagnosis Of Stroke Disease

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

American International Journal of Research in Science, Technology, Engineering & Mathematics

Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network

Chapter 4: Artificial Neural Networks

Impact of Feature Selection on the Performance of Wireless Intrusion Detection Systems

Comparison of K-means and Backpropagation Data Mining Algorithms

Effective Data Mining Using Neural Networks

DATA MINING AND REPORTING IN HEALTHCARE

A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE

Role of Neural network in data mining

New Ensemble Combination Scheme

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

DATA MINING TECHNIQUES AND APPLICATIONS

6.2.8 Neural networks for data mining

Artificial Neural Network Approach for Classification of Heart Disease Dataset

AnalysisofData MiningClassificationwithDecisiontreeTechnique

SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH

Decision Support System on Prediction of Heart Disease Using Data Mining Techniques

Data Mining Techniques Chapter 7: Artificial Neural Networks

Analecta Vol. 8, No. 2 ISSN

Keywords data mining, prediction techniques, decision making.

Building an Iris Plant Data Classifier Using Neural Network Associative Classification

A comparative study on the pre-processing and mining of Pima Indian Diabetes Dataset

Effective Analysis and Predictive Model of Stroke Disease using Classification Methods

Chapter 12 Discovering New Knowledge Data Mining

Stock Prediction using Artificial Neural Networks

A Hybrid Model of Hierarchical Clustering and Decision Tree for Rule-based Classification of Diabetic Patients

Feed-Forward mapping networks KAIST 바이오및뇌공학과 정재승

How To Use Neural Networks In Data Mining

FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS

Prediction of Heart Disease Using Naïve Bayes Algorithm

Sensitivity Analysis for Data Mining

REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

The Use of Data Mining Classification Techniques to Predict and Diagnose of Diseases

Performance Evaluation On Human Resource Management Of China S Commercial Banks Based On Improved Bp Neural Networks

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

Iranian J Env Health Sci Eng, 2004, Vol.1, No.2, pp Application of Intelligent System for Water Treatment Plant Operation.

EFFICIENT DATA MINING FOR MINING CLASSIFICATION USING NEURAL NETWORK

Data Mining Algorithms Part 1. Dejan Sarka

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

Prediction Model for Crude Oil Price Using Artificial Neural Networks

ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS

The Role of Size Normalization on the Recognition Rate of Handwritten Numerals

Neural Networks in Data Mining

APPLYING GMDH ALGORITHM TO EXTRACT RULES FROM EXAMPLES

An Analysis on Density Based Clustering of Multi Dimensional Spatial Data

Keywords: Image complexity, PSNR, Levenberg-Marquardt, Multi-layer neural network.

Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network

Manjeet Kaur Bhullar, Kiranbir Kaur Department of CSE, GNDU, Amritsar, Punjab, India

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

An Overview of Knowledge Discovery Database and Data mining Techniques

A New Method for Traffic Forecasting Based on the Data Mining Technology with Artificial Intelligent Algorithms

Neural Networks and Back Propagation Algorithm

Novel Mining of Cancer via Mutation in Tumor Protein P53 using Quick Propagation Network

PharmaSUG2011 Paper HS03

Quality Assessment in Spatial Clustering of Data Mining

degrees of freedom and are able to adapt to the task they are supposed to do [Gupta].

Design call center management system of e-commerce based on BP neural network and multifractal

Classification algorithm in Data mining: An Overview

Performance Evaluation of Artificial Neural. Networks for Spatial Data Analysis

Application of Neural Network in User Authentication for Smart Home System

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing Classifier

EVALUATION OF NEURAL NETWORK BASED CLASSIFICATION SYSTEMS FOR CLINICAL CANCER DATA CLASSIFICATION

From Knowledge Discovery to Implementation: A Business Intelligence Approach Using Neural Network Rule Extraction and Decision Tables

Learning is a very general term denoting the way in which agents:

NEURAL NETWORKS IN DATA MINING

A Data Mining Study of Weld Quality Models Constructed with MLP Neural Networks from Stratified Sampled Data

Chapter 2 The Research on Fault Diagnosis of Building Electrical System Based on RBF Neural Network

COMBINED METHODOLOGY of the CLASSIFICATION RULES for MEDICAL DATA-SETS

D A T A M I N I N G C L A S S I F I C A T I O N

Comparative Analysis of Classification Algorithms on Different Datasets using WEKA

Open Access Research on Application of Neural Network in Computer Network Security Evaluation. Shujuan Jin *

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM

Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

Data Mining Part 5. Prediction

Introduction. Chapter Before you start Formulation

Data Mining Solutions for the Business Environment

Data Mining On Diabetics

A Content based Spam Filtering Using Optical Back Propagation Technique

Diabetes Classification Using Cascaded Data Mining Technique

Application of Data Mining Techniques For Diabetic DataSet

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Dynamic Neural Networks for Actuator Fault Diagnosis: Application to the DAMADICS Benchmark Problem

More Data Mining with Weka

Graph Mining and Social Network Analysis

A Comparative Study of clustering algorithms Using weka tools

Price Prediction of Share Market using Artificial Neural Network (ANN)

A Secured Approach to Credit Card Fraud Detection Using Hidden Markov Model

Transcription:

Data Mining using Artificial Neural Network Rules Pushkar Shinde MCOERC, Nasik Abstract - Diabetes patients are increasing in number so it is necessary to predict, treat and diagnose the disease. Data Mining can help to provide knowledge about this disease. The knowledge extracted using Data Mining can help in treating and preventing the disease. Artificial Neural Network (ANN) can be used to create an classifier from the data. The neural network is trained using backpropagation algorithm The knowledge stored in the neural network is used to predict the disease. The knowledge stored in neural network is extracted using NeuroRule method. The knowledge extracted is in form of human readable rules which can used to analyze the disease amd in turn help in treating the disease. Keywords : Diabetes, data mining, neural network rule extraction, NeuroRule I. INTRODUCTION Diabetes is an chronic disease i.e. a long term disease which can be controlled but not cured. The number of patients suffering from diabetes is increasing. This has caused various researchers to work on controlling the disease. Diabetes is a condition due to malfunction in the pancreas. The pancreas doesn t produce enough insulin to control the sugar level in the blood. Data Mining can be used to extract knowledge from medical data. The knowledge extracted can help in predicting, diagnosing and preventing diabetes. Data mining is the process of discovering useful knowledge in data and also finding the inter-relation pattern among the data [1]. Data mining is used to extract new knowledge from existing data. The knowledge is hidden in the data, which is extracted using data mining. Data Mining requires large amount of data. Medical data can be used for data mining. The Pima Indian diabetes data set is used for performing data mining. The data set has been created for female patients with attributes like number of times pregnant, age, BMI, blood pressure, plasma glucose level. Attributes like this can give more knowledge about their effect on diabetes. The medical industry is among the most information intensive industries. Medical data keep growing on a daily basis. From this data useful knowledge should be extracted to provide quality health care.with the help of data mining methods, useful patterns and relationships of information can be found within the data, which can be utilized for diagnosis, prediction and detections of the trend of the disease [1]. ANN has been applied in many applications with remarkable success. For example, ANN have been successfully applied in the area of speech generation and recognition, handwritten character recognition, vision and robotics. ANN can be used to extract knowledge from the data. The knowledge extracted using ANN is stored in the form of neural network. This knowledge is not comprehensible. So ANN is also called as an black box. ANN can be used as an classifier, but doesn t easily provide information about the classification decision. To overcome this limitation an technique called NeuroRule is used as an decompositional technique. This technique extracts knowledge from the neural network as simple human readable rules. These rules provide further knowledge about the disease. These rules also help in justifying the neural networks classification decision. These rules provide information about diabetes which can be used in treating, diagnosing and preventing diabetes. In this paper an approach named Artificial Neural Network Rules (ANNR), i.e. ANN training preceded by rules extraction method. This approach is helpful for utilizing the power of ANN in data mining applications where Vol. 3 Issue 1 October 2013 157 ISSN: 2319 1058

comprehensibility is as important as the generalization ability. In other words this method overcome the "black box nature" of ANN. This method is tested on diabetic data set. This paper is organized as follows. In section 2, we briefly explain the basic concepts of ANN. Section 3 presents the rule extraction, section 4 gives the experimental result and followed by conclusion in section 5. II. NEURAL NETWORK A three layer neural network having thirty two neurons in the input layer, nine neurons in the hidden layer, one neuron in the output layer is considered (shown in Figure 1). The input data is binarized and given as input to input layer. Each attribute is clustered into an range and four clusters are formed for each attribute. Backpropagation algorithm is used to train the neural network. Backpropagation algorithm is an gradient descent algorithm. The algorithm initializes the weights of the neural network to a random value initially. The network activation values are calculated by multiplying the weights with the input neurons. The activation function used is an sigmoid function which is calculated using hyperbolic tangent of the summation. To determine the optimal weights for a classification and prediction problem is, to minimize the error. The most common method to define a cost function is, as the root mean square error function between the networks predicted output (Yk) and the expected output (Ok). This is commonly known as the root mean square error (RMS error) cost function. is used., and it is computed using equation (1) E(p) = where P is the total number of patterns in the data set, is the output units, Ok is the target value at the kth output neuron for pth sample and Yk is the actual output at the kth output neuron for the pth sample. Figure 1:- Three layer Neural Network An approach to determine the weights minimize the cost function is to use the stochastic algorithm: gradient descent with back-propagation. This is one of the simplest network training algorithms and is also known as steepest descent. With gradient descent, the initial weight vector Wold is often chosen at random, then with each iteration the weights are updated such that we move a distance in the opposite direction of the error function gradient in each iteration (1). The weights are updated at each iteration as follows: Vol. 3 Issue 1 October 2013 158 ISSN: 2319 1058

Where n is the is called the learning rate. A large learning rate leads to rapid learning but the weights may oscillate while lower learning rates leads to slower learning. The learning rate is fixed at 0.01 in this work. III. RULE EXTRACTION The neural network once trained contains knowledge about diabetes. The neural network is used to classify and predict the class of the input person. The knowledge that is hidden in the neural network can be extracted using decompositional technique like NeuroRule[2]. The NeuroRule technique computes the hidden neurons that activate the output neuron. The hidden neurons that have weights greater than an active threshold value are considered. Once the hidden neurons are selected the input neurons that activate the hidden neurons are selected and using this information the rules are extracted. The rules were extracted from the neural network by identifying the hidden neurons that have activation level above the threshold. The input-hidden rules for the corresponding hidden neurons are identified. NeuroRule technique is used for rule extraction. To facilitate the rule extraction the values of the numeric attributes were discretized. Each of the six attributes with numeric values was discretized by dividing its range into subintervals. For example the plasma glucose value is having range 81 to 199 is divided into four subintervals. The rules were extracted such that there are positive and negative rules. The positive rules are given such that they give the attribute values causing diabetes. Negative rules are given which give attribute values for not having diabetes. IV. EXPERIMENTAL RESULTS 4.1. DATA SET The Pima Indians Diabetes database [8] is used to test the proposed procedure. The total data is 768 from which 461 samples (60%) are randomly chosen and used as training patterns and tested with 307 instances (40%) of the same data set. Each instances sample represents eight attributes of female patients of Pima Indian heritage. The eight attributes are namely, number of times pregnant, plasma glucose concentration (2 hours in an oral glucose tolerance test), Diastolic blood pressure (mm Hg), Triceps skin fold thickness (mm), 2 hours serum insulin (mu/ml), body mass index (weight in kg/(height in m)2 Diabetes pedigree function, Age (years). 4.2. TRAINED NEURAL NETWORK First the network is trained using backpropagation algorithm. The trained network uses sigmoid activation function. The activation threshold has been taken as 0.7 and accordingly the RMS error has been calculated for the neural network. Table 1: Neural Network training performance In Neurons :32 Hidden Neurons: 8 No Epoch Time (s) Output Neurons: 1 Train data accuracy Learning rate : 0.01 Test data accuracy 1 5000 150 90% 90% 2 5000 160 89% 90% 3 5000 150 90% 90% 4 5000 153 89.5% 89% 5 10000 250 90.5% 90% Table 2 : Extracted Rule Accuracy Vol. 3 Issue 1 October 2013 159 ISSN: 2319 1058

Figure 2 : RMS error Vol. 3 Issue 1 October 2013 160 ISSN: 2319 1058

Figure 3: Part of Diabetes rules extracted Figure 3: Rules for diabetes Vol. 3 Issue 1 October 2013 161 ISSN: 2319 1058

VI. CONCLUSION In this paper, with the proposed ANNR approach, ANN can be utilized in data mining application. First ANN was trained using backpropagation algorithm, then the rules are extracted from the ANN using NeuroRule method. Rule extraction using ANNTR from trained ANN is more accurate. It also can work on continuous and discrete data. ANNR approach was used on diabetes data set, where it shows that this approach could benefit diabetes diagnosis because it could generate rules with strong generalization and comprehensibility ability. The knowledge gained is comprehensible and can enhance the decision making process by the doctors and will be a valuable tool for diabetes researchers. Future work on ANNR will be on reducing the redundant rules. REFERENCES [1] S. Kalaiarasi Anbananthen Sainarayanan, Ali Chekima, Jason Teo Data Mining using Artificial Neural Network Tree, IEEE Conference on Computers, Communications and Signal Processing. [2] Hongjun Lu, Rudy Setino, Huan Liu, "NeuroRule A connectionist Approach to Data Mining",VLDB Conference [3] Jiawei Han, Micheline Kamber Data Mining Concepts and Techniques. [4] S.Kalaiarasi Anbananthen, Fabian H.P. Chan, [5] K.Y. Leong. Data Mining Using Decision Tree Induction of Neural Networks, 3rd Seminar on Science and Technology. Kota Kinabalu : SST. 2004 [6] http://www.1iacc.up.pt/ml/statlog/datasets/diabetes/diabetes.doc.html [7] Sethi, I.K. Layered Neural Net Design through Decision Trees. Circuits and Systems, IEEE International Symposium. 1990. [8] A. K. C. Wong and Y. Wang. Pattern Discovery: A Data Driven Approach to Decision Support. IEEE Trans. On Systems, Man and CyberneticsPart C: Applications and Reviews., vol. 33, pp.ii4-893.2003 [9] U.M. Fayyad, G.P. Shapiro, and P. Smyth. Advances in Knowledge Discovery and Data Mining. California: Menlo park. 1996. [10] A. K. C. Wong and Y. Wang. Pattern Discovery: A Data Driven Approach to Decision Support. IEEE Trans. On Systems, Man and CyberneticsPart C: Applications and Reviews., vol. 33, pp.ii4-893.2003 Vol. 3 Issue 1 October 2013 162 ISSN: 2319 1058