American International Journal of Research in Science, Technology, Engineering & Mathematics



Similar documents
Genetic Neural Approach for Heart Disease Prediction

Role of Neural network in data mining

Data Mining using Artificial Neural Network Rules

Chapter 4: Artificial Neural Networks

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

How To Use Neural Networks In Data Mining

Price Prediction of Share Market using Artificial Neural Network (ANN)

Impact of Feature Selection on the Performance of Wireless Intrusion Detection Systems

NEURAL NETWORKS IN DATA MINING

MANAGING QUEUE STABILITY USING ART2 IN ACTIVE QUEUE MANAGEMENT FOR CONGESTION CONTROL

Neural Networks and Support Vector Machines

Power Prediction Analysis using Artificial Neural Network in MS Excel

Keywords: Image complexity, PSNR, Levenberg-Marquardt, Multi-layer neural network.

SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH

APPLICATION OF ARTIFICIAL NEURAL NETWORKS USING HIJRI LUNAR TRANSACTION AS EXTRACTED VARIABLES TO PREDICT STOCK TREND DIRECTION

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

Artificial Neural Network Approach for Classification of Heart Disease Dataset

Comparison of K-means and Backpropagation Data Mining Algorithms

OPTIMUM LEARNING RATE FOR CLASSIFICATION PROBLEM

Effective Data Mining Using Neural Networks

Stock Market Prediction Model by Combining Numeric and News Textual Mining

Neural Network Design in Cloud Computing

Recurrent Neural Networks

Keywords: Data Mining, Neural Networks, Data Mining Process, Knowledge Discovery, Implementation. I. INTRODUCTION

D A T A M I N I N G C L A S S I F I C A T I O N

Keywords: Learning, neural networks, fuzzy systems, perceptron, neuro-fuzzy. 1 Introduction

Lecture 6. Artificial Neural Networks

Chapter 12 Discovering New Knowledge Data Mining

Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network

Neural Network Applications in Stock Market Predictions - A Methodology Analysis

REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES

A Data Mining Study of Weld Quality Models Constructed with MLP Neural Networks from Stratified Sampled Data

A Review of Data Mining Techniques

Comparison of Supervised and Unsupervised Learning Classifiers for Travel Recommendations

Evaluation of Feature Selection Methods for Predictive Modeling Using Neural Networks in Credits Scoring

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

Utilization of Neural Network for Disease Forecasting

Stock Prediction using Artificial Neural Networks

A New Approach For Estimating Software Effort Using RBFN Network

STOCK PREDICTION SYSTEM USING ANN

Neural Networks and Back Propagation Algorithm

Supply Chain Forecasting Model Using Computational Intelligence Techniques

SELECTING NEURAL NETWORK ARCHITECTURE FOR INVESTMENT PROFITABILITY PREDICTIONS

A Simple Feature Extraction Technique of a Pattern By Hopfield Network

A Prediction Model for Taiwan Tourism Industry Stock Index

6.2.8 Neural networks for data mining

A Content based Spam Filtering Using Optical Back Propagation Technique

Use of Artificial Neural Network in Data Mining For Weather Forecasting

AN APPLICATION OF TIME SERIES ANALYSIS FOR WEATHER FORECASTING

CLASSIFICATION AND PREDICTION IN DATA MINING WITH NEURAL NETWORKS

Analecta Vol. 8, No. 2 ISSN

Ms. Aruna J. Chamatkar Assistant Professor in Kamla Nehru Mahavidyalaya, Sakkardara Square, Nagpur

Weather forecast prediction: a Data Mining application

Neural Networks for Sentiment Detection in Financial Text

Prediction of Cancer Count through Artificial Neural Networks Using Incidence and Mortality Cancer Statistics Dataset for Cancer Control Organizations

Prediction Model for Crude Oil Price Using Artificial Neural Networks

NTC Project: S01-PH10 (formerly I01-P10) 1 Forecasting Women s Apparel Sales Using Mathematical Modeling

Neural Network and Genetic Algorithm Based Trading Systems. Donn S. Fishbein, MD, PhD Neuroquant.com

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Software and Web Sciences (IJSWS)

Data Mining Techniques Chapter 7: Artificial Neural Networks

Real-Time Credit-Card Fraud Detection using Artificial Neural Network Tuned by Simulated Annealing Algorithm

Keywords - Intrusion Detection System, Intrusion Prevention System, Artificial Neural Network, Multi Layer Perceptron, SYN_FLOOD, PING_FLOOD, JPCap

Neural Networks in Data Mining

Performance Evaluation of Artificial Neural. Networks for Spatial Data Analysis

Design call center management system of e-commerce based on BP neural network and multifractal

DATA MINING TECHNIQUES AND APPLICATIONS

Data Mining and Soft Computing. Francisco Herrera

Knowledge Based Descriptive Neural Networks

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR.

Big Data with Rough Set Using Map- Reduce

Building an Iris Plant Data Classifier Using Neural Network Associative Classification

Introduction to Artificial Neural Networks

Comparative Analysis of Classification Algorithms on Different Datasets using WEKA

Neural Networks algorithms and applications

Optimum Design of Worm Gears with Multiple Computer Aided Techniques

ANN Model to Predict Stock Prices at Stock Exchange Markets

Sanjeev Kumar. contribute

Neural Network Predictor for Fraud Detection: A Study Case for the Federal Patrimony Department

degrees of freedom and are able to adapt to the task they are supposed to do [Gupta].

Novelty Detection in image recognition using IRF Neural Networks properties

TRAINING A LIMITED-INTERCONNECT, SYNTHETIC NEURAL IC

Genetic Algorithm Based Interconnection Network Topology Optimization Analysis

A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE

International Journal of Emerging Technology & Research

Manjeet Kaur Bhullar, Kiranbir Kaur Department of CSE, GNDU, Amritsar, Punjab, India

Data Mining and Neural Networks in Stata

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

Neural network software tool development: exploring programming language options

COMBINED NEURAL NETWORKS FOR TIME SERIES ANALYSIS

Study Of Hybrid Genetic Algorithm Using Artificial Neural Network In Data Mining For The Diagnosis Of Stroke Disease

Method of Combining the Degrees of Similarity in Handwritten Signature Authentication Using Neural Networks

2. IMPLEMENTATION. International Journal of Computer Applications ( ) Volume 70 No.18, May 2013

A New Approach for Evaluation of Data Mining Techniques

EVALUATION OF NEURAL NETWORK BASED CLASSIFICATION SYSTEMS FOR CLINICAL CANCER DATA CLASSIFICATION

A Time Series ANN Approach for Weather Forecasting

THE HUMAN BRAIN. observations and foundations

Transcription:

American International Journal of Research in Science, Technology, Engineering & Mathematics Available online at http://www.iasir.net ISSN (Print): 2328-349, ISSN (Online): 2328-3580, ISSN (CD-ROM): 2328-3629 AIJRSTEM is a refereed, indexed, peer-reviewed, multidisciplinary and open access journal published by International Association of Scientific Innovation and Research (IASIR), USA (An Association Unifying the Sciences, Engineering, and Applied Research) Study and Review of Genetic Neural Approaches for Data Mining Nilakshi P. Waghulde, Nilima Patil 2 CSE Department, SSBT s COET Bambhori, North Maharashtra University Jalgaon, Maharashtra, INDIA Abstract: Data mining techniques are used to explore analyze and extract data using complex algorithms in order to discover unknown patterns. Neural network which is one of the data mining techniques was proved to be universal approximator. Neural network is able to learn a mapping between input and output nodes while the hidden nodes and weights between them contain the internal representation of the input which trains the network with local convergence. As the initialization of neural network weights is a blind process and neural network is slow to converge then it is difficult to find global optimal solution. A fixed structure of the neural network may not provide the optimal performance within the training period so the number of hidden layers and hidden nodes for particular neural network also plays an important role. Hence, this paper presents a Genetic Neural Network technique that takes advantage of global optimization of genetic algorithm for initialization of neural network along with the calculation of the number of hidden nodes and hidden layers for neural network which train the network with proper selection of neural network architecture. Keywords: Neural network, genetic algorithm, data mining I. Introduction Data mining is one of the most important steps of the knowledge discovery in databases process and is considered as significant subfield in knowledge management.[] Data mining an interdisciplinary subfield of computer science is the computational process of discovering patterns in large dataset involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems.thus data mining is decision tree, neural network, rule induction, nearest neighbour, genetic algorithm. Artificial neural network, a data mining practice, is a interconnected group of artificial neurons that uses mathematical model or computational model for information processing based on connectionist approach of computation.[2] Neural Networks are form of a multiprocessor computer system, with simple processing elements, a high degree of interconnection, simple scalar messages, and adaptive interconnection between elements. Thus, neural computing is a information processing paradigm, inspired by biological system, composed of a large number of highly interconnected processing elements(neurons) working in unison to solve specific problem. As the Neural networks are models of biological neural structures. The starting point for most neural networks is a model neuron, as in Figure. This neuron consists of multiple inputs and a single output. Each input is modified by a weight, which multiplies with the input value. The neuron will combine these weighted inputs and, with reference to a threshold value and activation function, use these to determine its output. AIJRSTEM 4-70; 204, AIJRSTEM All Rights Reserved Page 44

A. Neural Network Architecture A neural network is usually a layered graph with the output of one node feeding into one or many other nodes in the next layer. An Artificial Neural Network Architecture is a data processing system, consisting of large number of interconnected processing elements such as neurons in a network structure that can be represented by using directed graph G, an order 2 tuple(v,e), consisting of set V of vertices and set E of edges. The Vertices may represent neurons (input/output) and the edges may represent synaptic links labeled by weights attached.[3] The multiple layers are arrayed one succeeding the other so that there is an input layer, multiple intermediate layers and finally an output layer, as in Figure 2. Intermediate layers, that is those that have no inputs or outputs to the external world, are called hidden layer. In fully connected network each neuron is connected to every output from the preceding layer or one input from the external world if the neuron is in the first layer and, correspondingly, each neuron has its output connected to every neuron in the succeeding layer. x v w y vm v2 w2 x2 xl 2 l v2m v3 v3m m w2 w22 w2n wn 2 n y2 yn Figure 2: Multi-Layer Feedforward Network The output of each neuron is a function of its inputs. In particular, the output of the jth neuron in any layer is described by two sets of equations: U j = (X i *w ij )[Eqn ] and Y j =F th (U j +t j )[Eqn 2] For every neuron, j, in a layer, each of the i inputs, X i, to that layer is multiplied by a previously established weight, w ij. These are all summed together, resulting in the internal value of this operation, U j. This value is then biased by a previously established threshold value, t j, and sent through an activation function, F th. This activation function is usually the sigmoid function, which has an input to output mapping as shown in Figure 4. The resulting output, Y j, is an input to the next layer or it is a response of the neural network if it is the last layer. Figure 3:Sigmod Function In essence, Equation implements the combination operation of the neuron and Equation 2 implements the firing of the neuron [3]. AIJRSTEM 4-70; 204, AIJRSTEM All Rights Reserved Page 45

From these equations, a predetermined set of weights, a predetermined set of threshold values and a description of the neural network it is possible to compute the response of the neural network to any set of inputs. The weights in artificial neurons are adjusted during training process.[4] Neural network acquires knowledge through learning. In general, the learning steps of a neural network are as follows. First, a network structure is defined with a fixed number of inputs, hidden nodes and outputs. Second, an algorithm is chosen to realize the learning process. The recent vast research activities in neural classification have established that neural networks are a promising alternative to various conventional classification methods.[5] Although significant progress has been made in classification related areas of neural networks, a number of issues in applying neural networks still remain and have not been solved successfully or completely. Neural network trains the input data pattern through different layers with local convergences which does not provides optimal solution to the problem. The network architecture plays important role in training the input data. Thus proposed system in this paper focus on uses of genetic algorithm for finding the global optimal solution along with the proper selection of the neural network architecture. The rest of the paper is organized as follows. In Section II, neural network structures and learning processes for data mining. In Section III, the Genetic Neural Network with proper selection of neural network architecture for data mining is proposed. II. LITERATURE SURVEY Neural networks are trained by using various learning techniques. The learning process of neural network uses different approaches for knowledge discovery. Different researchers have studied different neural networks learning approaches using different optimization techniques. As the neural network learn about the input data by its own this leads to mine the data to discover pattern. Neural network process the data pattern between input and output layer along with the hidden layers. The synaptic weights initialization and hidden layer identification is done by various ways to have optimal convergence. Frank H. F. Leung, H. K. Lam, S. H. Ling, and Peter K. S. Tam[6] proposed a three-layer neural network with switches introduced in some links is proposed to facilitate the tuning of the network structure in a simple manner. The number of hidden nodes is chosen manually by increasing it from a small number until the learning performance in terms of fitness value is good enough. Teresa B. Ludermir, Akio Yamazaki, and Cleber Zanchettin[7] developed the approach that combines the advantages of simulated annealing, tabu search and the backpropagation training algorithm in order to generate an automatic process for producing networks with high classification performance and low complexity. Chi-Keong Goh, Eu-Jin Teoh, and Kay Chen Tan[8] described a hybrid multiobjective evolutionary approach to ANN design which stochastic or performance driven and estimate the necessary number of neurons to be used in training of only single-hidden-layer feedforward neural network. David Hunter, Hao Yu, Michael S. Pukish, Janusz Kolbusz, and Bogdan M. Wilamowski. [9] performed comparison between different learning algorithms the Error Back Propagation (EBP) algorithm, the Levenberg Marquardt (LM) algorithm, and the recently developed Neuron-by-Neuron (NBN) algorithm, and different network topologies, including traditional Multilayer Perceptron (MLP) networks, Bridged Multilayer Perceptron (BMLP) networks, and Fully Connected Cascade (FCC) networks. They used the trial and error approach to find number of layers and number of neuron in each hidden layer. Out of this FCC architectures are proved to be powerful with largest number of connections across layers per number of neurons. Since the power of the FCC network increases dramatically with the number of neurons, usually not many trials are required, but FCC is required to provide the architecture with acceptable training error and the smallest number of neurons. Syed Umar Amin, Kavita Agarwal, Dr Rizwan Beg[0] developed the hybrid system that uses global optimization advantage of genetic algorithm for initialization of neural network weights. This technique uses the backpropagation algorithm to train the networks using the weights optimized by Genetic Algorithm. To determine the structure of network is challenging. A small network may not provide good performance owing to its limited information processing power. A large network, on the other hand, may have some connections redundant. As the structure of the network plays important role. Hence, it is possible by the proposed approach to determine the number of inputs, layers and hidden neurons of the neural network and then to initialize the neural weights by genetic algorithm. III. PROPOSED SYSTEM Architecture design of the neural network can be formulated as optimal problem. Optimization of neural network architectures and weights is an interesting approach for the generation of efficient networks. As the neural network learn through different layers which maps the input between input and output layer with hidden layers and weights as internal part till the network converge locally so it is important to select proper size, topology of the network and proper weight initialization for the network. AIJRSTEM 4-70; 204, AIJRSTEM All Rights Reserved Page 46

Dataset Calculate number of input and output nodes and number of hidden layers and number of hidden layer nodes Use fitness function for initialization of synaptic weights For each pattern in the dataset Repeat until converge Train the network using backpropagation learning technique Figure 4: Proposed System The proposed system of neural network with genetic algorithm shown in Figure 4 uses the genetic algorithm fitness function to initializes the weights that would make possible to have global optimal convergence. Neural network architecture can be constructed by identifying the input and output layer neuron along with number of hidden layers and hidden nodes identification. So when this improved genetic neural network approach applied to the data set can construct the network depending on the input dataset and can initializes the weight by fitness function of genetic algorithm. Genetic Algorithm is adaptive heuristic search algorithm based on the evolutionary ideas of natural selection and genetics. Thus Genetic-Neural Network takes the advantage of global optimization of genetic algorithm for initialization of neural network. IV. CONCLUSION In this paper, a Genetic Neural approach which uses Neural Networks and Genetic Algorithm to optimize the connection weights of ANN so as to improve the performance of the Artificial Neural Network along with the determination of the structure of the neural network is proposed. This proposed method could be able to provide fast learning, more stable and select the neural network architecture with acceptable training error and smallest number of neurons. Considering appropriate dataset the proposed method can be implemented in future. REFERENCES [] Tipawan Silwattananusarn and KulthidaTuamsuk, Data Mining and Its Applications for Knowledge Management : A Literature Review from 2007 to 202, International Journal of Data Mining & Knowledge Management Process (IJDKP), vol.2, no.5, September 202. [2] Hongjun Lu, Rudy Setiono, and Huan Liu, Effective Data Mining Using Neural Networks,IEEE Trans on Knowledge and data Engineering,vol.8,no.6,Dec996. [3] S. Rajasekaran and G. A. Vijayalakshmi Pai, Neural Networks, Fuzzy logic and Genetic Algorithms:Synthesis and Application,Eight Economy Edition, PHI 2003. [4] B. M. Wilamowsik, Neural Network Architectures and Learning Algorithm, IEEE Trans. Ind. Electron Mag.,vol.3,no.4,55-63,994. [5] Guoqiang Peter Zhang, Neural Networks for Classification: A Survey, IEEE Trans. On Systems, Man and Cybernetics,vol.30,no.4,Nov 2000. AIJRSTEM 4-70; 204, AIJRSTEM All Rights Reserved Page 47

[6] Frank H. F. Leung, H. K. Lam, S. H. Ling, and Peter K. S. Tam, Tuning of the Structure and Parameters of a Neural Networks Using an Improved Genetic Algorithm, IEEE Trans. on Neural Networks,vol.4,no.,Jan.2003. [7] Teresa B. Ludermir, Akio Yamazaki, and Cleber Zanchettin, An Optimization Methodology for Neural Network Weights and Architectures, IEEE Trans. on Neural Networks, vol.7,no.6,nov.2006. [8] Chi-Keong Gob, Eu-Jin Teoh, and Kay Chen Tan, Hybrid Multiobjective Evolutionary Design for Artificial Neural Networks,IEEE Trans. on Neural Networks,vol.9,no.9,Sept. 2008. [9] David Hunter, Hao Yu, Michael S. Pukish, Janusz Kolbusz and Bogdan M Wilamowski, Selection of Proper Neural Network Sizes and Architectures-A Comparative Study, IEEE Trans. on Industrial Informatics,vol.8, no.2,may 202. [0] Syed Umar Amin, Kavita Agarwal, Dr. Rizwan Beg, Genetic Neural Network Based Data Mining in Prediction of Heart Disease Using Risk Factor, in Proc. IEEE Conference on Information and Communication Technologies(ICT), 203. ACKNOWLEDGMENTS The authors feel a deep sense of gratitude to Prof. Dr. G. K. Patnaik HOD of Computer Science and Engineering Department for his motivation and support during this work. The authors are also thankful to the SSBT s COET, Jalgaon for being a constant source of inspiration. AIJRSTEM 4-70; 204, AIJRSTEM All Rights Reserved Page 48