How To Use Neural Networks In Data Mining




International Journal of Electronics and Computer Science Engineering, p. 1449
Available online at www.ijecse.org, ISSN 2277-1956

Neural Networks in Data Mining

Priyanka Gaur
Department of Information and Technology
Marudhar Engineering College, Bikaner, Rajasthan
Email: er.priyankagaur@gmail.com

Abstract: Neural networks are widely applied in data mining. Although they may have a complex structure, long training times, and results that are not easy to interpret, neural networks tolerate noisy data well and achieve high accuracy, which makes them attractive for data mining. This paper examines data mining based on neural networks in detail, together with the key technologies and the ways to implement it.

Keywords: Data mining, neural networks, artificial neural network (ANN), data mining process, implementation.

I. INTRODUCTION

Data mining is the term used to describe the process of extracting value from a database. A data warehouse is a location where information is stored. The type of data stored depends largely on the type of industry and the company. Many companies store every piece of data they have collected, while others are more ruthless in what they deem to be important.

Data mining involves the use of sophisticated data analysis tools to discover previously unknown, valid patterns and relationships in large data sets. These tools can include statistical models, mathematical algorithms, and machine learning methods (algorithms that improve their performance automatically through experience, such as neural networks or decision trees). Consequently, data mining consists of more than collecting and managing data; it also includes analysis and prediction.

A number of advances in technology and business processes have contributed to a growing interest in data mining in both the public and private sectors. These changes include the growth of computer networks, which can be used to connect databases; the development of enhanced search-related techniques such as neural networks and advanced algorithms; the spread of the client/server computing model, which allows users to access centralized data resources from the desktop; and an increased ability to combine data from disparate sources into a single search source.

II. NEURAL NETWORK

Neural networks represent a brain metaphor for information processing. These models are biologically inspired rather than an exact replica of how the brain actually functions. Neural networks have been shown to be very promising systems in many forecasting and business classification applications because of their ability to learn from the data, their nonparametric nature (i.e., no rigid assumptions), and their ability to generalize.

Neural computing refers to a pattern recognition methodology for machine learning. The resulting model from neural computing is often called an artificial neural network (ANN) or a neural network. Neural networks have been used in many business applications for pattern recognition, forecasting, prediction, and classification. Neural network computing is a key component of any data mining tool kit.

Figure 1. Two interconnected biological cells.
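As a rough illustration of the building block behind these models, a single artificial neuron forms a weighted sum of its inputs and passes it through an activation function. The sketch below is only a minimal example under stated assumptions: the function names and the AND weights are hypothetical and do not come from the paper.

```python
import math


def step(net_input):
    """Threshold activation: output 1 when the net input is non-negative."""
    return 1 if net_input >= 0 else 0


def sigmoid(net_input):
    """Smooth activation that squashes the net input into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-net_input))


def neuron(inputs, weights, bias, activation):
    """Weighted sum of the inputs plus a bias, passed through an activation."""
    net_input = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(net_input)


# Hypothetical weights that make a threshold neuron act like logical AND.
and_weights, and_bias = [1.0, 1.0], -1.5
and_outputs = [neuron([a, b], and_weights, and_bias, step)
               for a in (0, 1) for b in (0, 1)]
```

Swapping `step` for `sigmoid` gives a graded output instead of a hard decision, which is the form usually needed when the weights are to be learned by gradient-based training.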

IJECSE, Volume 1, Number 3, Priyanka Gaur et al.

Figure 2. Processing in an Artificial Neuron.

A. NEURAL NETWORK METHOD IN DATA MINING

The neural network method is used for classification, clustering, feature mining, prediction and pattern recognition. It imitates the neuron structure of animals and is based on the M-P model and the Hebb learning rule, so in essence it is a distributed matrix structure. By training on the data to be mined, the neural network method gradually calculates the weights of the connections in the network, through repeated iteration or cumulative calculation.

Neural network models can be broadly divided into the following three types:

(a) Feed-forward networks: represented by the perceptron, the back-propagation model and the function network, and mainly used in areas such as prediction and pattern recognition;
(b) Feedback networks: represented by the discrete and continuous Hopfield models, and mainly used for associative memory and optimization calculation;
(c) Self-organization networks: represented by the adaptive resonance theory (ART) model and the Kohonen model, and mainly used for cluster analysis.

NEURAL NETWORKS IN DATA MINING

In more practical terms, neural networks are non-linear statistical data modeling tools. They can be used to model complex relationships between inputs and outputs or to find patterns in data. Using neural networks as a tool, data warehousing firms are harvesting information from datasets in the process known as data mining. The difference between these data warehouses and ordinary databases is that there is actual manipulation and cross-fertilization of the data, which helps users make more informed decisions.

Neural networks essentially comprise three pieces: the architecture or model; the learning algorithm; and the activation functions. Neural networks are programmed or trained to store, recognize, and associatively retrieve patterns or database entries; to solve combinatorial optimization problems; to filter noise from measurement data; to control ill-defined problems; in summary, to estimate sampled functions when we do not know the form of the functions. It is precisely these two abilities (pattern recognition and function estimation) that make artificial neural networks (ANN) so prevalent a utility in data mining. As data sets grow to massive sizes, the need for automated processing becomes clear. With their model-free estimators and their dual nature, neural networks serve data mining in a myriad of ways.
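The ability to store and associatively retrieve patterns can be illustrated with a small discrete Hopfield network whose weights are set with a Hebbian rule. This is a minimal sketch, not the paper's implementation: the two stored patterns, the function names and the tie-breaking choice (a net input of zero maps to +1) are assumptions made for the example.

```python
def train_hebbian(patterns):
    """Build a symmetric weight matrix by summing the outer products of the
    stored +1/-1 patterns (Hebb rule), with no self-connections."""
    n = len(patterns[0])
    weights = [[0.0] * n for _ in range(n)]
    for pattern in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    weights[i][j] += pattern[i] * pattern[j]
    return weights


def recall(weights, state, max_sweeps=10):
    """Update one unit at a time until no unit changes (or sweeps run out)."""
    state = list(state)
    for _ in range(max_sweeps):
        changed = False
        for i in range(len(state)):
            net_input = sum(w * s for w, s in zip(weights[i], state))
            new_value = 1 if net_input >= 0 else -1
            if new_value != state[i]:
                state[i] = new_value
                changed = True
        if not changed:
            break
    return state


# Two hypothetical, mutually orthogonal patterns to memorize.
stored = [
    [1, 1, 1, 1, -1, -1, -1, -1],
    [1, -1, 1, -1, 1, -1, 1, -1],
]
weights = train_hebbian(stored)

# The first pattern with its first element flipped (a noisy probe).
noisy = [-1, 1, 1, 1, -1, -1, -1, -1]
recovered = recall(weights, noisy)
```

Here the corrupted probe settles back onto the first stored pattern, which is the sense in which such a network both retrieves patterns associatively and filters noise.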

Figure 3. Image of data-mining process.

Data mining is the business of answering questions that you have not asked yet. Data mining reaches deep into databases. Data mining tasks can be classified into two categories: descriptive and predictive data mining. Descriptive data mining provides information to understand what is happening inside the data without a predetermined idea. Predictive data mining allows the user to submit records with unknown field values, and the system guesses the unknown values based on patterns previously discovered from the database.

Data mining models can be categorized according to the tasks they perform: classification and prediction, clustering, and association rules. Classification and prediction are predictive models, whereas clustering and association rules are descriptive models.

i. Classification: The most common action in data mining is classification. It recognizes patterns that describe the group to which an item belongs. It does this by examining existing items that have already been classified and inferring a set of rules.
ii. Clustering: Clustering is similar to classification. The major difference is that no groups have been predefined.
iii. Prediction: Prediction is the construction and use of a model to assess the class of an unlabeled object, or to assess the value or value range that a given object is likely to have.
iv. Forecasting: The next application is forecasting. It differs from prediction because it estimates the future value of continuous variables based on patterns within the data.

Neural networks, depending on the architecture, provide associations, classifications, clusters, prediction and forecasting to the data mining industry. Financial forecasting is of considerable practical interest. Because neural networks can mine valuable information from a mass of historical data and can be used efficiently in financial areas, their application to financial forecasting has been very popular over the last few years.

In data warehouses, neural networks are just one of the tools used in data mining. ANNs are used to find patterns in the data and to infer rules from them. Neural networks are useful in providing information on associations, classifications, clusters, and forecasting. The back propagation algorithm performs learning on a feed-forward neural network.

III. DATA MINING PROCESS BASED ON NEURAL NETWORK

The data mining process consists of three main phases: (A) data preparation, (B) data mining, and (C) expression and interpretation of the results. The data mining process is an iteration of these three phases. The details are shown in Fig. 4.
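As noted at the end of Section II, the back propagation algorithm performs learning on a feed-forward neural network. The sketch below shows one way this can look for a tiny network with one hidden layer, trained with full-batch gradient descent on a squared-error loss. The network size, learning rate, random seed, XOR training set and all function names are assumptions chosen for illustration, not details from the paper.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def init_network(n_in, n_hidden, rng):
    """Random small weights for an n_in -> n_hidden -> 1 network."""
    def small():
        return rng.uniform(-1.0, 1.0)
    return {
        "w_hidden": [[small() for _ in range(n_in)] for _ in range(n_hidden)],
        "b_hidden": [small() for _ in range(n_hidden)],
        "w_out": [small() for _ in range(n_hidden)],
        "b_out": small(),
    }


def forward(net, x):
    """Forward pass: return the hidden activations and the single output."""
    hidden = [sigmoid(sum(w * xi for w, xi in zip(ws, x)) + b)
              for ws, b in zip(net["w_hidden"], net["b_hidden"])]
    y = sigmoid(sum(w * h for w, h in zip(net["w_out"], hidden)) + net["b_out"])
    return hidden, y


def mean_squared_error(net, data):
    return sum((forward(net, x)[1] - t) ** 2 for x, t in data) / len(data)


def train_step(net, data, rate):
    """One full-batch step: propagate the output error backwards through the
    current weights, then apply the averaged weight corrections."""
    scale = rate / len(data)
    passes = []
    for x, target in data:
        hidden, y = forward(net, x)
        delta_out = (y - target) * y * (1.0 - y)
        delta_hidden = [delta_out * w * h * (1.0 - h)
                        for w, h in zip(net["w_out"], hidden)]
        passes.append((x, hidden, delta_out, delta_hidden))
    for x, hidden, delta_out, delta_hidden in passes:
        net["b_out"] -= scale * delta_out
        for j, (h, d_h) in enumerate(zip(hidden, delta_hidden)):
            net["w_out"][j] -= scale * delta_out * h
            net["b_hidden"][j] -= scale * d_h
            for i, xi in enumerate(x):
                net["w_hidden"][j][i] -= scale * d_h * xi


# Hypothetical training set: the XOR function, which needs a hidden layer.
xor_data = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]
net = init_network(n_in=2, n_hidden=3, rng=random.Random(0))

loss_before = mean_squared_error(net, xor_data)
for _ in range(5000):
    train_step(net, xor_data, rate=0.5)
loss_after = mean_squared_error(net, xor_data)
```

Comparing `loss_before` with `loss_after` shows whether the error has fallen. How closely the outputs approach the targets depends on the seed, the learning rate and the number of steps, so this should be read as a demonstration of the weight-update mechanism rather than a tuned model.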

Figure 4. General data mining process.

Data mining based on neural networks consists of three phases: data preparation, rules extracting and rules assessment, as shown in Fig. 5.

Figure 5. Data mining process based on neural network.

A. Data Preparation

Data preparation defines and processes the data to be mined so that it fits the specific data mining method. It is the first important step in data mining and plays a decisive role in the entire data mining process. It mainly includes the following four processes:

a) Data cleaning: filling in missing values, eliminating noisy data and correcting inconsistencies in the data.
b) Data selection: selecting the data range and the rows to be used in this mining task.
c) Data preprocessing: applying enhancement processing to the cleaned data that has been selected.
d) Data expression: transforming the preprocessed data into a form that can be accepted by the data mining algorithm based on neural networks.

Data mining based on neural networks can only handle numerical data, so symbolic data must be transformed into numerical data. The simplest method is to establish a table with a one-to-one correspondence between the symbolic data and the numerical data. A more complex approach is to adopt an appropriate hash function that generates a unique numerical value from a given string.

Although there are many data types in a relational database, they can all basically be reduced to three logical data types: symbolic data, discrete numerical data and continuous numerical data. Fig. 6 gives the conversion between the three data types. The symbol "Apple" in the figure can be transformed into the corresponding discrete numerical data by using a symbol table or a hash function. The discrete numerical data can then be quantified into continuous numerical data, and can also be encoded into coding data.
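The data expression step above can be sketched as follows. This is an illustrative example only: the function names, the extra symbols "Pear" and "Orange", the choice of SHA-256 as the hash, the scaling to [0, 1] and the one-hot encoding are all assumptions rather than details given in the paper. Note also that a hash reduced to a fixed range cannot strictly guarantee a unique number for every string.

```python
import hashlib


def build_symbol_table(symbols):
    """One-to-one table from each distinct symbol to an integer code,
    assigned in order of first appearance."""
    table = {}
    for symbol in symbols:
        table.setdefault(symbol, len(table))
    return table


def hash_code(symbol, buckets=2 ** 32):
    """Alternative to the table: derive a repeatable integer from the string.
    Collisions are unlikely for small vocabularies but not impossible."""
    digest = hashlib.sha256(symbol.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % buckets


def quantify(code, n_codes):
    """Map a discrete code in 0..n_codes-1 onto the continuous range [0, 1]."""
    return code / (n_codes - 1) if n_codes > 1 else 0.0


def one_hot(code, n_codes):
    """Encode a discrete code as a vector with a single 1."""
    return [1 if i == code else 0 for i in range(n_codes)]


# "Apple" is the symbol used in the text; the other values are hypothetical.
fruits = ["Apple", "Pear", "Apple", "Orange"]

table = build_symbol_table(fruits)
codes = [table[f] for f in fruits]
continuous = [quantify(c, len(table)) for c in codes]
encoded = [one_hot(c, len(table)) for c in codes]
```

Scaling an arbitrary code to a continuous value implies an ordering among the symbols that may not exist, so a vector encoding such as `one_hot` is often the safer input representation when the symbols are unordered categories.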

Fig. 6. Data expression and conversion in data mining based on neural network.

B. Rules Extracting

There are many methods for extracting rules. The most commonly used are the LRE method, the black-box method, the method of extracting fuzzy rules, the method of extracting rules from recursive networks, the algorithm of binary input and output rules extracting (BIO-RE), the partial rules extracting algorithm (Partial-RE) and the full rules extracting algorithm (Full-RE).

C. Rules Assessment

Although the objective of rules assessment depends on each specific application, in general terms the rules can be assessed in accordance with the following objectives:

1) Find the optimal sequence of extracted rules, so that it obtains the best results on the given data set;
2) Test the accuracy of the extracted rules;
3) Detect how much knowledge in the neural network has not been extracted;
4) Detect any inconsistency between the extracted rules and the trained neural network.

IV. CONCLUSION

At present, neural networks are very suitable for solving data mining problems because of their good robustness, self-organizing adaptability, parallel processing, distributed storage and high degree of fault tolerance. Compared to statistical methods, neural networks are especially useful when there is no a priori knowledge about the analyzed data. They offer a powerful and distributed computing architecture with significant learning abilities, and they are able to represent highly nonlinear and multivariable relationships. Artificial neural networks offer qualitative methods for business and economic systems that traditional quantitative tools in statistics and econometrics cannot quantify, because of the complexity of translating the systems into precise mathematical functions.
Hence, the use of neural networks in data mining is a promising field of research, especially given the ready availability of large data sets and the reported ability of neural networks to detect and assimilate relationships between a large number of variables.