BREAST CANCER DIAGNOSIS USING STATISTICAL NEURAL NETWORKS

Similar documents
Data Mining Analysis (breast-cancer data)

Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS

Neural Network Add-in

Towards better accuracy for Spam predictions

ScienceDirect. Brain Image Classification using Learning Machine Approach and Brain Structure Analysis

Analecta Vol. 8, No. 2 ISSN

SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH

A Content based Spam Filtering Using Optical Back Propagation Technique

Using artificial intelligence for data reduction in mechanical engineering

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

A New Approach For Estimating Software Effort Using RBFN Network

Scalable Developments for Big Data Analytics in Remote Sensing

Supply Chain Forecasting Model Using Computational Intelligence Techniques

A SECURE DECISION SUPPORT ESTIMATION USING GAUSSIAN BAYES CLASSIFICATION IN HEALTH CARE SERVICES

A Simple Introduction to Support Vector Machines

Neural Networks and Support Vector Machines

SUCCESSFUL PREDICTION OF HORSE RACING RESULTS USING A NEURAL NETWORK

KNOWLEDGE BASED ANALYSIS OF VARIOUS STATISTICAL TOOLS IN DETECTING BREAST CANCER

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

Feature Selection for Stock Market Analysis

Visualization of Breast Cancer Data by SOM Component Planes

Novelty Detection in image recognition using IRF Neural Networks properties

Application of Data Mining Techniques to Model Breast Cancer Data

Comparison of Supervised and Unsupervised Learning Classifiers for Travel Recommendations

Predict Influencers in the Social Network

Data Mining Techniques for Prognosis in Pancreatic Cancer

Comparison of K-means and Backpropagation Data Mining Algorithms

A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE

An Introduction to Data Mining

degrees of freedom and are able to adapt to the task they are supposed to do [Gupta].

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

Forecasting of Economic Quantities using Fuzzy Autoregressive Model and Fuzzy Neural Network

Linear Classification. Volker Tresp Summer 2015

NEURAL NETWORKS A Comprehensive Foundation

1. Classification problems

Classification algorithm in Data mining: An Overview

The relation between news events and stock price jump: an analysis based on neural network

Electroencephalography Analysis Using Neural Network and Support Vector Machine during Sleep

Diagnosis of Breast Cancer Using Intelligent Techniques

Principles of Data Mining by Hand&Mannila&Smyth

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

Search Taxonomy. Web Search. Search Engine Optimization. Information Retrieval

An Evaluation of Neural Networks Approaches used for Software Effort Estimation

Role of Neural network in data mining

Component Ordering in Independent Component Analysis Based on Data Power

Feature Subset Selection in Spam Detection

Monotonicity Hints. Abstract

NEURAL networks [5] are universal approximators [6]. It

Chapter 2 The Research on Fault Diagnosis of Building Electrical System Based on RBF Neural Network

Chapter 6. The stacking ensemble approach

AUTOMATED CLASSIFICATION OF BLASTS IN ACUTE LEUKEMIA BLOOD SAMPLES USING HMLP NETWORK

Predictive Data modeling for health care: Comparative performance study of different prediction models

Optimum Design of Worm Gears with Multiple Computer Aided Techniques

Neural Network Design in Cloud Computing

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Evaluation of Feature Selection Methods for Predictive Modeling Using Neural Networks in Credits Scoring

Utilization of Neural Network for Disease Forecasting

STA 4273H: Statistical Machine Learning

AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.

Keywords data mining, prediction techniques, decision making.

Wireless Remote Monitoring System for ASTHMA Attack Detection and Classification

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR.

Artificial Neural Network Approach for Classification of Heart Disease Dataset

Evaluation of Machine Learning Techniques for Green Energy Prediction

Statistical Models in Data Mining

HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION

An Anomaly-Based Method for DDoS Attacks Detection using RBF Neural Networks

THREE DIMENSIONAL REPRESENTATION OF AMINO ACID CHARAC- TERISTICS

Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network

Novel Mining of Cancer via Mutation in Tumor Protein P53 using Quick Propagation Network

Recurrent Neural Networks

ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS

A Data Mining Study of Weld Quality Models Constructed with MLP Neural Networks from Stratified Sampled Data

Supervised Learning (Big Data Analytics)

Machine Learning in FX Carry Basket Prediction

American International Journal of Research in Science, Technology, Engineering & Mathematics

Data Mining. Nonlinear Classification

A Health Degree Evaluation Algorithm for Equipment Based on Fuzzy Sets and the Improved SVM

Stock Prediction using Artificial Neural Networks

Blood Vessel Classification into Arteries and Veins in Retinal Images

Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing Classifier

BREAST CANCER DIAGNOSIS VIA DATA MINING: PERFORMANCE ANALYSIS OF SEVEN DIFFERENT ALGORITHMS

KIDNEY DISEASE PREDICTION USING SVM AND ANN ALGORITHMS

Data Mining. Supervised Methods. Ciro Donalek Ay/Bi 199ab: Methods of Sciences hcp://esci101.blogspot.

Machine learning in financial forecasting. Haindrich Henrietta Vezér Evelin

Neural Networks for Sentiment Detection in Financial Text

IBM SPSS Neural Networks 22

2. IMPLEMENTATION. International Journal of Computer Applications ( ) Volume 70 No.18, May 2013

Transcription:

ISTANBUL UNIVERSITY JOURNAL OF ELECTRICAL & ELECTRONICS ENGINEERING YEAR VOLUME NUMBER : 004 : 4 : (1149-1153) BREAST CANCER DIAGNOSIS USING STATISTICAL NEURAL NETWORKS Tüba KIYAN 1 Tülay YILDIRIM 1, Electronics and Communication Eng. Department Yildiz Technical University Besiktas, Istanbul 34349 TURKEY 1 E-mail: tkiyan@yildiz.edu.tr E-mail: tulay@yildiz.edu.tr ABSTRACT Breast cancer is the second largest cause of cancer deaths among women. The performance of the statistical neural network structures, radial basis network (RBF), general regression neural network (GRNN) and probabilistic neural network (PNN) are examined on the Wisconsin breast cancer data (WBCD) in this paper. This is a well-used database in machine learning, neural network and signal processing. Statistical neural networks are used to increase the accuracy and objectivity of breast cancer diagnosis. Keywords: Radial basis function, general regression neural networks, probabilistic neural network, wisconsin breast cancer data. 1. INTRODUCTION The automatic diagnosis of breast cancer is an important, real-world medical problem. A major class of problems in medical science involves the diagnosis of disease, based upon various tests performed upon the patient. When several tests are involved, the ultimate diagnosis may be difficult to obtain, even for a medical expert. This has given rise, over the past few decades, to computerized diagnostic tools, intended to aid the physician in making sense out of the confusion of data [1]. There is much research on medical diagnosis of breast cancer with WBCD data in neural network literature. In [], a learning algorithm that combines logarithmic simulated annealing with the Perceptron algorithm is used and reported accuracy is 98.8%. In [3], the classification technique uses fuzzy modeling and cooperative coevolution reaching to a classification accuracy result of 98.98% over the entire WBCD database. In [4], the classification is based on a Feed forward Neural Network Rule Extraction Algorithm. The reported accuracy is 98.4%. The first 367 chronologically collected instances where used in [6], where the reported accuracy is 93.7%. This breast cancer database was obtained from the University of Wisconsin Hospitals, Madison from Dr. William H. Wolberg. The database contains 699 samples with 683 complete data and 16 samples with missing attributes. There are Received Date : 05.03.003 Accepted Date: 15.06.004

1150 Breast Cancer Diagnosis Using Statistical Neural Networks 9 integer-valued attributes and each data values range from 1 to 10, as follows: (1) Lump Thickness; () Uniformity of Cell Size; (3) Uniformity of Cell Shape; (4) Marginal Adhesion fibrous bands tissue that form between two surfaces; (5) Single Epithelial Cell Size the size of a single cell that forms tissues that lines the outside of the body and the passageways that lead to or from the surface; (6) Bare Nuclei; (7) Bland Chromatin evaluates for the presence of Barr bodies; (8) Normal Nucleoli; (9) Mitoses cell growth. These attributes measure the external appearance and internal chromosome changes in nine different scales. There are two values in the class variable of breast cancer: benign (non-cancerous) and malignant (cancerous), which is represented numerically by and 4 respectively [6]. Table 1 shows the data distributions. Table 1. Classes And Their Data Distributions Class Number Total of Number of Data Training Test Data Number Data 444 4 39 10 119 This paper compares the statistical neural networks with multi layer perceptron on the WBCD database. Radial Basis Function (RBF), General Regression Neural Network (GRNN) and Probabilistic Neural Network (PNN) were used to classify WBCD data and these results were compared with Multilayer Perceptron (MLP).. APPLIED NEURAL NETWORK STRUCTURES.1. Radial Basis Functions (RBF) [7]: RBF is a different approach by viewing the design of a neural network as a curve-fitting problem in a high-dimensional space. According to this viewpoint, learning is equivalent to finding a surface in a multidimensional space that provides a best fit to the training data, with the criterion for best fit being measured in some statistical sense. The construction of a radial-basis function network in its most basic form involves three entirely different layers. The input layer is made up of source nodes. The second layer is a hidden layer of high enough dimension, which serves a different purpose from that in a multilayer perceptron. The output layer supplies the response of the network to the activation patterns applied to the input layer. The transformation from the input space to the hidden-unit space is nonlinear where as the transformation from the hidden-unit space to the output space is linear... Probabilistic Neural Networks (PNN) [8]: The PNN introduced by Specht is essentially based on the well-known Bayesian classifier technique commonly used in many classical pattern-recognition problems. Consider a pattern vector x with m dimensions that belongs to one of two categories K 1 and K. Let F 1 (x) and F (x) be the probability density functions (pdf) for the classification categories K 1 and K, respectively. From Bayes discriminant decision rule, x belongs to K 1 if F L P 1 > (1) L P1 Conversely, x belongs to K if L1 P < () F L P 1 where L 1 is the loss or cost function associated with misclassifying the vector as belonging to category K 1 while it belongs to category K, L is the loss function associated with misclassifying the vector as belonging to category K while it belongs to category K 1, P 1 is the prior probability of occurrence of category K 1, and P is the prior probability of occurrence of category K. In many situations, the loss functions and the prior probabilities can be considered equal. Hence the key to using the decision rules given by equations (1) and () is to estimate the probability density functions from the training patterns.

Breast Cancer Diagnosis Using Statistical Neural Networks 1151 In the PNN, a nonparametric estimation technique known as Parzen windows is used to construct the class-dependent probability density functions (pdf) for each classification category required by Bayes theory. This allows determination of the chance a given vector pattern lies within a given category. Combining this with the relative frequency of each category, the PNN selects the most likely category for the given pattern vector. Both Bayes theory and Parzen windows are theoretically well established, have been in use for decades in many engineering applications, and are treated at length in a variety of statistical textbooks. If the jth training pattern for category K 1 is x j, then the Parzen estimate of the pdf for category K 1 is T = 1 ( x xj) ( x xj) exp (3) m / m (π ) σ n σ where n is the number of training patterns, m is the input space dimension, j is the pattern number, and σ is an adjustable smoothing parameter. However, the choice of σ in general has been found to be not too sensitive to variations in its value..3. Generalized Regression Neural Networks (GRNN) [9]: The generalized regression neural networks (GRNNs) are the paradigms of radial basis function (RBF) networks, often used for function approximations. It s another term for Nadaraya- Watson kernel regression, and has the following form for the function mapping. k { x xk / h } { x xk / h } tk exp y = (4) exp k GRNNs share a special property, namely that they do not require iterative training; the hiddento-output weights are just the target values t k, so the output y(x), is simply a weighted average of the target values t k of training cases x k close to the given input case x. It can be viewed as a normalized RBF network in which there is a hidden unit centered at every training case. These RBF units are called "kernels" and are usually probability density functions such as the Gaussians considered in (4). The only weights that need to be learned are the widths of the RBF units h. These widths (often a single width is used) are called "smoothing parameters" or "bandwidths" and are usually chosen by cross validation. GRNN is a universal approximator for smooth functions, so it should be able to solve any smooth function approximation problem given enough data. The main drawback of GRNNs is that, like kernel methods in general, they suffer seriously from the curse of dimensionality. GRNNs cannot ignore irrelevant inputs without major modifications to the basic algorithm. 3. SIMULATION RESULTS The simulations were realized by using MATLAB 6.0 Neural Network Toolbox. Four different neural network structure, multi layer perceptron, radial basis function, probabilistic neural network and generalized regression neural network were applied to WBCD database to show the performance of statistical neural networks on breast cancer data. The spread value of RBF, PNN and GRNN was chosen 4.4, 1 and 3, respectively. In MLP, learning rate was 0.6. 3.1.Training Data Simulation: Half of the database was used for training. samples of the training data belong to benign class and 10 samples belong to malignant class. The classification results of the training set by RBF, PNN, GRNN and MLP were given in the Table, 3, 4, and 5. Table. Classification Of Training Data By RBF True 10 False 0 0 Table 3. Classification Of Training Data By PNN True 10 False 0 0

115 Breast Cancer Diagnosis Using Statistical Neural Networks Table 4. Classification Of Training Data By GRNN True 17 113 False 5 7 Table 5. Classification Of Training Data By MLP (Average) True 17 118 False 5 Table 6. Performance For Training Data Classification Type Performance RBF %100 PNN %100 GRNN %96.4 MLP %98.04 RBF and PNN gives the best classification accuracy with 34 correct classifications while GRNN has the lowest accuracy with 330 correct classifications for the training set. MLP has 335 correct classifications. 3.. Test Data Simulation : Table 7. Classification Of Test Data By RBF True 15 113 False 7 6 Table 8. Classification Of Test Data By PNN True 19 11 False 3 7 Table 9. Classification Of Test Data By GRNN True 1 116 False 1 3 Table 10. Classification Of Test Data By MLP (Average) True 1 114 False 10 5 Table 11. Performance For Test Data Classification Type PERFORMANCE RBF %96.18 PNN %97.0 GRNN %98.8 MLP %95.74 A total of 341 samples were applied to the networks as test data; that is, 50% percent of the database was used for testing. samples, which belong to benign class data, and 119 samples, which belong to malignant class, were chosen for the test. The results for RBF, PNN, GRNN and MLP are shown in the Table 7, 8, 9, and 10. For the test set GRNN gives the best classification accuracy with 337 correct classifications while MLP has the lowest accuracy with 36 correct classifications. RBF classified 38 samples correctly and PNN was the second best network with 331 correct classifications. Overall classification performances were 96.18% for RBF, 97.0% for PNN, 98.8% for GRNN and 95.74% for MLP. 4. CONCLUSION How statistical neural networks are used in actual clinical diagnosis of breast cancer is shown in this paper. By applying statistical neural networks, a diagnostic system that performs at an accuracy level is constructed here. In this work, the performance of statistical neural network structures was investigated for breast cancer diagnosis problem. RBF and PNN are the best classifiers in training set, however the most important result must be considered with test data since it shows the future performance of the network. GRNN gives the best classification

Breast Cancer Diagnosis Using Statistical Neural Networks 1153 accuracy when the test set is considered. According to overall results, it is seen that the most suitable neural network model for classifying WBCD data is GRNN. This work also indicates that statistical neural networks can be effectively used for breast cancer diagnosis to help oncologists. REFERENCES [1] Carlos Andres Pena-Reyes, Moshe Sipper, A fuzzy-genetic approach to breast cancer diagnosis, Artificial Intelligence in Medicine, 131 155, 1999 [] of the LSA machine, ICONIP A. A. Albrecht, G. Lappas, S. A. Vinterbo, C.K. Wong, L. Ohno-Machado, Two applications 00. [3] C. A. Pena-Reyes, M. Sipper, Fuzzy CoCo: A cooperative coevolutionary approach to fuzzy modeling, IEEE Transactions on Fuzzy Systems, 9(5):77-737, 001. [5] J. Zhang, Selecting typical instances in instance-based learning, Proc. Ninth International Machine Learning Workshop, pp. 470-479, Morgan-Kaufmann, 199. [6] Newton Cheung, Machine Learning Techniques for Medical Analysis, Thesis for the degree of Bachelor of Engineering, 001. [7] S. Haykin, Neural Networks: A Comprehensive Foundation, Mac Millan College Publishing Company, 1994 [8] Anthony T. C. Goh, Probabilistic neural network for seismic liquefaction potential, NRC Research Press Web site, 00 [9] C. Lu, J. De Brabanter, S. Van Huffel, I. Vergote, D. Timmerman, Using artificial neural networks to predict malignancy of ovarian tumors, 3rd Annual Int. Conf. Of the IEEE Engineering in Medicine and Biology Society. [4] R. Setiono, Generating concise and accurate classification rules for breast cancer diagnosis, Artificial Intelligence in Medicine, 18(3): 05-17, 000.