DETECTION OF HEALTH CARE USING DATAMINING CONCEPTS THROUGH WEB
|
|
|
- Antonia Bates
- 10 years ago
- Views:
Transcription
1 DETECTION OF HEALTH CARE USING DATAMINING CONCEPTS THROUGH WEB Mounika NaiduP, Mtech(CS), ASCET, Gudur, C Rajendra, Prof, HOD, CSE Dept, ASCET, Gudur, Abstract: A major challenge facing healthcare organizations (hospitals, medical centers) is the provision of quality services at affordable costs Quality service implies diagnosing patients correctly and administering treatments that are effective Most hospitals today employ some sort of hospital information systems to manage their healthcare or patient data These systems are designed to support patient billing, inventory management and generation of simple statistics Some hospitals use decision support systems, but they are largely limited Clinical decisions are often made based on doctors intuition and experience rather than on the knowledge rich data hidden in the database This practice leads to unwanted biases, errors and excessive medical costs which affects the quality of service provided to patients The main objective of this research is to develop a Intelligent Heart Disease Prediction System using data mining modeling technique, namely, Naïve Bayes It can answer complex queries for diagnosing heart disease and thus assist healthcare practitioners to make intelligent clinical decisions which traditional decision support systems cannot By providing effective treatments, it also helps to reduce treatment costs Keywords-Data mining; Heart Disease; Frequent Patterns; ID3 Algorithm I INTRODUCTION Hospitals and clinics accumulate a huge amount of patient data over the years These data provide a basis for the analysis of risk factors for many diseases For example, we can predict the level of heart attack to find patterns associated with heart disease One of the major topics in data mining research is the discovery of interesting patterns in data From the introduction of frequent itemset mining and association rules, the pattern explosion was acknowledged: at high frequency thresholds only common knowledge is revealed, while at low thresholds prohibitively many patterns are returned A majority of areas related to medical services such as prediction of effectiveness of surgical procedures, medical tests, medication, and the discovery of relationships among clinical and diagnosis data also make use of Data Mining methodologies [1] The effectiveness of medical treatments can be estimated by developing the data mining applications Data mining is capable of delivering an analysis of which courses of action prove effective [2], achieved by comparing and contrasting causes, symptoms, and courses of treatments In this paper, we presented prediction of the risk levels of heart attack from the heart disease database The heart disease database consists of mixed attributes containing both the numerical and categorical data These records are cleaned and filtered with the intention that the irrelevant data from the database would be removed before mining process occurs Then clustering is performed on the preprocessed data warehouse using K- means clustering algorithm with K value so as to extract data relevant to heart attack Subsequently the frequent patterns significant to heart disease diagnosis are mined from the extracted data using the MAFIA algorithm Finally, we used the ID3 as training algorithm to show the effective risk level with decision tree The remaining sections of the paper are organized as follows: In Section 2, a brief review of some of the works on heart disease diagnosis is presented The fragmentation of risk level from heart disease database and extraction of significant patterns from heart disease database is presented detailed in Section 3 Architecture of the system is depicted in section 4 Experimental results are specified in section 5The conclusion and future works are described in Section 6 II RELATED WORK Tremendous works in literature related with heart disease diagnosis using data mining techniques have motivated our work The researchers in the medical field diagnose and predict the diseases in addition to providing effective care for patients [3] by employing the data mining techniques The data mining techniques have been employed by numerous works in the literature to diagnose diverse diseases, for instance: Diabetes, Hepatitis, Cancer, Heart diseases and more [4] A model Intelligent Heart Disease Prediction System (IHDPS) built with the aid of data mining techniques like Decision Trees, Naïve Bayes and Neural Network was proposed by Sellappan Palaniappan et al [5]The problem of identifying constrained association rules for heart disease prediction was studied by Carlos Ordonez [6] The assessed data set encompassed medical records of people having heart disease with attributes for risk factors, heart perfusion measurements and artery narrowing 45
2 Association rule mining is a major data mining technique, and is a most commonly used pattern discovery method It retrieves all frequent patterns in a data set and forms interesting rules among frequent patterns Most frequently used association rule mining methods are Apriority and FPgrowth [7] Frequent Itemset Mining (FIM) is considered to be one of the elemental data mining problems that intends to discover groups of items or values or patterns that co-occur frequently in a dataset [8]The term Heart disease encompasses the diverse diseases that affect the heart Heart disease was then major cause of casualties in the United States, England, Canada and Wales as in 2007 Heart disease kills one person every 34 seconds in the United States [9] Coronary heart disease, Cardiomyopathy and Cardiovascular disease are some categories of heart diseases The term cardiovascular disease includes a wide range of conditions that affect the heart and the blood vessels and the manner in which blood is pumped and circulated through the body Cardiovascular disease (CVD) results in severe illness, disability, and death [10] III THEORITICAL BACKGROND A Fragmentation of heart attack data using K_mean clustering The wide acceptance of the relational approach in dataprocessing applications and the continuous improvement of commercially existing relational databases has increased the interest for using database in non-conventional applications, such as computer-aided design (CAD), geographic information systems, image, and graphic database systems [10] The relational approach distinguishes two kinds of fragmentation: horizontal and vertical There are many algorithms developed for horizontal and vertical fragmentation [11] In this paper, we proposed k-mean clustering based fragmentation Clustering is the process to divide a data set into several classes or clusters, and the same data objects within a group are of higher similarity while data objects in different groups are of lower similarity Clustering medical data into small yet meaningful clusters can aid in the discovery of patterns by supporting the extraction of numerous appropriate features from each of the clusters thereby introducing structure into the data and aiding the application of conventional data mining techniques In this paper, we proposed the most popular distance measure, Euclidean distance, which is defined as dist (i, j) = x i x j (x i1 x j1 ) 2 (x i2 x j2 ) 2 (x ip x jp ) 2 where i x i 1, x i 2,, x ip and j x j 1, x j 2,, x p_dimensional data objects The steps involved in a K-means algorithm: jp are two 1 x i points denoting the data to be clustered are placed into the space These points denote the primary group centroids 2 The data are assigned to the group that is adjacent to the centroid 3 The positions of all the x j centroids are recalculated as soon as all the data are assigned Steps 2 and 3 are reiterated until the centroids stop moving any further This results in the separation of data into groups from which the metric to be minimized can be deliberated The preprocessed heart disease database is clustered using the K-means algorithm with K value as 2 One cluster consists of the data relevant to the heart disease and the other contains the remaining data Later on, the frequent patterns are mined from the cluster relevant to heart disease, using the MAFIA algorithm B Frequent pattern mining using MAFIA Frequent Itemset Mining (FIM) is considered to be one of the elemental data mining problems that intends to discover groups of items or values or patterns that co-occur frequently in a dataset The extraction of significant patterns from the heart disease data warehouse is presented in this section The heart disease database contains the screening clinical data of heart patients Frequent Itemset Mining (FIM) is considered to be one of the elemental data mining problems that intends to discover groups of items or values or patterns that co-occur frequently in a dataset It is of vital significance in a variety of Data Mining tasks that aim to mine interesting patterns from databases, like association rules, correlations, sequences, episodes, classifiers, clusters and the like The proposed approach utilizes an efficient algorithm called MAFIA (Maximal Frequent Itemset Algorithm) which combines diverse old and new algorithmic ideas to form a practical algorithm [8] In this paper, we presented the MFPA (Maximal Frequent Pattern Algorithm) algorithm for the frequent patterns applicable to heart disease are mined from the data extracted MFPA algorithm: Pseudo code for MAFIA : MAFIA(C, MFI, Boolean IsHUT) { name HUT = Chead Ctail; if HUT is in MFI Stop generation of children and return Count all children; use PEP to trim the tail, and recorder by increasing support, For each item i in C, trimmed_tail { IsHUT = whether i is the first item in the tail newnode = C I MAFIA (newnode, MFI, IsHUT)} if (IsHUT and all extensions are frequent) Stop search and go back up subtree If (C is a leaf and Chead is not in MFI) 46
3 Add Chead to MFI } The cluster that contains data most relevant to heart attack is fed as input to MAFIA algorithm to mine the frequent patterns present in it C Decision Tree Representation Decision trees are a popular logical method for classification A decision tree is a hierarchical structure that partitions data into some disjoint groups based on their different attribute values Leafs of a decision tree contain records of one or nearly one class, and so it has been used for classification An advantage of decision tree methods is that decision trees can be converted into understandable rules A most widely used decision tree system is C45, its ancestor ID3, and a commercial version C50 Decision trees have been mainly used to build diagnosis models for medical data When it is used for exploring patterns in medical data, work in shows that it is inadequate for such exploration [1] In this paper, we used the ID3 algorithm, uses the information gain measure to select among the candidate attributes at each step while growing the tree Information gain, is simply the expected reduction in entropy caused by portioning the examples according to this attribute The information gain, Gain (S, A) of an attribute A, relative to a collection of example S, is defined as, S Gain(S, A) Entropy (S) ₃ v Entropy (Sv ) (1) v Values ( A) S K-Mean clustering Cluster relevant and irrelevant data MAFIA(Maximal Frequent Itemset Algorithm) Select Frequent Pattern Classify the pattern using ID3 algorithm Display effective heart attack level V EXPERIMENTAL RESULTS Heart Disease dataset c The results of our experimental analysis in finding Entropy(S) ₃ p i log 2 p i significant patterns for heart attack prediction are presented i 1 in this section With the help of the database, the patterns significant to the heart attack prediction are extracted using where p i is the proportion of S belonging to class i the approach discussed The heart disease database is preprocessed successfully by removing duplicate records IV SYSTEM ARCHITECTURE and supplying missing values as shown in table I The refined heart disease data set, resultant from preprocessing, is then clustered using K-means algorithm with K value as 2 One cluster consists of the data relevant to the heart disease as shown in table II and the other contains the remaining data Then the frequent patterns are mined efficiently from the cluster relevant to heart disease, using the MAFIA algorithm The sample combinations of heart attack parameters for normal and risk level along with their values and levels are detailed below In that, lesser value (01) of weight comprises the normal level of prediction and higher values other than 01 comprise the higher risk levels Table III show the parameters of heart attack prediction with corresponding values and their levels Table IV show the example of training data to predict the heart attack level and then figure 2 shows the efficient heart attack level with tree using the ID3 by information gain 47
4 TABLE I HEART DISEASE DATA SET ID Attribute 1 Age 2 Sex 3 painloc: chest pain location 4 relrest 5 cp: chest pain type 6 trestbps: resting blood pressure 7 chol: serum cholesterol in mg/dl 8 smoke 9 cigs (cigarettes per day) 10 years (number of years as a smoker) 11 fbs: (fasting blood sugar > 120 mg/dl) 12 dm (1 = history of diabetes; 0 = no such history) 13 famhist: family history of coronary artery disease TABLE II CLUSTERED RELEVANT DATA BASED ON HEART DISEASE DATASET ID Reference ID Attribute 1 #1 Age 2 #2 Sex 3 #5 overweight 4 #6 trestbps: resting blood pressure 5 #7 chol: serum cholestoral in mg/dl 6 #11 fbs: (fasting blood sugar > 120 mg/dl) 7 #14 Alchol intake 8 #27 thalach: maximum heart rate achieved 9 #32 exang: exercise induced angina 10 #34 Sedentary Lifestyle/inactivity 11 #35 slope: the slope of the peak exercise ST segment 12 #37 ca: number of major vessels (0-3) colored by flourosopy 13 #40 Hereditary 14 #44 num: diagnosis of heart disease 38 exeref: exercise radinalid (sp?) ejection 39 exerwm: exercise wall (sp?) motion 40 thal: 3 = normal; 6 = fixed defect; 7 = reversable defect 41 Cmo: month of cardiac cath (sp?) (perhaps "call") 42 Cday: day of cardiac cath (sp?) 43 Cyr: year of cardiac cath (sp?) 44 num: diagnosis of heart disease (angiographic disease status) High Salt Diet Yes 09 No 01 High saturated diet Yes 09 No 01 Exercise Regular 01 Never 06 Sedentary Yes 07 Lifestyle/inactivity No 01 Hereditary Yes 07 No 01 Bad cholesterol High 08 Normal 01 Blood Pressure Normal (130/89) 01 Low (< 119/79) 08 High (>200/160) 09 Blood sugar High (>120&<400) 05 Normal (>90&<120) 01 Low ( <90) 04 Heart Rate Low (< 60bpm) 09 Normal (60 to 100) 01 High (>100bpm) 09 TABLE III HEART ATTACK PARAMETERS WITH CORRESPONDING VALUES AND THEIR WEIGHT Parameter Weight Risk level Male and Female Age<30 01 Age>30 08 Smoking Never 01 Past 03 Current 06 Overweight Yes 08 No 01 Alcohol Intake Never 01 Past 03 Current 06 48
5 If (Or) Age=>70 and Blood pressure=high and Smoking=current Then Heart attack level is High Figure 3: An example pattern of proposed system Figure 2: A decision tree for the concept heart attack level by information gain (ID3) If Age=<30 and Overweight=no and Alcohol Intake=never Then Heart attack level is Low The experimental results of our approach as presented in Table V The goal is to have high accuracy, besides high precision and recall metrics These metrics can be derived from the confusion matrix and can be easily converted to true-positive (TP) and false-positive (FP) metrics TP precision TP FP,recall TP TP FN 1 True Positive (TP): Total percentage of members classified as Class A belongs to Class A 2 False Positive (FP): Total percentage of members of Class A but does not belong to Class A 3 False Negative (FN): Total percentage of members of Class A incorrectly classified as not belonging to Class A 4 True Negative (TN): Total percentage of members which do not belong to Class A are classified not a part of Class A It can also be given as (100% - FP) TABLE IV CLASSIFY THE LEVELBASED ON TABLE ID Age Smok-ing Over Alcohol Intake Heart rate Blood Pressure Class weight 1 <30 Never No Never Normal Normal Low 2 >50 & <70 Current No Past Normal Normal Low 3 >30 & <50 Current Yes Current Low Low High 4 >70 Never Yes Past High High High 5 >30 & <50 Past Yes Past Normal High Low TABLE V COMPARISON BETWEEN SIMPLE MAFIA AND PROPOSED K-MEAN BASED MAFIA Technique Precision Recall Accuracy (%) K-mean based % MAFIA K-mean based % MAFIA with ID3 VI CONCLUSION AND FUTURE WORK Health care related data are voluminous in nature and they arrive from diverse sources all of them not entirely appropriate in structure or quality These days, the exploitation of knowledge and experience of numerous specialists and clinical screening data of patients gathered in a database during the diagnosis procedure, has been widely recognized In this paper we have presented an efficient approach for fragmenting and extracting significant patterns from the heart disease data warehouses for the efficient prediction of heart attack In our future work, we have 49
6 International Journal of Advanced Research in Computer Engineering & Technology planned to design and develop an efficient heart attack prediction system with the aid of xml data using XParser and XQuery language REFERENCES [1] Tzung-I Tang, Gang Zheng, Yalou Huang, Guangfu Shu, Pengtao Wang, "A Comparative Study of Medical Data Classification Methods Based on Decision Tree and System Reconstruction Analysis", IEMS Vol 4, No 1, pp ,June 2005 [2] Hian Chye Koh and Gerald Tan,"Data Mining Applications in Healthcare", Journal of healthcare information management, Vol 19, Issue 2, Pages 64-72, 2005 [3] S Stilou, P D Bamidis, N Maglaveras, C Pappas, Mining association rules from clinical databases: an intelligent diagnostic process in healthcare [4] A Bellaachia and Erhan Guven, Predicting Breast Cancer Survivability using Data Mining Techniques, the Sixth SIAM International Conference on Data Mining (SDM 2006), Saturday, April 22, 2006 [5] Sellappan Palaniappan, Rafiah Awang, "Intelligent Heart Disease Prediction System Using Data Mining Techniques", IJCSNS International Journal of Computer Science and Network Security, Vol8 No8, August 2008[8] Carlos Ordonez, "Improving Heart Disease Prediction Using Constrained Association Rules," Seminar Presentation at University of Tokyo, 2004 [6] Carlos Ordonez, "Improving Heart Disease Prediction Using Constrained Association Rules,"Seminar Presentation at University of Tokyo, 2004 [7] Agrawal, R, Imielinski, T and Swami, A, Mining association rules between sets of items in large databases [8] Douglas Burdick, Manuel Calimlim, Johannes Gehrke, MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases, Proceedings of the 17th International Conference on Data Engineering [9] "Heart disease" from [10] "Hospitalization for Heart Attack, Stroke, or Congestive Heart Failure among Persons with Diabetes", Special report: , New Mexico [11] Cornell, D, A Vertical Partitioning Algorithm for Relational Databases, Proceedings of the Third International Conference on Data Engineering, Los Angeles, CA, February
Predicting the Analysis of Heart Disease Symptoms Using Medicinal Data Mining Methods
Predicting the Analysis of Heart Disease Symptoms Using Medicinal Data Mining Methods V. Manikantan & S. Latha Department of Computer Science and Engineering, Mahendra Institute of Technology Tiruchengode,
Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network
General Article International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347-5161 2014 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Impelling
Prediction of Heart Disease Using Naïve Bayes Algorithm
Prediction of Heart Disease Using Naïve Bayes Algorithm R.Karthiyayini 1, S.Chithaara 2 Assistant Professor, Department of computer Applications, Anna University, BIT campus, Tiruchirapalli, Tamilnadu,
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
Decision Support in Heart Disease Prediction System using Naive Bayes
Decision Support in Heart Disease Prediction System using Naive Bayes Mrs.G.Subbalakshmi (M.Tech), Kakinada Institute of Engineering & Technology (Affiliated to JNTU-Kakinada), Yanam Road, Korangi-533461,
Decision Support System on Prediction of Heart Disease Using Data Mining Techniques
International Journal of Engineering Research and General Science Volume 3, Issue, March-April, 015 ISSN 091-730 Decision Support System on Prediction of Heart Disease Using Data Mining Techniques Ms.
Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction
Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction Jyoti Soni Ujma Ansari Dipesh Sharma Student, M.Tech (CSE). Professor Reader Raipur Institute of Technology Raipur
Heart Disease Diagnosis Using Predictive Data mining
ISSN (Online) : 2319-8753 ISSN (Print) : 2347-6710 International Journal of Innovative Research in Science, Engineering and Technology Volume 3, Special Issue 3, March 2014 2014 International Conference
Web-Based Heart Disease Decision Support System using Data Mining Classification Modeling Techniques
Web-Based Heart Disease Decision Support System using Data Mining Classification Modeling Techniques Sellappan Palaniappan 1 ), Rafiah Awang 2 ) Abstract The healthcare industry collects huge amounts of
DATA MINING AND REPORTING IN HEALTHCARE
DATA MINING AND REPORTING IN HEALTHCARE Divya Gandhi 1, Pooja Asher 2, Harshada Chaudhari 3 1,2,3 Department of Information Technology, Sardar Patel Institute of Technology, Mumbai,(India) ABSTRACT The
Intelligent and Effective Heart Disease Prediction System using Weighted Associative Classifiers
Intelligent and Effective Heart Disease Prediction System using Weighted Associative Classifiers Jyoti Soni, Uzma Ansari, Dipesh Sharma Computer Science Raipur Institute of Technology, Raipur C.G., India
Artificial Neural Network Approach for Classification of Heart Disease Dataset
Artificial Neural Network Approach for Classification of Heart Disease Dataset Manjusha B. Wadhonkar 1, Prof. P.A. Tijare 2 and Prof. S.N.Sawalkar 3 1 M.E Computer Engineering (Second Year)., Computer
CLINICAL DECISION SUPPORT FOR HEART DISEASE USING PREDICTIVE MODELS
CLINICAL DECISION SUPPORT FOR HEART DISEASE USING PREDICTIVE MODELS Srpriva Sundaraman Northwestern University [email protected] Sunil Kakade Northwestern University [email protected]
Keywords data mining, prediction techniques, decision making.
Volume 5, Issue 4, April 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analysis of Datamining
STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table
Feature Selection for Classification in Medical Data Mining
Feature Selection for Classification in Medical Data Mining Prof.K.Rajeswari 1, Dr.V.Vaithiyanathan 2 and Shailaja V.Pede 3 1 Associate Professor, PCCOE, Pune University, & Ph.D Research Scholar, SASTRA
A Novel Approach for Heart Disease Diagnosis using Data Mining and Fuzzy Logic
A Novel Approach for Heart Disease Diagnosis using Data Mining and Fuzzy Logic Nidhi Bhatla GNDEC, Ludhiana, India Kiran Jyoti GNDEC, Ludhiana, India ABSTRACT Cardiovascular disease is a term used to describe
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
Intelligent Heart Disease Prediction System Using Data Mining Techniques *Ms. Ishtake S.H, ** Prof. Sanap S.A.
Intelligent Heart Disease Prediction System Using Data Mining Techniques *Ms. Ishtake S.H, ** Prof. Sanap S.A. *Department of Copmputer science, MIT, Aurangabad, Maharashtra, India. ** Department of Computer
Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1
Data Mining 1 Introduction 2 Data Mining methods Alfred Holl Data Mining 1 1 Introduction 1.1 Motivation 1.2 Goals and problems 1.3 Definitions 1.4 Roots 1.5 Data Mining process 1.6 Epistemological constraints
REVIEW ON PREDICTION SYSTEM FOR HEART DIAGNOSIS USING DATA MINING TECHNIQUES
International Journal of Latest Research in Engineering and Technology (IJLRET) ISSN: 2454-5031(Online) ǁ Volume 1 Issue 5ǁOctober 2015 ǁ PP 09-14 REVIEW ON PREDICTION SYSTEM FOR HEART DIAGNOSIS USING
Cardiac Rehabilitation
Cardiac Rehabilitation Introduction Experiencing heart disease should be the beginning of a new, healthier lifestyle. Cardiac rehabilitation helps you in two ways. First, it helps your heart recover through
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES
REVIEW OF HEART DISEASE PREDICTION SYSTEM USING DATA MINING AND HYBRID INTELLIGENT TECHNIQUES R. Chitra 1 and V. Seenivasagam 2 1 Department of Computer Science and Engineering, Noorul Islam Centre for
Effective Analysis and Predictive Model of Stroke Disease using Classification Methods
Effective Analysis and Predictive Model of Stroke Disease using Classification Methods A.Sudha Student, M.Tech (CSE) VIT University Vellore, India P.Gayathri Assistant Professor VIT University Vellore,
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati [email protected], [email protected]
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Heart Diseases and their Complications
Heart Diseases and their Complications Health Promotion and Education Program Rev. 2014 2014, MMM Healthcare, Inc. - PMC Medicare Choice, Inc. Reproduction of this material is prohibited. MP-HEP-PPT-252-01-021914-E
Data Mining On Diabetics
Data Mining On Diabetics Janani Sankari.M 1,Saravana priya.m 2 Assistant Professor 1,2 Department of Information Technology 1,Computer Engineering 2 Jeppiaar Engineering College,Chennai 1, D.Y.Patil College
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Using Data Mining for Mobile Communication Clustering and Characterization
Using Data Mining for Mobile Communication Clustering and Characterization A. Bascacov *, C. Cernazanu ** and M. Marcu ** * Lasting Software, Timisoara, Romania ** Politehnica University of Timisoara/Computer
How To Cluster
Data Clustering Dec 2nd, 2013 Kyrylo Bessonov Talk outline Introduction to clustering Types of clustering Supervised Unsupervised Similarity measures Main clustering algorithms k-means Hierarchical Main
USING DATA SCIENCE TO DISCOVE INSIGHT OF MEDICAL PROVIDERS CHARGE FOR COMMON SERVICES
USING DATA SCIENCE TO DISCOVE INSIGHT OF MEDICAL PROVIDERS CHARGE FOR COMMON SERVICES Irron Williams Northwestern University [email protected] Abstract--Data science is evolving. In
Machine Learning using MapReduce
Machine Learning using MapReduce What is Machine Learning Machine learning is a subfield of artificial intelligence concerned with techniques that allow computers to improve their outputs based on previous
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Data Mining: A Closer Look
Chapter 2 Data Mining: A Closer Look Chapter Objectives Determine an appropriate data mining strategy for a specific problem. Know about several data mining techniques and how each technique builds a generalized
Association Technique on Prediction of Chronic Diseases Using Apriori Algorithm
Association Technique on Prediction of Chronic Diseases Using Apriori Algorithm R.Karthiyayini 1, J.Jayaprakash 2 Assistant Professor, Department of Computer Applications, Anna University (BIT Campus),
Chapter 20: Data Analysis
Chapter 20: Data Analysis Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 20: Data Analysis Decision Support Systems Data Warehousing Data Mining Classification
DataMining Clustering Techniques in the Prediction of Heart Disease using Attribute Selection Method
DataMining Clustering Techniques in the Prediction of Heart Disease using Attribute Selection Method Atul Kumar Pandey*, Prabhat Pandey**, K.L. Jaiswal***, Ashish Kumar Sen**** ABSTRACT Heart disease is
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE Kasra Madadipouya 1 1 Department of Computing and Science, Asia Pacific University of Technology & Innovation ABSTRACT Today, enormous amount of data
Association rules for improving website effectiveness: case analysis
Association rules for improving website effectiveness: case analysis Maja Dimitrijević, The Higher Technical School of Professional Studies, Novi Sad, Serbia, [email protected] Tanja Krunić, The
Genetic Neural Approach for Heart Disease Prediction
Genetic Neural Approach for Heart Disease Prediction Nilakshi P. Waghulde 1, Nilima P. Patil 2 Abstract Data mining techniques are used to explore, analyze and extract data using complex algorithms in
Investigating Clinical Care Pathways Correlated with Outcomes
Investigating Clinical Care Pathways Correlated with Outcomes Geetika T. Lakshmanan, Szabolcs Rozsnyai, Fei Wang IBM T. J. Watson Research Center, NY, USA August 2013 Outline Care Pathways Typical Challenges
INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR. [email protected]
IJFEAT INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR Bharti S. Takey 1, Ankita N. Nandurkar 2,Ashwini A. Khobragade 3,Pooja G. Jaiswal 4,Swapnil R.
International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
Building A Smart Academic Advising System Using Association Rule Mining
Building A Smart Academic Advising System Using Association Rule Mining Raed Shatnawi +962795285056 [email protected] Qutaibah Althebyan +962796536277 [email protected] Baraq Ghalib & Mohammed
Coronary Heart Disease (CHD) Brief
Coronary Heart Disease (CHD) Brief What is Coronary Heart Disease? Coronary Heart Disease (CHD), also called coronary artery disease 1, is the most common heart condition in the United States. It occurs
Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing E-mail Classifier
International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878, Volume-1, Issue-6, January 2013 Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing
Customer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
Final Project Report
CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes
Detection of Heart Diseases by Mathematical Artificial Intelligence Algorithm Using Phonocardiogram Signals
International Journal of Innovation and Applied Studies ISSN 2028-9324 Vol. 3 No. 1 May 2013, pp. 145-150 2013 Innovative Space of Scientific Research Journals http://www.issr-journals.org/ijias/ Detection
BIDM Project. Predicting the contract type for IT/ITES outsourcing contracts
BIDM Project Predicting the contract type for IT/ITES outsourcing contracts N a n d i n i G o v i n d a r a j a n ( 6 1 2 1 0 5 5 6 ) The authors believe that data modelling can be used to predict if an
ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA
ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA D.Lavanya 1 and Dr.K.Usha Rani 2 1 Research Scholar, Department of Computer Science, Sree Padmavathi Mahila Visvavidyalayam, Tirupati, Andhra Pradesh,
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Erkan Er Abstract In this paper, a model for predicting students performance levels is proposed which employs three
Categorical Data Visualization and Clustering Using Subjective Factors
Categorical Data Visualization and Clustering Using Subjective Factors Chia-Hui Chang and Zhi-Kai Ding Department of Computer Science and Information Engineering, National Central University, Chung-Li,
Web Forensic Evidence of SQL Injection Analysis
International Journal of Science and Engineering Vol.5 No.1(2015):157-162 157 Web Forensic Evidence of SQL Injection Analysis 針 對 SQL Injection 攻 擊 鑑 識 之 分 析 Chinyang Henry Tseng 1 National Taipei University
Experiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
Clustering Technique in Data Mining for Text Documents
Clustering Technique in Data Mining for Text Documents Ms.J.Sathya Priya Assistant Professor Dept Of Information Technology. Velammal Engineering College. Chennai. Ms.S.Priyadharshini Assistant Professor
How To Understand The Impact Of A Computer On Organization
International Journal of Research in Engineering & Technology (IJRET) Vol. 1, Issue 1, June 2013, 1-6 Impact Journals IMPACT OF COMPUTER ON ORGANIZATION A. D. BHOSALE 1 & MARATHE DAGADU MITHARAM 2 1 Department
The Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
Web Usage Mining: Identification of Trends Followed by the user through Neural Network
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 617-624 International Research Publications House http://www. irphouse.com /ijict.htm Web
Diabetes and Heart Disease
Diabetes and Heart Disease Diabetes and Heart Disease According to the American Heart Association, diabetes is one of the six major risk factors of cardiovascular disease. Affecting more than 7% of the
Use of Data Mining Techniques to Improve the Effectiveness of Sales and Marketing
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 4, April 2015,
IDENTIFIC ATION OF SOFTWARE EROSION USING LOGISTIC REGRESSION
http:// IDENTIFIC ATION OF SOFTWARE EROSION USING LOGISTIC REGRESSION Harinder Kaur 1, Raveen Bajwa 2 1 PG Student., CSE., Baba Banda Singh Bahadur Engg. College, Fatehgarh Sahib, (India) 2 Asstt. Prof.,
DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS
DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS 1 AND ALGORITHMS Chiara Renso KDD-LAB ISTI- CNR, Pisa, Italy WHAT IS CLUSTER ANALYSIS? Finding groups of objects such that the objects in a group will be similar
A Study of Web Log Analysis Using Clustering Techniques
A Study of Web Log Analysis Using Clustering Techniques Hemanshu Rana 1, Mayank Patel 2 Assistant Professor, Dept of CSE, M.G Institute of Technical Education, Gujarat India 1 Assistant Professor, Dept
Introduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
Absolute cardiovascular disease risk assessment
Quick reference guide for health professionals Absolute cardiovascular disease risk assessment This quick reference guide is a summary of the key steps involved in assessing absolute cardiovascular risk
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
PERFORMANCE ANALYSIS OF CLASSIFICATION DATA MINING TECHNIQUES OVER HEART DISEASE DATA BASE
PERFORMANCE ANALYSIS OF CLASSIFICATION DATA MINING TECHNIQUES OVER HEART DISEASE DATA BASE N. Aditya Sundar 1, P. Pushpa Latha 2, M. Rama Chandra 3 1 Asst.professor, CSE Department, GMR Institute of Technology,
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Clinic + - A Clinical Decision Support System Using Association Rule Mining
Clinic + - A Clinical Decision Support System Using Association Rule Mining Sangeetha Santhosh, Mercelin Francis M.Tech Student, Dept. of CSE., Marian Engineering College, Kerala University, Trivandrum,
COURSE RECOMMENDER SYSTEM IN E-LEARNING
International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand
Clustering. Adrian Groza. Department of Computer Science Technical University of Cluj-Napoca
Clustering Adrian Groza Department of Computer Science Technical University of Cluj-Napoca Outline 1 Cluster Analysis What is Datamining? Cluster Analysis 2 K-means 3 Hierarchical Clustering What is Datamining?
Data Mining Project Report. Document Clustering. Meryem Uzun-Per
Data Mining Project Report Document Clustering Meryem Uzun-Per 504112506 Table of Content Table of Content... 2 1. Project Definition... 3 2. Literature Survey... 3 3. Methods... 4 3.1. K-means algorithm...
Université de Montpellier 2 Hugo Alatrista-Salas : [email protected]
Université de Montpellier 2 Hugo Alatrista-Salas : [email protected] WEKA Gallirallus Zeland) australis : Endemic bird (New Characteristics Waikato university Weka is a collection
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
Rule based Classification of BSE Stock Data with Data Mining
International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 4, Number 1 (2012), pp. 1-9 International Research Publication House http://www.irphouse.com Rule based Classification
Understanding Web personalization with Web Usage Mining and its Application: Recommender System
Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,
Application of Data Mining in Medical Decision Support System
Application of Data Mining in Medical Decision Support System Habib Shariff Mahmud School of Engineering & Computing Sciences University of East London - FTMS College Technology Park Malaysia Bukit Jalil,
3.5% 3.0% 3.0% 2.4% Prevalence 2.0% 1.5% 1.0% 0.5% 0.0%
S What is Heart Failure? 1,2,3 Heart failure, sometimes called congestive heart failure, develops over many years and results when the heart muscle struggles to supply the required oxygen-rich blood to
FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS
FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS Breno C. Costa, Bruno. L. A. Alberto, André M. Portela, W. Maduro, Esdras O. Eler PDITec, Belo Horizonte,
International Journal of Computer Science and Communication Engineering Volume 2 issue 4(November 2013 issue)
Implementation of Apriori Algorithm in Health Care Sector: A Survey Divya Jain 1, Sumanlata Gautam 2 1, 2 Department of Computer Science, ITM University, Gurgaon, (Haryana), India 1 [email protected],
A Medical Decision Support System (DSS) for Ubiquitous Healthcare Diagnosis System
, pp. 237-244 http://dx.doi.org/10.14257/ijseia.2014.8.10.22 A Medical Decision Support System (DSS) for Ubiquitous Healthcare Diagnosis System Regin Joy Conejar 1 and Haeng-Kon Kim 1* 1 School of Information
An Overview of Data Mining Techniques Applied for Heart Disease Diagnosis and Prediction
Lecture Notes on Information Theory Vol. 2, No. 4, December 2014 An Overview of Data Mining Techniques Applied for Heart Disease Diagnosis and Prediction Salha M. Alzahani, Afnan Althopity, Ashwag Alghamdi,
A Review of Anomaly Detection Techniques in Network Intrusion Detection System
A Review of Anomaly Detection Techniques in Network Intrusion Detection System Dr.D.V.S.S.Subrahmanyam Professor, Dept. of CSE, Sreyas Institute of Engineering & Technology, Hyderabad, India ABSTRACT:In
Applications of Data Mining Techniques in Healthcare and Prediction of Heart Attacks
K.Srinivas et al. / (IJCSE) International Journal on Computer Science and Engineering Applications of Data Mining Techniques in Healthcare and Prediction of Heart Attacks K.Srinivas B.Kavihta Rani Dr.
Enhancing Quality of Data using Data Mining Method
JOURNAL OF COMPUTING, VOLUME 2, ISSUE 9, SEPTEMBER 2, ISSN 25-967 WWW.JOURNALOFCOMPUTING.ORG 9 Enhancing Quality of Data using Data Mining Method Fatemeh Ghorbanpour A., Mir M. Pedram, Kambiz Badie, Mohammad
Cardiovascular Disease Risk Factors
Cardiovascular Disease Risk Factors Risk factors are traits and life-style habits that increase a person's chances of having coronary artery and vascular disease. Some risk factors cannot be changed or
Overview. Evaluation Connectionist and Statistical Language Processing. Test and Validation Set. Training and Test Set
Overview Evaluation Connectionist and Statistical Language Processing Frank Keller [email protected] Computerlinguistik Universität des Saarlandes training set, validation set, test set holdout, stratification
Introduction to Data Mining
Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:
Classification of Heart Disease Dataset using Multilayer Feed forward backpropogation Algorithm
Classification of Heart Disease Dataset using Multilayer Feed forward backpropogation Algorithm Miss. Manjusha B. Wadhonkar 1, Prof. P. A. Tijare 2 and Prof. S. N. Sawalkar 3 1 M.E Computer Engineering,
Know your Numbers The D5 Goals for Diabetes Care. Shelly Hanson RN, CNS, CDE Cuyuna Regional Medical Center November 6, 2014
Know your Numbers The D5 Goals for Diabetes Care Shelly Hanson RN, CNS, CDE Cuyuna Regional Medical Center November 6, 2014 The D5 What is it 5 different treatment goals identified for optimal diabetes
An innovative application of a constrained-syntax genetic programming system to the problem of predicting survival of patients
An innovative application of a constrained-syntax genetic programming system to the problem of predicting survival of patients Celia C. Bojarczuk 1, Heitor S. Lopes 2 and Alex A. Freitas 3 1 Departamento
Manitoba EMR Data Extract Specifications
MANITOBA HEALTH Manitoba Data Specifications Version 1 Updated: August 14, 2013 1 Introduction The purpose of this document 1 is to describe the data to be included in the Manitoba Data, including the
1. Classification problems
Neural and Evolutionary Computing. Lab 1: Classification problems Machine Learning test data repository Weka data mining platform Introduction Scilab 1. Classification problems The main aim of a classification
