Web-Based Heart Disease Decision Support System using Data Mining Classification Modeling Techniques
|
|
|
- Maryann Lucas
- 10 years ago
- Views:
Transcription
1 Web-Based Heart Disease Decision Support System using Data Mining Classification Modeling Techniques Sellappan Palaniappan 1 ), Rafiah Awang 2 ) Abstract The healthcare industry collects huge amounts of healthcare data which, unfortunately, are not mined to discover hidden information for effective decision making by healthcare practitioners. Discovery of hidden patterns and relationships often goes unexploited. Advanced data mining modeling techniques can help overcome this situation. This research has developed a Web-based prototype Heart Disease Decision Support System (HDDSS) using Data Mining Classification Modeling Techniques, namely, Decision Trees, Naïve Bayes and Neural Network. Results show that each technique has its unique strength in realizing the objectives of the defined mining goals. HDDSS can answer complex what if queries which traditional decision support systems cannot. Using medical profiles such as age, sex, blood pressure and blood sugar it can predict the likelihood of patients getting heart disease. It enables significant knowledge, e.g. patterns, relationships between medical factors related to heart disease, to be established. HDDSS is userfriendly, scalable, reliable and expandable. It is implemented on the.net platform. 1. Motivation A major challenge facing healthcare organizations (hospitals, medical centers) is the provision of quality services at affordable prices. Quality service implies diagnosing patients correctly and administering treatments that are effective. Making poor clinical decisions can be disastrous and therefore is not acceptable. Hospitals must also minimize the cost of clinical tests. To do these, they must employ appropriate computer-based information and/or decision support systems. Most hospitals today use some sort of hospital information systems to manage their healthcare or patient data [12]. These systems generate huge amounts of data which take the form of numbers, text, charts and images. Unfortunately, these data are rarely used to support clinical decision making. There is a wealth of hidden information in these data that is largely untapped. This raises an important question: How do we turn the data into useful information to support decision making by healthcare practitioners? This is the main motivation for this research. 1 Department of Information Technology, Malaysia University of Science and Technology, Kelana Square, Kelana Jaya, 17 Jalan SS7/26, Petaling Jaya, Selangor, Malaysia, [email protected] 2 Department of Information Technology, Malaysia University of Science and Technology, Kelana Square, Kelana Jaya, 17 Jalan SS7/26, Petaling Jaya, Selangor, Malaysia, [email protected] 157
2 2. Problem Statement Many hospital information systems are designed to support patient billing, inventory management and generation of simple statistics. Where there are decision support systems, they are usually very limited. They can answer simple queries like What is the average age of patients who have heart disease?, How many surgeries had resulted in hospital stays longer than 10 days?, Identify the female patients who are single, above 30 years and treated for cancer. However, they cannot answer complex queries like Identify the important preoperative predictors that increase the length of hospital stay, Given patient records on cancer, should the treatment include chemotherapy alone, radiation alone, or both chemotherapy and radiation?, Given patient records, predict the probability of patients who are likely to have heart disease. Often, clinical decisions are made based on doctors intuition rather than on the knowledge-rich data in the database. This practice leads to unwanted biases, errors and excessive medical costs which affects the quality of service provided to patients. Wu, et al proposed that integration of clinical decision support with computer-based patient records could reduce medical errors, enhance patient safety, decrease unwanted practice variation, and improve patient outcome [17]. This is promising as data modeling and analysis tools have the potential to generate knowledge-rich environment which can significantly improve the quality of clinical decisions. 3. Research Objective This research will develop a prototype Web-based Heart Disease Decision Support System (HDDSS) using three data mining classification modeling techniques, namely, Decision Trees, Naïve Bayes and Neural Network. It can extract hidden knowledge (patterns and relationships) associated with heart disease from a historical heart disease database. It can answer complex queries related to heart disease diagnosis and thus assist healthcare practitioners to make intelligent clinical decisions which traditional decision support systems cannot. It will assist them to provide effective treatments at lower costs. To enhance visualization and ease of interpretation, it will display the results both in tabular and graphical forms. 4. Data Mining Review Although data mining has been around for more than two decades, its potential is only being realized now. Data mining combines statistical analysis, machine learning and database technology to extract hidden patterns and relationships from large databases [15]. Fayyad defines data mining as a process of nontrivial extraction of implicit, previously unknown and potentially useful information from the data stored in a database [4]. Giudici defines it as a process of selection, exploration and modeling of large quantities of data to discover regularities or relations that are at first unknown with the aim of obtaining clear and useful results for the owner of database [5]. 158
3 Data mining uses two strategies: supervised and unsupervised learning. Supervised learning uses values of input variables to predict a target variable with a known value while unsupervised learning works in a similar way, but more frequently it predicts a target variable where a known value does not exist [12]. Each data mining technique serves a different purpose depending on the modeling objective. The two most common modeling objectives are classification and prediction. Classification models predict categorical labels (discrete, unordered) while prediction models predict continuous-valued functions [6]. Decision Trees and Neural Networks use classification algorithms while Regression, Association Rules and Clustering use prediction algorithms [3]. Decision Tree algorithms include CART (Classification and Regression Tree), ID3 (Iterative Dichotomized 3) and C4.5. These algorithms differ in selection of splits, when to stop a node from splitting, and assignment of class to a non-split node [7]. CART uses Gini index to measure the impurity of a partition or set of training tuples [6]. It handles high dimensional, categorical and continuous data. Naive Bayes or Bayes Rule is the basis for many machine-learning and data mining methods [14]. The rule (algorithm) is used to create models with predictive capabilities. It provides new ways of exploring and understanding data. It learns from the evidence by calculating the correlation between the target and other variables. Neural Networks consists of three layers: input, hidden and output units (variables). Connection between input units and hidden and output units are based on relevance of the assigned value (weight) of that particular input unit. The higher the weight the more important it is. Neural Network algorithms use Linear and Sigmoid transfer functions. Neural Networks are suitable for training large amounts of data with few inputs. It is used when other techniques are unsatisfactory. 5. Methodology HDDSS uses the six phases of CRISP-DM methodology to build the data mining models [3]. Data Mining Extension (DMX), a SQL-style query language for data mining, is used for building and accessing the models contents. Tabular and graphical visualizations are incorporated to enhance analysis and interpretation of results Data Source A total of 909 records with 15 medical attributes (factors) were obtained from the Cleveland Heart Disease database [1]. The records were split equally into two datasets: training dataset (455 records) and testing dataset (454 records). To avoid bias, the records for each set were selected randomly. The classification modeling objective was chosen as Naïve Bayes algorithm only supports categorical (discrete) attributes. Both Decision Trees and Neural Network algorithms support categorical and continuous attributes. For consistency, only categorical attributes are used for all 159
4 three models. All the medical attributes in Figure 1 were transformed from numerical to categorical data. The attribute Diagnosis was identified as the predictable attribute with value 1 for patients with heart disease and value 0 for patients with no heart disease. The attribute PatientID was used as the key; the rest are input attributes. It is assumed that missing, inconsistent and duplicate data have been resolved. Predictable Attribute 1. Diagnosis (value 0: < 50% diameter narrowing (no heart disease); value 1: > 50% diameter narrowing (has heart disease)) Key Attribute 1. PatientId Patient s identification number Input Attributes 1. Sex (value 1: Male; value 0 : Female) 2. Chest Pain Type (value 1: typical type 1 angina, value 2: typical type angina, value 3: non-angina pain; value 4: asymptomatic) 3. Fasting Blood Sugar (value 1: > 120 mg/dl; value 0: < 120 mg/dl) 4. Restecg resting electrographic results (value 0: normal; value 1: 1 having ST-T wave abnormality; value 2: showing probable or definite left ventricular hypertrophy) 5. Exang exercise induced angina (value 1: yes; value 0: no) 6. Slope the slope of the peak exercise ST segment (value 1: unsloping; value 2: flat; value 3: downsloping) 7. CA number of major vessels colored by floursopy (value 0 3) 8. Thal (value 3: normal; value 6: fixed defect; value 7: reversible defect) 9. Trest Blood Pressure (mm Hg on admission to the hospital) 10. Serum Chlolestoral (mg/dl) 11. Thalach maximum heart rate achieved 12. Oldpeak ST depression induced by exercise relative to rest 13. Age in Year Figure 1: Description of Attributes 5.2. Mining Models Data Mining Extension (DMX) query language was used for model creation, model training, model prediction and model content access. All parameters were set to the default setting except for parameters Minimum Support = 1 for Decision Tree and Minimum Dependency Probability = for Naïve Bayes [10]. The trained models were evaluated against the test datasets for accuracy and effectiveness before they were deployed in HDDSS. The models were validated using Lift Chart and Classification Matrix Validating Model Effectiveness The effectiveness of models was tested using two methods: Lift Chart and Classification Matrix. The purpose was to determine which model gave the highest percentage of correct predictions for diagnosing patients with heart disease. Lift Chart. To determine if there was sufficient information to learn patterns in response to the predictable attribute, columns in the trained model were mapped to columns in the test dataset. The 160
5 model, predictable column to chart against, and the state of the column to predict patients with heart disease (predict value = 1) were also selected. Figure 2 shows the Lift Chart output. The X-axis shows the percentage of the test dataset used to compare predictions while the Y-axis shows the percentage of values predicted to the specified state. The blue and green lines show the results for random-guess and ideal models respectively. The purple, yellow and red lines show the results of Neural Network, Naïve Bayes and Decision Tree models respectively. Figure 2: Result of Lift Chart with Predictable Value The top green line shows the ideal model; it captured 100% of the target population for patients with heart disease using 46% of the test dataset. The bottom blue line shows the random line which is always a 45-degree line across the chart. It shows that if we randomly guess the result for each case, 50% of the target population would be captured using 50% of the test dataset. All three model lines (purple, yellow and red) fall between the random-guess and ideal lines, showing that all three have sufficient information to learn patterns in response to the predictable state. Lift Chart with No Prediction Value. The steps for producing Lift Chart are similar to the above except that the state of the predictable column is left blank. It does not include a line for the random-guess model. It tells how well each model fared at predicting the correct number of the predictable attribute. Figure 3 shows the Lift Chart output. The X-axis shows the percentage of test dataset used to compare predictions while the Y-axis shows the percentage of predictions that are correct. The blue, purple, green and red lines show the ideal, Neural Network, Naïve Bayes and Decision Trees models respectively. The chart shows the performance of the models across all possible states. The ideal line (blue) is at 45-degree angle, showing that if 50% of the test dataset is processed, 50% of test dataset is predicted correctly. 161
6 Figure 3: Result of Lift Chart without Predictable Value The chart shows that if 50% of the population is processed, Neural Network has the highest percentage of correct predictions (49.34%) followed by Naïve Bayes (47.58%) and Decision Trees (41.85%). If the whole population is processed, Naïve Bayes model appears to be better than the other two as it has the highest number of correct predictions (86.12%) followed by Neural Network (85.68%) and Decision Trees (80.4%). If less than 50% of the population is processed the Lift lines for Neural Network and Naïve Bayes will always be higher than that for Decision Trees. This shows that Neural Network and Naïve Bayes are better at making high percentage of correct predictions than Decision Trees. Along the X- axis the Lift lines for Neural Network and Naïve Bayes overlap. This shows that both models are equally good for predicting correctly. If more than 50% of population is processed, Neural Network and Naïve Bayes appear to be better as they give high percentage of correct predictions than Decision Trees. This is so because the Lift line for Decision Trees will always be below that of Neural Network and Naïve Bayes. For certain population range, Neural Network appears to fare better than Naives Bayes and vice-versa. Classification Matrix. Classification Matrix displays the frequency of correct and incorrect predictions. It compares the actual values in the test dataset with the predicted values in the trained model. In this example, the test dataset contained 208 patients with heart disease and 246 patients without heart disease. Figure 4 shows the results of the classification matrix for all three models. The rows represent predicted values while the columns represent actual values (1 for patients with heart disease, 0 for patients with no heart disease). The left-most columns show the values predicted by the models. The diagonal values show correct predictions. Figure 4: Results of Classification Matrix for the Three Models 162
7 Figure 5 summarizes the results of all three models. Naïve Bayes appears to be most effective as it has the highest percentage of correct predictions (86.53%) for patients with heart disease, followed by Neural Network (with a difference of less than 1%) and Decision Trees. Decision Trees, however, appears to be most effective for predicting patients with no heart disease (89%) compared to the other two models. Model No. of cases Prediction Decision Patients with heart disease, predicted as having heart disease 146 Correct Tree Patients with no heart disease, predicted as having heart disease 27 Incorrect Patients with no heart disease, predicted as having no heart disease 219 Correct Patients with heart disease, predicted as having no heart disease 62 Incorrect Naïve Patients with heart disease, predicted as having heart disease 180 Correct Bayes Patients with no heart disease, predicted as having heart disease 35 Incorrect Patients with no heart disease, predicted as having no heart disease 211 Correct Patients with heart disease, predicted as having no heart disease 28 Incorrect Neural Patients with heart disease, predicted as having heart disease 178 Correct Network Patients with no heart disease, predicted as having heart disease 35 Incorrect Patients with no heart disease, predicted as having no heart disease 211 Correct Patients with heart disease, predicted as having no heart disease 30 Incorrect Figure 5: Model Results 5.4. Evaluation of Mining Goals Five mining goals were defined based on exploration of the heart disease dataset and objectives of this research. They were evaluated against the trained models. Results show that all three models had achieved the stated goals, suggesting that they can be used to provide decision support to doctors for diagnosing patients and discovering medical factors associated with heart disease. The five goals are listed and discussed below. Goal 1: Given patients medical profiles, predict those who are likely to be diagnosed with heart disease. All three models could answer this question using singleton query and batch or prediction join query. Both queries could predict on single input cases and multiple input cases respectively. HDDSS supports prediction using what if scenarios. Users enter the values of medical attributes for diagnosing patients with heart disease. For example, entering the values Age = 70, CA = 2, Chest Pain Type = 4, Sex = M, Slope = 2 and Thal = 3 into the models, would produce the output shown in Figure 6. All three models showed that the person has a heart disease. Naïve Bayes gives the highest probability (95%) with 432 supporting cases, followed closely by Decision Tree (94.93%) with 106 supporting cases and Neural Network (93.54%) with 298 supporting cases. As the values are high, doctors can recommend that the patient undergoes further heart examination. Performing what if scenarios can thus help prevent a potential heart attack Goal 2: Identify the significant influences and relationships in the medical inputs associated with the predictable state heart disease. The Dependency viewer in Decision Trees and Naïve Bayes models shows the results from the most significant to the least (weakest) medical predictors. The viewer is especially useful when there are many predictable attributes. Figures 7 and 8 show that in 163
8 both models, the most significant factor influencing heart disease is Chest Pain Type. Other significant factors include Thal, CA and Exang. Decision Trees model shows Trest Blood Pressure as the weakest factor while Naïve Bayes model shows Fasting Blood Sugar as the weakest factor. Naïve Bayes appears to be better than Decision Trees as it gives the significance of all input attributes. Doctors can use this information to further analyze the strengths and weaknesses of medical attributes associated with heart disease. Figure 6: Screen Shot for Singleton Query Module Figure 7: Decision Trees Dependency Network Figure 8: Dependency Network for Naïve Bayes Goal 3: Identify the impact and relationship between the medical attributes in relation to the predictable state heart disease. Identifying the impact and relationship between the medical attributes in relation to heart disease is only found in Decision Trees viewer (Figure 9). It gives the highest probability (99.61%) that patients with heart disease are found in the relationship between the attributes (nodes): Chest Pain Type = 4 and CA = 0 and Exang = 0 and Trest Blood Pressure >= and < By using this information, doctors can perform medical screening on these four attributes instead of on all attributes on prospective patients who are likely to be diagnosed with heart disease. This will reduce medical expenses, administrative costs and diagnosis time. Information on least impact (5.88%) is found in the relationship between the attributes: Chest 164
9 Pain Type not = 4 and Sex = F. Also given is the relationship between attributes for patients with no heart disease. Results show that the relationship between the attributes: Chest Pain Type not = 4 and Sex = F has the highest impact (92.58%). The least impact (0.2%) is found in the attributes: Chest Pain Type = 4 and CA = 0 and Exang = 0 and Trest Blood Pressure >= and < Additional information such as identifying patients and their medical profiles based selected nodes can also be accessed using the drill through function. Doctors can use the Decision Tree viewer to perform further analysis. Figure 9: Decision Tree Viewer Note: Red is for predictable state: patients with heart disease; blue is for predictable state: patients with no heart disease. Goal 4: Identify characteristics of patients with heart disease. Only Naïve Bayes model identifies the characteristics of patients with heart disease. It shows the probability of each input attribute for the predictable state. Figure 10 shows that 80% of the heart disease patients are males (Sex = 1) and 43% are between ages 56 and 63. Other significant characteristics are: high probability in fasting blood sugar with less than 120 mg/dl reading, chest pain type is asymptomatic, slope of peak exercise is flat, etc. Figure 11 shows the characteristics of patients with no heart disease with high probability in fasting blood sugar with less than 120 mg/dl reading, no exercise induced, number of major vessels is zero, etc. These results can be further analyzed. Figure 10: Naïve Bayes Attribute Characteristics Viewer in Descending Order for Patients with Heart Disease Figure 11: Naïve Bayes Attribute Characteristic Viewer in Descending Order for Patients with No Heart Disease Goal 5: Determine the attribute values that differentiate nodes favoring and disfavoring the predictable states: (1) patients with heart disease (2) patients with no heart disease. This query can be answered by analyzing the results of attribute discrimination viewer of Naïve Bayes and Neural Network models. The viewer provides information on the impact of all attribute values related to 165
10 the predictable state. Naïve Bayes model (Figure 12) shows the most important attribute favoring patients with heart disease: Chest Pain Type = 4 with 158 cases and 56 patients with no heart disease. The input attributes Thal = 7 with 123 (75.00%) patients, Exang = 1 with 112 (73.68%) patients, Slope =2 with 138 (66.34%) patients, etc. also favor predictable state. In contrast, the attributes Thal = 3 with 195 (73.86%) patients, CA = 0 with 198 (73.06%) patients, Exang = 0 with 206 (67.98%), etc. favor predictable state for patients with no heart disease. Neural Network model (Figure 13) shows that the most important attribute value that favors patients with heart disease is Old peak = (98%). Other attributes that favor heart disease include Old peak >= 3.81, CA=2, CA=3, etc. Attributes like Serum Cholesterol >= , Chest Pain Type = 2, CA =0, etc. also favor the predictable state for patients with no heart disease. Figure 12: A Tornado Chart for Attribute Discrimination Viewer in Descending Order for Naïve Bayes Figure 13: Attribute Discrimination Viewer in Descending Order for Neural Network 6. Conclusion A web-based prototype heart disease decision support system is developed using three data mining classification modeling techniques. It extracts hidden knowledge from a historical heart disease database. DMX query language and functions are used to build and access the models. The models are trained and validated against a test dataset. Lift Chart and Classification Matrix methods are used to evaluate the effectiveness of the models. All three models are able to extract patterns in response to the predictable state. The most effective model to predict patients who are likely to have a heart disease appears to be Naïve Bayes followed by Neural Network and Decision Trees. Five mining goals are defined based on business intelligence and data exploration. The goals are evaluated against the trained models. All three models could answer complex queries, each with its own strength with respect to ease of model interpretation, access to detailed information and accuracy. Naïve Bayes could answer four out of the five goals; Decision Trees, three; and Neural Network, two. Although not the most effective model, results from Decision Trees are easier to read and interpret. The drill through feature to access detailed patients profiles is only available in Decision Trees. Naïve Bayes fared better than Decision Trees as it could identify all significant medical predictors. The relationship between attributes provided by Neural Network is more difficult to understand. 166
11 HDDSS can be further enhanced. For example, it can incorporate other techniques such as Time Series, Clustering and Association Rules. It can also use Text Mining to mine the vast amount of unstructured data available in healthcare databases. Another challenge would be to integrate both data and text mining [16]. References [1] Blake, C. L., Mertz, C.J.: UCI Machine Learning Databases, [2] Chapman, P., Clinton, J., Kerber, R. Khabeza, T., Reinartz, T., Shearer, C., Wirth, R.: CRISP-DM 1.0: Step by step data mining guide, SPSS, 1-78, [3] Charly, K.: Data Mining for the Enterprise, 31st Annual Hawaii Int. Conf. on System Sciences, IEEE Computer, 7, , [4] Fayyad, U: Data Mining and Knowledge Discovery in Databases: Implications fro scientific databases, Proc. of the 9 th Int. Conf. on Scientific and Statistical Database Management, Olympia, Washington, USA, 2-11, [5] Giudici, P.: Applied Data Mining: Statistical Methods for Business and Industry, New York: John Wiley, [6] Han, J., Kamber, M.: Data Mining Concepts and Techniques, Morgan Kaufmann Publishers, [7] Ho, T. J.: Data Mining and Data Warehousing, Prentice Hall, [8] Kaur, H., Wasan, S. K.: Empirical Study on Applications of Data Mining Techniques in Healthcare, Journal of Computer Science 2(2), , [9] Mehmed, K.: Data mining: Concepts, Models, Methods and Algorithms, New Jersey: John Wiley, [10] Mohd, H., Mohamed, S. H. S.: Acceptance Model of Electronic Medical Record, J of Advancing Information and Management Studies. 2(1), 75-92, [11] Msdn2.microsoft.com. MSDN [12] Obenshain, M.K: Application of Data Mining Techniques to Healthcare Data, Infection Control and Hospital Epidemiology, 25(8), , [13] Sellapan, P., Chua, S.L.: Model-based Healthcare Decision Support System, Proc. Of Int. Conf. on Information Technology in Asia CITA 05, 45-50, Kuching, Sarawak, Malaysia, 2005 [14] Tang, Z. H., MacLennan, J.: Data Mining with SQL Server 2005, Indiapolis: Wiley, [15] Thuraisingham, B.: A Primer for Understanding and Applying Data Mining, IT Professional, 28-31, [16] Weiguo, F., Wallace, L., Rich, S., Zhongju, Z.: Tapping the Power of Text Mining, Communication of the ACM. 49(9), 77-82, [17] Wu, R., Peters, W., Morgan, M.W.: The Next Generation Clinical Decision Support: Linking Evidence to Best Practice, J Healthcare Information Managment. 16(4), 50-55,
Intelligent Heart Disease Prediction System Using Data Mining Techniques *Ms. Ishtake S.H, ** Prof. Sanap S.A.
Intelligent Heart Disease Prediction System Using Data Mining Techniques *Ms. Ishtake S.H, ** Prof. Sanap S.A. *Department of Copmputer science, MIT, Aurangabad, Maharashtra, India. ** Department of Computer
Decision Support in Heart Disease Prediction System using Naive Bayes
Decision Support in Heart Disease Prediction System using Naive Bayes Mrs.G.Subbalakshmi (M.Tech), Kakinada Institute of Engineering & Technology (Affiliated to JNTU-Kakinada), Yanam Road, Korangi-533461,
PERFORMANCE ANALYSIS OF CLASSIFICATION DATA MINING TECHNIQUES OVER HEART DISEASE DATA BASE
PERFORMANCE ANALYSIS OF CLASSIFICATION DATA MINING TECHNIQUES OVER HEART DISEASE DATA BASE N. Aditya Sundar 1, P. Pushpa Latha 2, M. Rama Chandra 3 1 Asst.professor, CSE Department, GMR Institute of Technology,
Decision Support System on Prediction of Heart Disease Using Data Mining Techniques
International Journal of Engineering Research and General Science Volume 3, Issue, March-April, 015 ISSN 091-730 Decision Support System on Prediction of Heart Disease Using Data Mining Techniques Ms.
Prediction of Heart Disease Using Naïve Bayes Algorithm
Prediction of Heart Disease Using Naïve Bayes Algorithm R.Karthiyayini 1, S.Chithaara 2 Assistant Professor, Department of computer Applications, Anna University, BIT campus, Tiruchirapalli, Tamilnadu,
Heart Disease Diagnosis Using Predictive Data mining
ISSN (Online) : 2319-8753 ISSN (Print) : 2347-6710 International Journal of Innovative Research in Science, Engineering and Technology Volume 3, Special Issue 3, March 2014 2014 International Conference
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction
Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction Jyoti Soni Ujma Ansari Dipesh Sharma Student, M.Tech (CSE). Professor Reader Raipur Institute of Technology Raipur
Data Mining: A Closer Look
Chapter 2 Data Mining: A Closer Look Chapter Objectives Determine an appropriate data mining strategy for a specific problem. Know about several data mining techniques and how each technique builds a generalized
CLINICAL DECISION SUPPORT FOR HEART DISEASE USING PREDICTIVE MODELS
CLINICAL DECISION SUPPORT FOR HEART DISEASE USING PREDICTIVE MODELS Srpriva Sundaraman Northwestern University [email protected] Sunil Kakade Northwestern University [email protected]
Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network
General Article International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347-5161 2014 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Impelling
In this presentation, you will be introduced to data mining and the relationship with meaningful use.
In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
A Proposed Data Mining Model for the Associated Factors of Alzheimer s Disease
A Proposed Data Mining Model for the Associated Factors of Alzheimer s Disease Dr. Nevine Makram Labib and Mohamed Sayed Badawy Department of Computer and Information Systems Faculty of Management Sciences,
A STUDY ON CLINICAL PREDICTION USING DATA MINING TECHNIQUES
International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR) ISSN 2249-6831 Vol. 3, Issue 1, Mar 2013, 239-248 TJPRC Pvt. Ltd. A STUDY ON CLINICAL PREDICTION USING
Artificial Neural Network Approach for Classification of Heart Disease Dataset
Artificial Neural Network Approach for Classification of Heart Disease Dataset Manjusha B. Wadhonkar 1, Prof. P.A. Tijare 2 and Prof. S.N.Sawalkar 3 1 M.E Computer Engineering (Second Year)., Computer
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Data Mining with SQL Server Data Tools
Data Mining with SQL Server Data Tools Data mining tasks include classification (directed/supervised) models as well as (undirected/unsupervised) models of association analysis and clustering. 1 Data Mining
Intelligent and Effective Heart Disease Prediction System using Weighted Associative Classifiers
Intelligent and Effective Heart Disease Prediction System using Weighted Associative Classifiers Jyoti Soni, Uzma Ansari, Dipesh Sharma Computer Science Raipur Institute of Technology, Raipur C.G., India
Chapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
STATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
Feature Selection for Classification in Medical Data Mining
Feature Selection for Classification in Medical Data Mining Prof.K.Rajeswari 1, Dr.V.Vaithiyanathan 2 and Shailaja V.Pede 3 1 Associate Professor, PCCOE, Pune University, & Ph.D Research Scholar, SASTRA
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
Azure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
The Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
Applications of Data Mining Techniques in Healthcare and Prediction of Heart Attacks
K.Srinivas et al. / (IJCSE) International Journal on Computer Science and Engineering Applications of Data Mining Techniques in Healthcare and Prediction of Heart Attacks K.Srinivas B.Kavihta Rani Dr.
COMP3420: Advanced Databases and Data Mining. Classification and prediction: Introduction and Decision Tree Induction
COMP3420: Advanced Databases and Data Mining Classification and prediction: Introduction and Decision Tree Induction Lecture outline Classification versus prediction Classification A two step process Supervised
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R
Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be
Predictive Data modeling for health care: Comparative performance study of different prediction models
Predictive Data modeling for health care: Comparative performance study of different prediction models Shivanand Hiremath [email protected] National Institute of Industrial Engineering (NITIE) Vihar
Naive Bayesian Classification Approach in Healthcare Applications
International Journal of Computer Science and Telecommunications [Volume 3, Issue, January ] 6 ISSN 333 Naive Bayesian Classification Approach in Healthcare Applications R. Bhuvaneswari and K. Kalaiselvi
Application of Data Mining in Medical Decision Support System
Application of Data Mining in Medical Decision Support System Habib Shariff Mahmud School of Engineering & Computing Sciences University of East London - FTMS College Technology Park Malaysia Bukit Jalil,
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
DATA MINING AND REPORTING IN HEALTHCARE
DATA MINING AND REPORTING IN HEALTHCARE Divya Gandhi 1, Pooja Asher 2, Harshada Chaudhari 3 1,2,3 Department of Information Technology, Sardar Patel Institute of Technology, Mumbai,(India) ABSTRACT The
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
A Novel Approach for Heart Disease Diagnosis using Data Mining and Fuzzy Logic
A Novel Approach for Heart Disease Diagnosis using Data Mining and Fuzzy Logic Nidhi Bhatla GNDEC, Ludhiana, India Kiran Jyoti GNDEC, Ludhiana, India ABSTRACT Cardiovascular disease is a term used to describe
BIG DATA What it is and how to use?
BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14
Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report
Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report G. Banos 1, P.A. Mitkas 2, Z. Abas 3, A.L. Symeonidis 2, G. Milis 2 and U. Emanuelson 4 1 Faculty
Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734
Cleveland State University Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734 SS Chung 14 Build a Data Mining Model using Data
Data Mining In Healthcare: A Survey of Techniques and Algorithms with Its Limitations and Challenges
RESEARCH ARTICLE OPEN ACCESS Data Mining In Healthcare: A Survey of Techniques and Algorithms with Its Limitations and Challenges Prakash Mahindrakar 1, Dr. M. Hanumanthappa 2 1 Research Scholar, Department
Data Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification
A Data Mining Study of Weld Quality Models Constructed with MLP Neural Networks from Stratified Sampled Data
A Data Mining Study of Weld Quality Models Constructed with MLP Neural Networks from Stratified Sampled Data T. W. Liao, G. Wang, and E. Triantaphyllou Department of Industrial and Manufacturing Systems
Data Mining is the process of knowledge discovery involving finding
using analytic services data mining framework for classification predicting the enrollment of students at a university a case study Data Mining is the process of knowledge discovery involving finding hidden
STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table
Statistical Models in Data Mining
Statistical Models in Data Mining Sargur N. Srihari University at Buffalo The State University of New York Department of Computer Science and Engineering Department of Biostatistics 1 Srihari Flood of
Categorical Data Visualization and Clustering Using Subjective Factors
Categorical Data Visualization and Clustering Using Subjective Factors Chia-Hui Chang and Zhi-Kai Ding Department of Computer Science and Information Engineering, National Central University, Chung-Li,
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Extend Table Lens for High-Dimensional Data Visualization and Classification Mining
Extend Table Lens for High-Dimensional Data Visualization and Classification Mining CPSC 533c, Information Visualization Course Project, Term 2 2003 Fengdong Du [email protected] University of British Columbia
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
not possible or was possible at a high cost for collecting the data.
Data Mining and Knowledge Discovery Generating knowledge from data Knowledge Discovery Data Mining White Paper Organizations collect a vast amount of data in the process of carrying out their day-to-day
Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing
Introduction to Data Mining and Machine Learning Techniques Iza Moise, Evangelos Pournaras, Dirk Helbing Iza Moise, Evangelos Pournaras, Dirk Helbing 1 Overview Main principles of data mining Definition
Big Data: Rethinking Text Visualization
Big Data: Rethinking Text Visualization Dr. Anton Heijs [email protected] Treparel April 8, 2013 Abstract In this white paper we discuss text visualization approaches and how these are important
Effective Analysis and Predictive Model of Stroke Disease using Classification Methods
Effective Analysis and Predictive Model of Stroke Disease using Classification Methods A.Sudha Student, M.Tech (CSE) VIT University Vellore, India P.Gayathri Assistant Professor VIT University Vellore,
BIDM Project. Predicting the contract type for IT/ITES outsourcing contracts
BIDM Project Predicting the contract type for IT/ITES outsourcing contracts N a n d i n i G o v i n d a r a j a n ( 6 1 2 1 0 5 5 6 ) The authors believe that data modelling can be used to predict if an
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
Index Contents Page No. Introduction . Data Mining & Knowledge Discovery
Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.
Data Mining Applications in Higher Education
Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2
Data Mining Fundamentals
Part I Data Mining Fundamentals Data Mining: A First View Chapter 1 1.11 Data Mining: A Definition Data Mining The process of employing one or more computer learning techniques to automatically analyze
Classification of Heart Disease Dataset using Multilayer Feed forward backpropogation Algorithm
Classification of Heart Disease Dataset using Multilayer Feed forward backpropogation Algorithm Miss. Manjusha B. Wadhonkar 1, Prof. P. A. Tijare 2 and Prof. S. N. Sawalkar 3 1 M.E Computer Engineering,
Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction
Data Mining and Exploration Data Mining and Exploration: Introduction Amos Storkey, School of Informatics January 10, 2006 http://www.inf.ed.ac.uk/teaching/courses/dme/ Course Introduction Welcome Administration
DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support
DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support Rok Rupnik, Matjaž Kukar, Marko Bajec, Marjan Krisper University of Ljubljana, Faculty of Computer and Information
ANALYSIS OF HEART DISEASES DATASET USING NEURAL NETWORK APPROACH
ANALYSIS OF HEART DISEASES DATASET USING NEURAL NETWORK APPROACH ABSTRACT Dr. K. Usha Rani Dept. of Computer Science Sri Padmavathi Mahila Visvavidyalayam (Women s University) Tirupati - 517502, Andhra
Predicting the Analysis of Heart Disease Symptoms Using Medicinal Data Mining Methods
Predicting the Analysis of Heart Disease Symptoms Using Medicinal Data Mining Methods V. Manikantan & S. Latha Department of Computer Science and Engineering, Mahendra Institute of Technology Tiruchengode,
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE Kasra Madadipouya 1 1 Department of Computing and Science, Asia Pacific University of Technology & Innovation ABSTRACT Today, enormous amount of data
Data Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
Towards applying Data Mining Techniques for Talent Mangement
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Towards applying Data Mining Techniques for Talent Mangement Hamidah Jantan 1,
ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA
ENSEMBLE DECISION TREE CLASSIFIER FOR BREAST CANCER DATA D.Lavanya 1 and Dr.K.Usha Rani 2 1 Research Scholar, Department of Computer Science, Sree Padmavathi Mahila Visvavidyalayam, Tirupati, Andhra Pradesh,
Information Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli ([email protected])
DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.
DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,
Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health
Lecture 1: Data Mining Overview and Process What is data mining? Example applications Definitions Multi disciplinary Techniques Major challenges The data mining process History of data mining Data mining
EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH
EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH SANGITA GUPTA 1, SUMA. V. 2 1 Jain University, Bangalore 2 Dayanada Sagar Institute, Bangalore, India Abstract- One
D A T A M I N I N G C L A S S I F I C A T I O N
D A T A M I N I N G C L A S S I F I C A T I O N FABRICIO VOZNIKA LEO NARDO VIA NA INTRODUCTION Nowadays there is huge amount of data being collected and stored in databases everywhere across the globe.
Introduction to Data Mining
Introduction to Data Mining José Hernández ndez-orallo Dpto.. de Systems Informáticos y Computación Universidad Politécnica de Valencia, Spain [email protected] Horsens, Denmark, 26th September 2005
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
Introduction to Data Mining Techniques
Introduction to Data Mining Techniques Dr. Rajni Jain 1 Introduction The last decade has experienced a revolution in information availability and exchange via the internet. In the same spirit, more and
Customer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Comparison of Data Mining Techniques used for Financial Data Analysis
Comparison of Data Mining Techniques used for Financial Data Analysis Abhijit A. Sawant 1, P. M. Chawan 2 1 Student, 2 Associate Professor, Department of Computer Technology, VJTI, Mumbai, INDIA Abstract
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
Divide-n-Discover Discretization based Data Exploration Framework for Healthcare Analytics
for Healthcare Analytics Si-Chi Chin,KiyanaZolfaghar,SenjutiBasuRoy,AnkurTeredesai,andPaulAmoroso Institute of Technology, The University of Washington -Tacoma,900CommerceStreet,Tacoma,WA980-00,U.S.A.
Data Mining: A Preprocessing Engine
Journal of Computer Science 2 (9): 735-739, 2006 ISSN 1549-3636 2005 Science Publications Data Mining: A Preprocessing Engine Luai Al Shalabi, Zyad Shaaban and Basel Kasasbeh Applied Science University,
APPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION ANALYSIS. email [email protected]
Eighth International IBPSA Conference Eindhoven, Netherlands August -4, 2003 APPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION Christoph Morbitzer, Paul Strachan 2 and
How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning
How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume
A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING
A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING Ahmet Selman BOZKIR Hacettepe University Computer Engineering Department, Ankara, Turkey [email protected] Ebru Akcapinar
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Enhanced Boosted Trees Technique for Customer Churn Prediction Model
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V5 PP 41-45 www.iosrjen.org Enhanced Boosted Trees Technique for Customer Churn Prediction
A New Approach for Evaluation of Data Mining Techniques
181 A New Approach for Evaluation of Data Mining s Moawia Elfaki Yahia 1, Murtada El-mukashfi El-taher 2 1 College of Computer Science and IT King Faisal University Saudi Arabia, Alhasa 31982 2 Faculty
Keywords data mining, prediction techniques, decision making.
Volume 5, Issue 4, April 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analysis of Datamining
from Larson Text By Susan Miertschin
Decision Tree Data Mining Example from Larson Text By Susan Miertschin 1 Problem The Maximum Miniatures Marketing Department wants to do a targeted mailing gpromoting the Mythic World line of figurines.
Random forest algorithm in big data environment
Random forest algorithm in big data environment Yingchun Liu * School of Economics and Management, Beihang University, Beijing 100191, China Received 1 September 2014, www.cmnt.lv Abstract Random forest
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 11 Sajjad Haider Fall 2013 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS
ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS Abstract D.Lavanya * Department of Computer Science, Sri Padmavathi Mahila University Tirupati, Andhra Pradesh, 517501, India [email protected]
Robust Outlier Detection Technique in Data Mining: A Univariate Approach
Robust Outlier Detection Technique in Data Mining: A Univariate Approach Singh Vijendra and Pathak Shivani Faculty of Engineering and Technology Mody Institute of Technology and Science Lakshmangarh, Sikar,
Data Mining with Microsoft SQL Server 2005
International DSI / Asia and Pacific DSI 2007 Full Paper (July, 2007) Data Mining with Microsoft SQL Server 2005 Henning Stolz 1), Peter Lehmann 1),Waranya Poonnawat 3) 1) Institute for Business Intelligence,
DataMining Clustering Techniques in the Prediction of Heart Disease using Attribute Selection Method
DataMining Clustering Techniques in the Prediction of Heart Disease using Attribute Selection Method Atul Kumar Pandey*, Prabhat Pandey**, K.L. Jaiswal***, Ashish Kumar Sen**** ABSTRACT Heart disease is
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Healthcare Data Mining: Prediction Inpatient Length of Stay
3rd International IEEE Conference Intelligent Systems, September 2006 Healthcare Data Mining: Prediction Inpatient Length of Peng Liu, Lei Lei, Junjie Yin, Wei Zhang, Wu Naijun, Elia El-Darzi 1 Abstract
Visual Mining of E-Customer Behavior Using Pixel Bar Charts
Visual Mining of E-Customer Behavior Using Pixel Bar Charts Ming C. Hao, Julian Ladisch*, Umeshwar Dayal, Meichun Hsu, Adrian Krug Hewlett Packard Research Laboratories, Palo Alto, CA. (ming_hao, dayal)@hpl.hp.com;
