BIG DATA IN HEALTHCARE: THE NEXT FRONTIER
Divyaa Krishna Sonnad 1, Dr. Jharna Majumdar 2
2 Dean R&D, Prof. and Head
1,2 Dept of CSE (PG), Nitte Meenakshi Institute of Technology

Abstract: Health care has changed enormously in recent years, driven by developments both positive and negative. Technology is advancing at a pace that leaves older systems outdated and of little use in the modern digital world. In health care, this technological advancement has widened access for consumers, and the amount of data generated every day is huge. This volume of data creates problems for both storage and accessibility. Various attempts to solve these problems proved unfruitful, which finally turned researchers' attention towards Big Data. Research in this area has been ongoing for over a decade. This paper proposes the use of Big Data for the storage and extraction of enormous volumes of data.

Keywords: Hadoop, Big data

I. INTRODUCTION

Almost every day a tremendous amount of data is generated, and every bit of information that is stored and accessed plays its part in solving various problems.[1][2] In health care, data is precious: every single piece of information helps to solve a particular problem and can lead to solving even bigger ones. Databases are filling up at high speed, so storage is a serious problem; missing information in this field may lead to casualties. Because storing this data is a high-priority problem, various databases have been developed to address it, but no satisfactory solution has yet emerged.
To overcome this problem, researchers took a step towards Big Data.[1][2][3] The use of big data analysis in health care seems to be the most logical solution. In health care a nanosecond can determine the survival of a person, so the data regarding that person, research information, past evaluations and all other relevant information must be accessible. In the words of Mr. Eric Schmidt, Executive Chairman of Google, from the very beginning of human civilization until the year 2003 the world had generated five exabytes of data, but the modern world now generates five exabytes of data every two days, and this pace keeps accelerating. These words hold true: enormous amounts of data can be found in every field, and especially in healthcare, where information regarding patients, research details, medical details, etc., is continually generated. Using various data mining techniques in big data, the problems of the existing data storage technology can be resolved.

II. PROPOSED WORK

The project deals with specialized techniques which allow the consumers, i.e., doctors, to predict the future consequences for a particular patient. The application starts with the Home Page, where users can log in with their credentials or register as a new user. A new user goes through registration, registering himself/herself as a doctor or a patient. Once registration is done the user can log in to the application. The application is designed so that there is no separate log-in for patients and doctors; it segregates the users by type and provides the functions each type requires. A patient user can enter all the essential details related to himself/herself and is assigned to a doctor. For a doctor user there are 6 flows in this application.
Search Patient's Medical Fitness: The doctor enters the unique id of the patient and checks the fitness of that patient.
Estimate Patient's Medical Fitness: The doctor enters the patient's test details and finds the medical fitness of the patient.
Patient's Dosage Details: The doctor selects All, In-Patient or Out-Patient dosage to find the medical details, and can then click on a medicine for the medicine detail and the patient dosage details.
Medicine Price Details: The doctor enters a Medicine Code and gets the list of all Medicine Details; clicking on a particular record shows the price of the medicine.
Emergency Admission: The doctor enters the admission total and finds out the results of the admissions.
Gastro Palpation Details: The doctor enters the patient's Palpation and gets the result of the Rectal Exam.

Copyright 2015. All rights reserved. 3164

In this application, data mining algorithms provide the main functionality for Big Data Analytics. Five such algorithms have been used in this project:
Naïve Bayes Algorithm.
Linear Regression Algorithm.
Decision Tree Algorithm.
K Nearest Neighbors Algorithm.
Artificial Bee Colony Algorithm.
The application and description of each algorithm are as follows:

A. Naïve Bayes Algorithm:
The Naïve Bayes classifier is a probabilistic classifier based on Bayes' theorem with strong independence assumptions. It is also called the Independent Feature Model, because the parameters are treated as independent of one another. Even if a parameter depends on another parameter directly or indirectly, the algorithm still treats them as independent. Depending on the precise nature of the probability model, the Naïve Bayes classifier can be trained very efficiently in a supervised learning setting. In many of its applications Naïve Bayes uses the method of maximum likelihood. The main advantage of this algorithm is that it needs only a small amount of training data to estimate the parameters necessary for classification.

B. Decision Tree Algorithm
A Decision Tree is a predictive model which maps observations about an item to conclusions about the item's target value. It is a model used in both data mining and statistics. Tree models in which the target variable can take a finite set of values are called classification trees.
In these trees the leaves represent the class labels and the branches represent the conjunctions of features that lead to those class labels. In its basic form, the Decision Tree technique cannot handle continuous variables directly, so continuous attributes have to be converted to discrete attributes, a process called discretization. The Decision Tree algorithm uses binary discretization for continuous-valued features. However, multi-interval discretization methods are known to produce more accurate decision trees than binary discretization. The two main issues that affect the performance of Decision Trees are:
The data discretization method used.
The type of Decision Tree used.

Figure 1: Flow Chart of Naïve Bayes Algorithm
Fig 2: Flow Chart of Decision Tree Algorithm

Linear Regression Algorithm
Linear regression is a mathematical technique that relates one variable to another, i.e., an independent variable to a dependent variable, in the form of the equation of a straight line. The linear equation is as follows:
y = a + bx
where,
y = Dependent Variable.
a = Intercept.
b = Slope of the line.
x = Independent Variable.
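The straight-line model above can be illustrated with a short sketch. This is a minimal example of fitting y = a + bx by ordinary least squares, not the implementation used in the project: the function names (fit_line, predict) and the admission counts are hypothetical and chosen only for illustration.

```python
from statistics import mean


def fit_line(xs, ys):
    """Estimate intercept a and slope b of y = a + b*x by ordinary least squares."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx          # slope of the line
    a = y_bar - b * x_bar  # intercept
    return a, b


def predict(a, b, x):
    """Evaluate the fitted line at x."""
    return a + b * x


# Hypothetical data: day index (x) against emergency admissions (y).
days = [1, 2, 3, 4, 5]
admissions = [12, 14, 16, 18, 20]
a, b = fit_line(days, admissions)
next_day_estimate = predict(a, b, 6)
```

With this made-up, perfectly linear data the fit recovers an intercept of 10 and a slope of 2; real admission data would be noisy and would call for checking the quality of the fit before relying on the prediction.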
Fig 3: Flow Chart of Linear Regression Algorithm

C. K-Nearest Neighbors Algorithm:
The K-Nearest Neighbors (KNN) algorithm is a predictive algorithm that stores all available cases and classifies new cases based on a similarity measure. A case is classified by a majority vote of its neighbors: it is assigned to the class most common among its K nearest neighbors. Nearness is measured using a distance function. When the dataset contains a mixture of numerical and categorical variables, the numerical variables should be standardized to the range 0 to 1. KNN has been used in statistical estimation and pattern recognition.

D. Artificial Bee Colony Algorithm:
The Artificial Bee Colony (ABC) algorithm is a population-based meta-heuristic algorithm inspired by the intelligent behavior of honey bees. In this algorithm a methodology called clustering is used, where the data is organized into groups called clusters. Items of a related type are stored together, i.e., similar data is saved under one particular cluster. The advantages of this algorithm are as follows:
It employs only three control parameters.
It has a very fast convergence speed.
It is robust, simple and flexible.
It can easily be combined with other algorithms.
It has three phases:
Employed bee phase.
Onlooker bee phase.
Scout bee phase.

Fig 4: Flow Chart of K-Nearest Neighbour Algorithm
Fig 5: Flow Chart of Artificial Bee Colony Algorithm

III. IMPLEMENTATION
Big data is an emerging technology. It has not yet been applied in many application areas, but wherever it has been implemented so far it has given tremendous results in terms of efficiency. In this project I am using Data Mining Techniques to estimate the patient data and analyse the requirement. Big data is now becoming a resource for addressing a great many problems in the field of healthcare.
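As one illustration of the data mining techniques mentioned above, the K-Nearest Neighbors procedure of Section II.C (scale numerical variables to the range 0 to 1, measure distance, take a majority vote) can be sketched as follows. This is a minimal sketch under stated assumptions, not the project's code: the helper names, the Euclidean distance, the choice of k = 3, and the glucose/BMI records are all hypothetical.

```python
from collections import Counter
from math import dist


def min_max_scale(rows):
    """Rescale each numeric column of rows to [0, 1].

    Returns the scaled rows and a function that applies the same scaling
    to a new row (values outside the training range fall outside [0, 1]).
    """
    columns = list(zip(*rows))
    lows = [min(col) for col in columns]
    # A constant column has zero range; use 1.0 to avoid division by zero.
    spans = [(max(col) - min(col)) or 1.0 for col in columns]

    def scale(row):
        return tuple((v - lo) / span for v, lo, span in zip(row, lows, spans))

    return [scale(row) for row in rows], scale


def knn_classify(train_rows, labels, query, k=3):
    """Assign query to the class most common among its k nearest neighbours."""
    scaled_rows, scale = min_max_scale(train_rows)
    scaled_query = scale(query)
    # Sort training cases by Euclidean distance to the query and keep k of them.
    nearest = sorted(
        zip(scaled_rows, labels),
        key=lambda pair: dist(pair[0], scaled_query),
    )[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical patient records: (fasting glucose, body-mass index).
records = [(85, 22.0), (90, 24.5), (95, 23.0), (150, 31.0), (160, 33.5), (170, 35.0)]
classes = ["non-diabetic"] * 3 + ["diabetic"] * 3

high_risk = knn_classify(records, classes, (155, 32.0))
low_risk = knn_classify(records, classes, (88, 23.0))
```

Scaling before measuring distance keeps a variable with a large numeric range (here glucose) from dominating one with a small range (here BMI), which is the standardization issue the KNN description raises.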
Comparison of the Algorithms:
Table: Comparison of Algorithm
The table shows the following:
For the Diabetes scenario, the Naïve Bayes Algorithm is compared with the Decision Tree Algorithm and the K-Nearest Neighbour Algorithm.
For Medical Fitness, the Decision Tree Algorithm is compared with the Naïve Bayes Algorithm and the Artificial Bee Colony Algorithm.
For Emergency Admission, the Linear Regression Algorithm is best suited in terms of efficiency when compared with the K-Nearest Neighbour Algorithm and the Artificial Bee Colony Algorithm.
For Gastro Palpation, the Decision Tree Algorithm is compared with the Naïve Bayes Algorithm.

Sequence Diagram
Figure 6: Application Sequence Diagram
Figure 7: Data processing, mining and statistical simulation modelling workflow

IV. CONCLUSION AND FUTURE SCOPE
In this project I have used various data mining techniques in a big data application, which demonstrates the efficiency of using big data in the healthcare field. Technology is upgraded every day, and for any issue many solutions emerge, yet efficiency remains the deciding factor for any technology to be accepted. Big data analysis is a revolutionary technology that is changing the efficiency of applications in healthcare. The efficiency gained by using data mining techniques with big data has been a turning point, leading researchers across the world to look into big data. This application helps doctors to predict and analyse possible solutions to help their patients. It gives a doctor the ability to foresee future consequences for a particular patient and helps in resolving the patient's health issues.
REFERENCES
[1] Kuo Lane Chen, Huei Lee, The Impact of Big Data on the Healthcare Information Systems, Transactions of the International Conference on Health Information Technology Advancement, 2013.
[2] Harsh Kupwade Patil and Ravi Seshadri (Nanthealth), Big data security and privacy issues in healthcare, 2014 IEEE International Conference on Big Data.
[3] Mai Shouman, Tim Turner, Rob Stocker, Using Decision Tree for Diagnosing Heart Disease Patients, Proceedings of the 9th Australasian Data Mining Conference, Australia.
[4] G. Subbalakshmi, K. Ramesh, M. Chinna Rao, Decision Support in Heart Disease Prediction System using Naive Bayes, Indian Journal of Computer Science and Engineering (IJCSE), 2011.
[5] Twinkle Gupta, Dharmender Kumar, Optimization of Clustering Problem Using Population Based Artificial Bee Colony Algorithm: A Review, International Journal of Advanced Research in Computer Science and Software Engineering, 2014.
[6] Vimal Nayak, Haresh Suthar, Jagrut Gadit, Implementation of Artificial Bee Colony Algorithm, IAES International Journal of Artificial Intelligence (IJ-AI), 2012.
[7] Ashiq Imran, Rajeev Agrawal, Jessie Walker, Anthony Gomes, A Layer Based Architecture for Provenance in Big Data, 2014 IEEE International Conference on Big Data.
[8] Tae-Woong Kim, Kwang-Ho Park, Sang-Hoon Yi, Hee-Cheol Kim, A Big Data Framework for u-healthcare Systems Utilizing Vital Signs, International Symposium on Computer, Consumer and Control.
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
A Review on Data Mining in Cloud Computing Environment
A Review on Data Mining in Cloud Computing Environment R.Kabilan, Dr.N.Jayaveeran Research Scholar, P.G & Research Dept. of Computer Science, Khadir Mohideen College, Adirampattinam, Thanjavur District.
COURSE RECOMMENDER SYSTEM IN E-LEARNING
International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand
Index Contents Page No. Introduction . Data Mining & Knowledge Discovery
Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.
Learning is a very general term denoting the way in which agents:
What is learning? Learning is a very general term denoting the way in which agents: Acquire and organize knowledge (by building, modifying and organizing internal representations of some external reality);
Data Mining for Business Analytics
Data Mining for Business Analytics Lecture 2: Introduction to Predictive Modeling Stern School of Business New York University Spring 2014 MegaTelCo: Predicting Customer Churn You just landed a great analytical
CLOUD COMPUTING PARTITIONING ALGORITHM AND LOAD BALANCING ALGORITHM
CLOUD COMPUTING PARTITIONING ALGORITHM AND LOAD BALANCING ALGORITHM Anisaara Nadaph 1 and Prof. Vikas Maral 2 1 Department of Computer Engineering, K.J College of Engineering and Management Research Pune
Data Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification
Web Data Mining: A Case Study. Abstract. Introduction
Web Data Mining: A Case Study Samia Jones Galveston College, Galveston, TX 77550 Omprakash K. Gupta Prairie View A&M, Prairie View, TX 77446 [email protected] Abstract With an enormous amount of data stored
Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1
Data Mining 1 Introduction 2 Data Mining methods Alfred Holl Data Mining 1 1 Introduction 1.1 Motivation 1.2 Goals and problems 1.3 Definitions 1.4 Roots 1.5 Data Mining process 1.6 Epistemological constraints
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
PharmaSUG2011 Paper HS03
PharmaSUG2011 Paper HS03 Using SAS Predictive Modeling to Investigate the Asthma s Patient Future Hospitalization Risk Yehia H. Khalil, University of Louisville, Louisville, KY, US ABSTRACT The focus of
CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19
PREFACE xi 1 INTRODUCTION 1 1.1 Overview 1 1.2 Definition 1 1.3 Preparation 2 1.3.1 Overview 2 1.3.2 Accessing Tabular Data 3 1.3.3 Accessing Unstructured Data 3 1.3.4 Understanding the Variables and Observations
Improving spam mail filtering using classification algorithms with discretization Filter
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
Detection of Heart Diseases by Mathematical Artificial Intelligence Algorithm Using Phonocardiogram Signals
International Journal of Innovation and Applied Studies ISSN 2028-9324 Vol. 3 No. 1 May 2013, pp. 145-150 2013 Innovative Space of Scientific Research Journals http://www.issr-journals.org/ijias/ Detection
Scalable Developments for Big Data Analytics in Remote Sensing
Scalable Developments for Big Data Analytics in Remote Sensing Federated Systems and Data Division Research Group High Productivity Data Processing Dr.-Ing. Morris Riedel et al. Research Group Leader,
Predictive Analytics Powered by SAP HANA. Cary Bourgeois Principal Solution Advisor Platform and Analytics
Predictive Analytics Powered by SAP HANA Cary Bourgeois Principal Solution Advisor Platform and Analytics Agenda Introduction to Predictive Analytics Key capabilities of SAP HANA for in-memory predictive
CoolaData Predictive Analytics
CoolaData Predictive Analytics 9 3 6 About CoolaData CoolaData empowers online companies to become proactive and predictive without having to develop, store, manage or monitor data themselves. It is an
Big Data Analytics for Mitigating Insider Risks in Electronic Medical Records
Big Data Analytics for Mitigating Insider Risks in Electronic Medical Records Bradley Malin, Ph.D. Associate Prof. & Vice Chair of Biomedical Informatics, School of Medicine Associate Prof. of Computer
