2.1. Data Mining for Biomedical and DNA data analysis



Similar documents
Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

Healthcare Measurement Analysis Using Data mining Techniques

Data Mining Solutions for the Business Environment

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

not possible or was possible at a high cost for collecting the data.

Data mining in the e-learning domain

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

Data Mining System, Functionalities and Applications: A Radical Review

DATA MINING TECHNIQUES AND APPLICATIONS

Introduction to Data Mining

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

An Overview of Knowledge Discovery Database and Data mining Techniques

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Dynamic Data in terms of Data Mining Streams

Introduction to Data Mining

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

Applications and Trends in Data Mining

TEXT ANALYTICS INTEGRATION

Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH

Data Mining: Overview. What is Data Mining?

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Data Mining for Fun and Profit

Chapter ML:XI. XI. Cluster Analysis

Overview Applications of Data Mining In Health Care: The Case Study of Arusha Region

Customer Classification And Prediction Based On Data Mining Technique

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Data Mining in Telecommunication

A Review of Data Mining Techniques

Data Mining Analytics for Business Intelligence and Decision Support

SPATIAL DATA CLASSIFICATION AND DATA MINING

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

ISSN: A Review: Image Retrieval Using Web Multimedia Mining

Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction

PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY

International Journal of World Research, Vol: I Issue XIII, December 2008, Print ISSN: X DATA MINING TECHNIQUES AND STOCK MARKET

Chapter 3: Cluster Analysis

Syllabus. HMI 7437: Data Warehousing and Data/Text Mining for Healthcare

Session 10 : E-business models, Big Data, Data Mining, Cloud Computing

Data Mining: Concepts and Techniques

Introduction. A. Bellaachia Page: 1

Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

Information Management course

Chapter 2 Literature Review

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Application of Data Mining Methods in Health Care Databases

Foundations of Business Intelligence: Databases and Information Management

Data Mining in Web Search Engine Optimization and User Assisted Rank Results

Web Mining as a Tool for Understanding Online Learning

Keywords Data Mining, Knowledge Discovery, Direct Marketing, Classification Techniques, Customer Relationship Management

Introduction to Data Mining

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Mining Online GIS for Crime Rate and Models based on Frequent Pattern Analysis

COURSE RECOMMENDER SYSTEM IN E-LEARNING

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

ANALYSIS OF FEATURE SELECTION WITH CLASSFICATION: BREAST CANCER DATASETS

A Statistical Text Mining Method for Patent Analysis

Static Data Mining Algorithm with Progressive Approach for Mining Knowledge

Introduction to Data Mining

PREPROCESSING OF WEB LOGS

Significance of Data Warehousing and Data Mining in Business Applications

Data Mining: Concepts and Techniques

Business Intelligence and Decision Support Systems

Data Mining for Everyone

1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining

APPLICATION OF DATA MINING TECHNIQUES FOR BUILDING SIMULATION PERFORMANCE PREDICTION ANALYSIS.

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users

Data Warehousing and Data Mining

INVESTIGATIONS INTO EFFECTIVENESS OF GAUSSIAN AND NEAREST MEAN CLASSIFIERS FOR SPAM DETECTION

An Empirical Study of Application of Data Mining Techniques in Library System

Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

Concept and Applications of Data Mining. Week 1

Database Marketing, Business Intelligence and Knowledge Discovery

Mining an Online Auctions Data Warehouse

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc.

Data Mining Governance for Service Oriented Architecture

Dr. U. Devi Prasad Associate Professor Hyderabad Business School GITAM University, Hyderabad

Tracking System for GPS Devices and Mining of Spatial Data

Implementation of Data Mining Techniques to Perform Market Analysis

Data Mining. Knowledge Discovery, Data Warehousing and Machine Learning Final remarks. Lecturer: JERZY STEFANOWSKI

Importance or the Role of Data Warehousing and Data Mining in Business Applications

Grid Density Clustering Algorithm

Ezgi Dinçerden. Marmara University, Istanbul, Turkey

Use of Data Mining in Banking

Data are everywhere. IBM projects that every day we generate 2.5

Data Mining: Introduction. Lecture Notes for Chapter 1. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler

KNOWLEDGE DISCOVERY FOR SUPPLY CHAIN MANAGEMENT SYSTEMS: A SCHEMA COMPOSITION APPROACH

Prediction of Heart Disease Using Naïve Bayes Algorithm

EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS DATA MINING APPROACH

Transcription:

Applications of Data Mining Simmi Bagga Assistant Professor Sant Hira Dass Kanya Maha Vidyalaya, Kala Sanghian, Distt Kpt, India (Email: simmibagga12@gmail.com) Dr. G.N. Singh Department of Physics and Computer Science, Sudarshan Degree College, Lalgaon, Rewa (MP), India Abstract Data Mining is a powerful tool capable of handling decision making and for forecasting future trends of market. Data Mining tools and techniques can be successfully applied in various fields in various forms. In this paper, we will explain various applications of Data Mining. We will also explain how Data Mining can integrate with other fields. Keywords DATA MINING, FINANCIAL DATA ANALYSIS, APPLICATIONS 1. Introduction Data Mining is main concerned with the analysis of data and Data Mining tools and techniques are used for finding patterns from the data set. The main objective of Data Mining is to find patterns automatically with minimal user input and efforts. Data Mining is a powerful tool capable of handling decision making and for forecasting future trends of market. Data Mining tools and techniques can be successfully applied in various fields in various forms. Many Organizations now start using Data Mining as a tool, to deal with the competitive environment for data analysis. By using Mining tools and techniques, various fields of business get benefit by easily evaluate various trends and pattern of market and to produce quick and effective market trend analysis. 2. DATA MINING APPLICATIONS Data Mining applications are widely used in Health industry, Auditing, Telecommunication industry, Retail industry etc. 2.1. Data Mining for Biomedical and DNA data analysis In recent years, Data Mining has been widely used in area of Medical science such as Biomedical, DNA, Genetics and Medicine etc. In the area of Genetics, the important goal is to understand the mapping relationship between the variation in human DNA sequences and the disease susceptibility. Data Mining is very important tool to help improve the diagnosis, prevention and treatment of the diseases.

Due to growth in biomedical research, the large scale of genes patterns and functions has to be studied. Data Mining tools helps greatly to study DNA analysis and to find various patterns and functions. Data Mining helps the researches of Biomedical and DNA data analysis in the following ways: The DNA data is generally heterogeneous, highly distributed and uncontrolled in nature but this data is highly used in various areas of research in medical science. For the research purpose data must be systematic. Data Cleansing and integration approaches of Data Mining helps to systemize the data and store this data in the data base or data warehouse for further use in research. One of the main tasks related to DNA data analysis is to compare various sequences of the data and search for the similarities among this DNA data. Comparison mainly involves the gene sequence of healthy and diseased tissues to find the difference between these two types. This can be done by retrieving the gene sequence of both healthy and disease tissue classes and then finds the frequently occurring patterns of both classes. This analysis helps to find the similarities and dissimilarities in genetic sequence. In Biomedical research, it is studied that most of the diseases are triggered by combination of genes. Association analysis method is used to determine the cooccurrence of the group of genes and also we can study the interaction and relationships between the genes. There are different combinations of genes contributes to diseases but these different genes becomes activate at different level of stages. Path analysis is used to link the different genes to different stages of disease development. Path analysis performs an important role in genetic study. Visualization tools also play an important role in biomedical Data Mining. The visualization tools helps to present complex genes structure in graphs, trees and chains. The visual representation helps to better understanding of complex genes structures, for knowledge discovery and data exploration. 2.2. Data Mining as Financial Data Analysis Financial data is mainly collected from banks and from other financial sectors. This financial data is usually reliable, complete and has high quality. Financial data need a systematic method for data analysis. Data Mining plays an important in analysis of financial data. Data Mining follows steps such as data collection and understanding, data refinement, model building and model evaluation and deployment. These steps help to deal with analysis of financial data. The proper analysis of financial data enables us to better decisions making capabilities according to the market analysis. Data Mining tools and techniques helps to analyze the financial data in the following ways: Data collected from the various financial institutes like banks are first collected in the data warehouse. Multidimensional data analysis techniques are used to analyze such data collected in data warehouse for its general properties. One of main task related to analyze about the prediction about loan payments and customer credit policies. For such analysis, Data Mining methods such as feature selection that helps to identifies the various features like customer

income level, payment to income ratio and credit history etc. By analyze of such features the bank can decide about the loan grant policies on the basis of relatively low risks. Clustering and Classification techniques of Data Mining help the financial institutes to cluster various customers that have the similar features. Effective clustering and filtering methods helps the bank to identify the customer groups, relate new customer with the present cluster and facilitate them some common benefits. Data Mining tools helps the financial institutes to detect the frauds or crimes by interrelate data from the various data bases and from the history transactions done by that customers. Data Visualization tools help to present data in the different format like graphs based on the certain attributes. By viewing data from different angles the bank can view the customers who have performed some illegal operations and then the detail investigation of these suspicious cases helps to find the frauds and crimes. 2.3. Data Mining for the Retail Industry Data Mining plays an important role in the retail industry also. Retail industry involves large amount of data that includes transportation, sales and consumptions of goods and services. This data grows rapidly due to increase in purchase and sales in business. These days, E-commerce is growing fast with the growth of companies and also improving the online experience. Electronic commerce describes the buying and selling of products, services, and information via computer networks including the Internet. It can be viewed as a business concept that handles effectively and efficiently all previous business management and economic concepts. E-commerce streamlined the business processes, flatter organizational hierarchies and inter-firm collaboration. The databases have become treasure and essential e-commerce tools. To take advantage of the data base, Data Mining must be integrated into the e-commerce systems. Retail Data mining helps in identifying customer behaviour, shopping patterns and distribution policies etc. As retail data in a very large in quantity, so we design data warehouse to store this large data and effective analysis of data. The main decision has to take while designing the data warehouse is dimension, level and preprocessing to perform the quality and efficient data mining. The main requirement of the retail industry is timely information regarding the customer requirements, trends of market with the cost, profit and quality etc. To get these information effectively and efficiently, powerful multidimensional analysis and visualization tools are required. The retail industry also do some sort of advertisements, give discounts to customers to attract them. An effective analysis is required to get information to check the effect of advertisement on the sales. This analysis can be done by checking the amount of sale done during the sales period before the advertisement and the amount of sale after the advertisement. The sequential pattern mining used to find the change in the customer consumption, adjustment of price and variety in various goods in order to attract customer. Association analysis also helps to get information in order to promote sales. In this way, various Data Mining tools help in the retail industry.

2.4. Data Mining for Telecommunication Industry Telecommunication Industry has been growing very fast as the technology grows. These days Telecommunication services has grown from local and long distance voice communication services to fax, pager, cellular phones and e-mails. Now the telecommunication services have integrated with the computer, internet, and network and with other communication technologies. Due to the advancements in telecommunication technologies and to work these technologies effectively, Data Mining techniques integrated with these technologies to produce effective results. Data Mining helps to identify telecommunications patterns, fraud activities and also helps to better use of resources and improve the quality of services. Data Mining improves the telecommunication services in the following ways: Telecommunication data involves type of call, location of caller, location of called, time of call and duration of call etc. Multidimensional data analysis helps to identify and compare the system load, data traffic and profit etc. Analyst can view the charts and graphs of calling resources, destinations etc by using the visualization tools of Data Mining. The main problem faced by the telecommunication industry is due to the fraudulent activities. These fraudulent activities may involve fraudulent calls during busy hour, periodic calls etc. These activities may effect on the performance of the communication network. Data Mining methods such as cluster analysis, outlier analysis helps to detect fraudulent patterns and improves the efficiency of the communication services. The association and sequential pattern analysis helps to promote various telecommunication services. Visualization tools such as association visualization, clustering and association visualization shows very useful telecommunication data analysis. The widespread changes in the adoption and utilization of new technologies in business, even small business has large number of financial transaction. It s the responsibility of the analyzer to analyze these transactions to detect frauds and errors in financial transactions. Due to change in business trends, it s very difficult and complicated to analyze financial transactions by manual methods. Due to limitations of in manual analysis of complex data, we use Data Mining tools and techniques in various areas to get effective results. 3. Conclusions Data Mining can be used in various fields like retail industry, telecommunication industry etc. In Retail industry Data mining helps in identifying customer behaviour, shopping patterns and distribution policies etc. Data Mining also helps to identify telecommunications patterns, fraud activities and also helps to better use of resources and improves the quality of services.. Data mining tools helps greatly to study DNA analysis and to find various patterns and functions. In this we explained various applications of Data Mining.

References [1] Fayyad, U.M. Data Mining and Knowledge Discovery: Making Sense Out of Data, IEEE Expert 11(5), 1996. [2] T. Imielinski and H. Mannila. A database perspective on knowledge discovery. Communications of ACM, 39, 1996. [3] J. Han and M. Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann, 2000. [4] Elder, John F., IV and Daryl Pregibon, (1996), Advances in Knowledge Discovery & Data Mining, A Statistical Perspective on KDD. [5] Chen, M. et al, 1996. Data Mining: An Overview from a Database Perspective. IEEE Transactions on Knowledge and Data Engineering, Vol. 8. [6] Han, J., & Kamber, M. (2006). Data mining concepts and techniques. Boston: Elsevier. [7] Dunham, M.H. (2003). Data mining introductory and advanced topics. Upper Saddle River, NJ: Pearson Education, Inc. [8] Yao, Y.Y. A step towards the foundations of data mining, in: Data Mining and Knowledge Discovery: Theory, Tools, and Technology V, Dasarathy, B.V. (ed.), The International Society for Optical Engineering. [9] Singh G.N, Bagga Simmi(2011) Three Phase Iterative Model of KDD, International Journal of Information Technology and Knowledge Management, Volume 4, No. 2, pp.695-697. [10] Singh G.N, Bagga Simmi (2011), Clustering Method for categorical and Numeric Data sets, Global Journal in computer Science.