A PRAGMATIC ANALYSIS OF DATA MINING TECHNIQUES IN MULTIPLE APPLICATIONS AND CASES



Similar documents
Web Data Mining: A Case Study. Abstract. Introduction

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?

Database Marketing, Business Intelligence and Knowledge Discovery

IJMIE Volume 2, Issue 5 ISSN:

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users

Data Warehousing and Data Mining in Business Applications

Data mining for prediction

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Data Mining for Successful Healthcare Organizations

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Building a Database to Predict Customer Needs

DATA MINING AND WAREHOUSING CONCEPTS

Data Mining Solutions for the Business Environment

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

International Journal of Arts and Science Research Journal home page:

DATA WAREHOUSE AND DATA MINING NECCESSITY OR USELESS INVESTMENT

Sunnie Chung. Cleveland State University

Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management

Principles of Data Mining by Hand&Mannila&Smyth

Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept

DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY

A Review of Data Mining Techniques

PREDICTIVE DATA MINING ON WEB-BASED E-COMMERCE STORE

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Gerard Mc Nulty Systems Optimisation Ltd BA.,B.A.I.,C.Eng.,F.I.E.I

Data Mining and Science?

Information Management course

How to create an effective data management strategy

SPATIAL DATA CLASSIFICATION AND DATA MINING

Dynamic Data in terms of Data Mining Streams

Data Mining Industry: Emerging Trends and New Opportunities

Introduction to Data Mining

Data Mining and Statistics: What is the Connection?

Federico Rajola. Customer Relationship. Management in the. Financial Industry. Organizational Processes and. Technology Innovation.

Importance or the Role of Data Warehousing and Data Mining in Business Applications

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH

Data Warehouse design

A Survey on Web Research for Data Mining

Executive Briefing White Paper Plant Performance Predictive Analytics

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc.

Information management software solutions White paper. Powerful data warehousing performance with IBM Red Brick Warehouse

not possible or was possible at a high cost for collecting the data.

Introduction to Data Mining

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities

Better decision making under uncertain conditions using Monte Carlo Simulation

Data Mining for Everyone

Quality Control of National Genetic Evaluation Results Using Data-Mining Techniques; A Progress Report

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010

HELSINKI UNIVERSITY OF TECHNOLOGY T Enterprise Systems Integration, Data warehousing and Data mining: an Introduction

Business Intelligence and Decision Support Systems

An Overview of Knowledge Discovery Database and Data mining Techniques

Data Mining Applications in Fund Raising

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY

Five predictive imperatives for maximizing customer value

from Larson Text By Susan Miertschin

Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90

Data Mart/Warehouse: Progress and Vision

Distance Learning and Examining Systems

Data W a Ware r house house and and OLAP II Week 6 1

Foundations of Business Intelligence: Databases and Information Management

A Business Intelligence Training Document Using the Walton College Enterprise Systems Platform and Teradata University Network Tools Abstract

Subject Description Form

How To Use Neural Networks In Data Mining

Better planning and forecasting with IBM Predictive Analytics

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Introduction to Data Mining

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

The Data Mining Process

Use of Data Mining in Banking

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

Introduction. A. Bellaachia Page: 1

DEMYSTIFYING BIG DATA. What it is, what it isn t, and what it can do for you.

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Data Warehouse Architecture Overview

Data Mining Algorithms Part 1. Dejan Sarka

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI

Table of Contents Section 8: How Children Learn

Supply chain intelligence: benefits, techniques and future trends

DSS based on Data Warehouse

Data Mining: An Introduction

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

Improving Decision Making and Managing Knowledge

III JORNADAS DE DATA MINING

Developing a Business Analytics Roadmap

Hexaware E-book on Predictive Analytics

Business Analytics and Data Visualization. Decision Support Systems Chattrakul Sombattheera

Integrated Data Mining and Knowledge Discovery Techniques in ERP

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Analytics For Everyone - Even You

Framework for Data warehouse architectural components

TEACHING AN APPLIED BUSINESS INTELLIGENCE COURSE

How To Use Data Mining For Loyalty Based Management

Visualization methods for patent data

(b) How data mining is different from knowledge discovery in databases (KDD)? Explain.

Transcription:

A PRAGMATIC ANALYSIS OF DATA MINING TECHNIQUES IN MULTIPLE APPLICATIONS AND CASES Vidhyotma Chitkara University Rajpura, Punjab, India ABSTRACT Data mining contains the use of innovative information research resources to discover previously unidentified, legitimate styles and connections in huge information sets. These resources can include statistical models, statistical methods, and device studying methods (algorithms that improve their overall performance instantly through experience, such as sensory systems or choice trees). Consequently, data mining contains more than gathering and handling information, it also contains research and prediction. Data mining software allows users to evaluate huge directories to fix company choice problems. Data mining is, in some ways, an expansion of research, with a few synthetic intellect and device studying creativities tossed in. Like research, data mining is not a company solution, it is just a technology. Most data-mining methods

from research, routine identification, and device studying believe information are in the main memory and pay no attention to how the criteria smashes down if only restricted opinions of the information are possible. A related field changing from directories is information warehousing, which is the term for the popular company pattern of gathering and cleaning transactional information to make them available for online research and choice support. INTRODUCTION In modern times, data-mining (DM) has become one of the most beneficial sources for getting and modifying details and for developing designs in order to generate useful details for decision-making. Data mining software, using various strategies, have been developed by both expert and research centers. These methods have been used for expert, expert and medical requirements. For example, details discovery has been used to evaluate huge details locations and identify useful category and designs in the facts locations. Data mining, the removal of unseen predictive details from large internet directories, is a impressive new engineering with great prospective to help organizations focus on the most essential info in their details manufacturing facilities. Information discovery sources calculate future designs and habits, allowing organizations to create practical, knowledge-driven choices. The automated, prospective research offered by details discovery move beyond the research of previous activities offered by retrospective sources common of choice support systems. Data mining tools can

response organization questions that typically were too difficult to take care of. They search internet directories for unseen designs, finding predictive details that professionals may miss because it can be found outside their goals. Data mining uses well-established mathematical and device studying methods to build models that estimate customer behavior. These days, engineering performs the exploration process, combines it with expert data manufacturing facilities, and presents it in a relevant way for organization users. The leading data mining products are now more than just modeling engines employing powerful methods. Instead, they data miningress the wider organization and details, such as their incorporation into today's complex it surroundings. In organization nowadays, organizations are working quickly to obtain valuable competitive benefits over other organizations. A fast-growing and popular engineering, which can help to obtain these benefits, is data mining. Data mining engineering allows an organization to use the great quantities of details that it has collected, and create connections and relationships among this data to help organizations improve performance, learn more about its customers, create better choices, and help in planning. Data mining has three major components Clustering or Category, Association

Rules and Series Analysis. This engineering can create these studies on its own, using a blend of research, artificial intellect, device studying methods, and data stores. Data mining, by its easiest meaning, performs the identification of appropriate styles in an information source. For example, a routine might indicate that we data mininged men with kids are twice more likely to generate a particular activities car than we data mininged men with no kids. If you are a marketing administrator for an auto producer, this somewhat amazing routine might be quite useful. However, data mining is not miracle. For many years, statisticians have personally "mined" directories, looking for mathematically considerable styles. Data mining (DM) is an area that has lately drawn the attention of various scientists and companies data mining is the process of finding workable and considerable styles, information and styles by smelling through your information using routine identification technology such as sensory systems, device learning and inherited methods. DM resources can response business concerns that typically were too time intensive to take care of. They search directories for invisible styles, finding predictive information that professionals may skip because it can be found outside their anticipations. DATA MINING TECHNIQUES

There are several major data mining techniques that are used in data mining projects including association, classification, clustering, prediction and sequential patterns. Association Association is one of the best known data mining strategy. It is a process of studying a routine from illustrations and using the designed design to estimate upcoming principles of the focus on varying. For example, the association strategy is used in market container research to recognize what items that clients regularly purchase together. Based on this information companies can have corresponding advertising strategy to offer more items to make more benefit. Classification Classification is a data mining strategy based on device studying. Often, classification is used to categorize each product in a set of information into one set of team. It is a process of discovering operates that charts information into one of several distinct sessions. Classification method makes use of statistical methods such as choice plants, straight line development, sensory system and research. In classification, we make the application that can learn how to classify the information items into categories. For example, given all past information of workers who remaining the company;

estimate which present workers are probably to keep later on. In this case, we split the individual's information into two categories that are leave and stay. And then we can ask our data mining application to categorize the workers into each team. Clustering Clustering is a task of classifying groups of records that are similar between them but different from other data. The only difference from classification, it is also defines the classes and put objects in them, while classification objects are assigned into predefined classes. Prediction The prediction as it name recommended is one of a data mining techniques that finds connection between separate factors and connection between reliant and separate factors. For example, prediction research technique can be used in purchase to estimate benefit for the future if we consider purchase is a separate varying, benefit could be a reliant varying. Then based on the traditional purchase and benefit data, we can sketch a fixed regression bend that is used for benefit prediction.

Sequential Patterns Sequential patterns analysis in one of data mining technique that looks for to discover similar styles in information deal over a company period. The discover styles are used for further company research to recognize connections among information. DATA MINING IN INDUSTRY Data mining is becoming progressively common in both the team and individual locations. Places such as cost-effective, insurance coverage, medication, and offering usually use data mining to web page, enhance research, and enhance income. In the team market, data mining programs originally were used as a way for recognize scammers and spend, but have started to also be used for requirements such as identifying and enhancing program performance. Data mining programs are often structured around the particular needs of a market industry or even designed and designed for just one company. This is because the styles within data may be very particular. Economical data mining programs may, for example, need to notice customer investing exercises to be able to recognize uncommon dealings that might be fake. Recently, data mining has been progressively described as an essential program for birthplace protection tasks. Some experts recommend that data mining should be used as a way for recognize enemy actions,

such as cash dealings and devices, and to recognize and notice individual terrorists themselves, such as through trip and migrants information. For data mining to affect a company, it needs to have importance to the actual company procedure. Data mining is aspect of a much bigger sequence of actions that happens between a company and its customers. The way in which data mining results a company relies on the company procedure, not the data mining procedure. Take product promotion as an example. A promotion manager's job is to comprehend their market. With this knowing comes the capability to hook up with customers in this market, using numerous programs. This has numerous locations, such as immediate promotion, make promotion, telesales, and radio/television promotion, among others. Data mining, however, ingredients information from directories that the individual did not know ongoing. Connections between factors and customer exercises that are non-intuitive are the jewelry that data mining wishes to discover. And because the individual does not know beforehand what the data mining procedure has found, it is a much bigger jump to take caused by the program and converts it into a remedy to a company problem. This is where relationships and perspective comes in. Marketing customers need to perspective the results of data mining before they can put them into actions. Because data mining usually contains getting "hidata miningen" styles of customer actions, the knowing procedure can get a bit complex. The key is to put the individual in a perspective in which they experience, and then let them keep and power until they know what they didn't see before.

Business Data Analysis A company or an company protecting data mining techniques can appreciate a variety of benefits; these contains understanding customer's actions, creating a thinking on the strength of the organization's web site- if there is one, and benchmarking marketing techniques. Data mining is a very essential system for company and in the future, company is becoming more and more competitive and everyone is having difficulties for advantages against your opponents. Organizations need to acquire an advantages against your opponents, and can get it from the enhanced interest they can get from data mining program that is available in the market right. DD Evolutionary Chart Evolutionary Business Question Enabling Product Characteristics Step Technologies Providers Data "What was my total Computers, tapes, IBM, CDC Retrospective, Collection revenue in the last disks static data five years?" delivery

(1960s) Data Access "What were unit Relational Oracle, Retrospective, (1980s) sales in New England last March?" databases (RDBMS), Sybase, Informix, dynamic data delivery at Structured Query IBM, record level Language (SQL), Microsoft ODBC Data "What were unit On-line analytic Pilot, Retrospective, Warehousing sales in New processing Comshare, dynamic data & England last March? (OLAP), Arbor, delivery at Decision Support Drill down to Boston." multidimensional databases, data warehouses Cognos, Microstrategy multiple levels (1990s) data mining "What s likely to Advanced Pilot, Prospective, (Emerging Today) happen to Boston unit sales next month? Why?" algorithms, multiprocessor computers, Lockheed, IBM, SGI, numerous proactive information delivery This chart is from: http://www.thearling.com/text/dmwhite/dmwhite.htm

massive databases startups (nascent industry) Data Mining and Management To be successful, data source promoters must first recognize areas containing clients or leads with high-profit potential. They then build and perform strategies that positively impact the actions of these individuals. The first task, determining areas, needs significant information about potential buyers and their buying habits. Theoretically, the more information the better. In practice, however, massive information stores often prevent promoters, who battle to dig through the details to discover the blocks of useful information. Recently, promoters have data mininged a new class of application to their focusing on collection. data mining programs improve the process of searching the hills of information to discover styles that are good predictors of purchasing habits. After exploration the information, promoters must supply the results into strategy store that, as the name indicates, controls the strategy instructed at the described areas. In the past, the link between data mining and strategy store was mostly guide. In the toughest cases, it involved "sneaker net," creating a physical file on

record or hard drive, which someone then carried to another computer and packed into the marketing data source. This separating of the data mining and strategy store provides significant ineffectiveness and paves the way for human mistakes. Firmly developing the two professions provides an opportunity for companies to gain aggressive advantage. Database and Marketing Data mining allows promotion customers to focus on promotion strategies more accurately; and also to position strategies more carefully with the needs, wants, and behavior of clients and leads. If the necessary information prevails in a data source, the data mining procedure can design almost any client action. The key is to find styles appropriate to present business problems. Typical concerns that data mining details consist of the following: Which clients are most likely to fall their mobile phone service? What is the possibility that a client will purchase at least $100 value of products from a particular mail-order catalog? Which leads are most likely to reply to a particular offer? Solutions to these concerns can help maintain clients and improve strategy reaction prices, which, in convert, improve purchasing, cross-selling, and revenue

HAVE KNOWLEDGE IN DATABASES Data mining and details development in directories have been gaining a lot of research, market, and press attention of delayed. Data mining methods have progressively been analyzed, especially in their program in real-world directories. One common problem is that directories are generally very huge, and these methods often regularly check out the entire set. Choosing has been used for years, but simple variations among places of things become less obvious. A level of the current attention in Knowledge in directories is the result of the press attention around effective Knowledge in directories programs. Unfortunately, it is not always easy to individual fact from press buzz. However, several well recorded illustrations of effective techniques can appropriately be termed as Knowledge in directories programs and have been implemented in functional use on large-scale realworld problems in technology and in business. Historically, the idea of finding useful styles in details has been given a variety of titles, such as data mining, details removal, details development, details growing, details the archaeology of gortyn, and details routine handling. The term data mining has mostly been used by statisticians, details experts, and the management computer areas. It has also become popular in the data source area. The term details development in directories was created at the first Knowledge in directories class in to highlight that details is the end product of a data-driven development. It has been made popular in the AI and machine-learning areas. In our view, Knowledge in directories is

the term for the overall procedure of finding useful details from details, and data mining is the term for a particular phase in this procedure. data mining is use of specific methods for getting styles from details. The difference between the Knowledge in directories procedure and the data-mining phase (within the process) is a main point of this article. The data miningitional steps in the Knowledge in directories procedure, such as details planning, details selection, details cleaning, development of appropriate details, and proper presentation of the outcomes of exploration, are essential to ensure that useful details is resulting from the details. Sightless program of data-mining methods (rightly belittled as details dredging in the mathematical literature) can be a risky action, easily resulting in the development of useless and incorrect styles. How does someone actually use the outcome of data mining? The easiest way is to keep the outcome in the form of a dark box. If they take the dark box and ranking a data source, they can get a list of customers to focus on (send them a collection, increase their borrowing restrict, etc.). There's not much for the individual to do other than sit back and watch the covers go out. This can be a very effective strategy. Emailing costs can often be decreased by an order of range without considerably decreasing the reaction rate. Then there's the more difficult way to use the outcomes of data mining: getting the individual to actually understand what is going on so that they can take action directly

DATA MINING IN SCIENTIFIC ENGINEERING APPLICATIONS Due to developments in it and powerful processing, very huge details places are becoming available in many healthcare professions. The amount of development of such details far surpasses our capability to evaluate them personally. For example, a computational simulation can generate tera-bytes of details within a few hours, whereas human analysts may take several weeks to evaluate these details places. Other for example several digital sky surveys and details place from the fields of healthcare picture, bioinformatics, and remote sensing. As a result, there is an increasing interest in various healthcare communities to explore the use of emerging data mining methods for the research of these huge details places. Data mining is the semi-automatic development of styles, organizations, changes, flaws, and mathematically considerable components and activities in details. Conventional details research is supposition motivated as a speculation is established and verified against the details. data mining, in comparison, is development motivated as the styles are instantly produced from details. The objective of the guide is to offer scientists and experts in the area of Supercomputing with presenting data mining and its program to several healthcare and technological innovation websites, such as astrophysics, healthcare picture, computational liquid characteristics, architectural methods, and ecosystem.

Due to developments in it and powerful processing, very huge details places are becoming available in many healthcare professions. The amount of development of such details far surpasses our capability to evaluate them personally. s. In most research in the sciences, one compares recorded details with a concept that is founded on an analytic expression of actual laws. The success or otherwise of the comparison is a test of the speculation of how nature works expressed as a statistical formula. This might be something essential like an inverse square law. Alternatively, fitting a statistical model to the details might determine actual parameters (such as a refractive index). On the other hand, where there are no general theories, data mining methods are valuable, especially where one has bulk of details containing noisy styles. This approach hopes to obtain a theoretical generalization instantly from the details by means of induction, deriving empirical models and learning from illustrations. The resultant concept, while maybe not essential, can yield a good understanding of the actual process and can have great practical utility Data mining is the semi-automatic development of styles, organizations, changes, flaws, and mathematically considerable components and activities in details. Conventional details research is supposition motivated as a speculation is established and verified against the details. data mining, in comparison, is development motivated as the styles are instantly produced from details. The objective of the guide is to offer scientists and experts in the area of Supercomputing with presenting data mining and its program to several healthcare and technological innovation websites, such as

astrophysics, healthcare picture, computational liquid characteristics, architectural methods, and ecosystem. SUMMARY Information exploration is growing as one of the key features of many birthplace protection projects. Often used as a way for discovering scams, evaluating risk, and product offering, data exploration includes the use of information research resources to discover formerly unidentified, legitimate styles and relationships in large data places. In the perspective of birthplace protection, data exploration is often considered as a potential method for recognize enemy actions, such as money exchanges and devices, and to recognize and track individual terrorists themselves, such as through travel and migrants information. Is data exploration as useful in technology as in commerce? Certainly, data exploration in technology has much in common with that for business data. One difference, though, is that there is a lot of current medical concept and information. Hence, there is less chance of information growing simply from data. However, medical outcomes can be attractive technology (especially where it boundaries on engineering) as in indicating causality relationships or for acting complicated phenomena. On the other hand, medical guidelines or regulations are, in concept, testable logically. Any outcomes from data exploration techniques must sit within the current sector information. Hence, the participation of a sector professional is essential to the

information exploration procedure. Naïve data exploration often results in obvious outcomes. The task is to integrate guidelines known a priori into the medical introduction, keeping in mind that the whole Knowledge in database procedure is exploratory and repetitive. While data exploration symbolizes a considerable advance in the type of systematic resources currently available, there are restrictions to its ability. One restriction is that although data exploration can help expose styles and relationships, it does not tell the individual the value or importance of these styles. These types of determinations must be made by the individual. A second restriction is that while data exploration can recognize relationships between habits and/or factors, it does not actually recognize a causal connection. To be effective, data exploration still needs experienced specialized and systematic professionals who can framework the research and understand the outcome that is designed.