Data Mining. Shahram Hassas Math 382 Professor: Shapiro



Similar documents
A Review of Data Mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Introduction to Data Mining

Session 10 : E-business models, Big Data, Data Mining, Cloud Computing

Fluency With Information Technology CSE100/IMT100

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users

DATA MINING TECHNIQUES AND APPLICATIONS

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

Data Mining for Successful Healthcare Organizations

Machine Learning and Data Mining. Fundamentals, robotics, recognition

A Survey on Web Research for Data Mining

Database Marketing, Business Intelligence and Knowledge Discovery

Statistics for BIG data

Data Mining Solutions for the Business Environment

A HOLISTIC FRAMEWORK FOR KNOWLEDGE MANAGEMENT

Data Mining Techniques

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Data Mining and Machine Learning in Bioinformatics

Introduction. A. Bellaachia Page: 1

Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1

Data Mining. Anyone can tell you that it takes hard work, talent, and hours upon hours of

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

Information Management course

Foundations of Business Intelligence: Databases and Information Management

Introduction to Data Mining

Data Mining & Knowledge Discovery: Personalization and Profiling Technologies

An Introduction to Data Mining

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

Data Warehousing and Data Mining in Business Applications

Chapter 20: Data Analysis

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

Using Data Mining for Mobile Communication Clustering and Characterization

2.1. Data Mining for Biomedical and DNA data analysis

An Introduction to Advanced Analytics and Data Mining

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Using Artificial Intelligence to Manage Big Data for Litigation

Data Mining Applications in Higher Education

Customer Classification And Prediction Based On Data Mining Technique

Data Mining. Craig Chomsky, Marek Dvorak

Data Isn't Everything

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Subject Description Form

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

MA2823: Foundations of Machine Learning

Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier

Federico Rajola. Customer Relationship. Management in the. Financial Industry. Organizational Processes and. Technology Innovation.

A Survey on Intrusion Detection System with Data Mining Techniques

Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology Madras

Data Mining System, Functionalities and Applications: A Radical Review

Data, Measurements, Features

not possible or was possible at a high cost for collecting the data.

Data Mining Part 5. Prediction

A Survey on Data Mining with Big data - Applications, Techniques, Tools, Challenges and Visualization

Foundations of Artificial Intelligence. Introduction to Data Mining

Web Usage Mining: Identification of Trends Followed by the user through Neural Network

DATA PREPARATION FOR DATA MINING

MS1b Statistical Data Mining

Data Mining: An Introduction

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT

International Journal of World Research, Vol: I Issue XIII, December 2008, Print ISSN: X DATA MINING TECHNIQUES AND STOCK MARKET

Application of Business Intelligence in Transportation for a Transportation Service Provider

Pentaho Data Mining Last Modified on January 22, 2007

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

III JORNADAS DE DATA MINING

Chapter 12 Discovering New Knowledge Data Mining

How To Use Neural Networks In Data Mining

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Building Data Warehousing and Data Mining from Course Management Systems: A Case Study of FUTA Course Management Information Systems

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

Data Mining for Retail Website Design and Enhanced Marketing

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

Comparison of Data Mining Techniques used for Financial Data Analysis

A New Approach for Evaluation of Data Mining Techniques

INVESTIGATIONS INTO EFFECTIVENESS OF GAUSSIAN AND NEAREST MEAN CLASSIFIERS FOR SPAM DETECTION

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?

Sanjeev Kumar. contribute

Data Mining for Fun and Profit

Data Mining and Neural Networks in Stata

Learning is a very general term denoting the way in which agents:

Nagarjuna College Of

Outline. What is Big data and where they come from? How we deal with Big data?

Intrusion Detection via Machine Learning for SCADA System Protection

Data Mining. Concepts, Models, Methods, and Algorithms. 2nd Edition

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

from Larson Text By Susan Miertschin

Transcription:

Data Mining Shahram Hassas Math 382 Professor: Shapiro

Agenda Introduction Major Elements Steps/ Processes Examples Tools used for data mining Advantages and Disadvantages

What is Data Mining? Described as the method of comparing large volumes of data looking for more information from a data Data Mining is the process of analyzing data from different perspectives and summarizing it into useful information which can be used to increase revenue, and cut costs.

What is Data Mining? Data mining is the results of Classical statistics, artificial intelligence and machine learning. Classical statistic plays the main role in data mining. Artificial intelligence applies human interest processing to statistical problems.

What is it used for? To assist in the analysis of collections of observations and finding correlations among all of the fields and relationship between the facts in databases.

Examples grocery chain stores discovered that when men bought diapers on Thursdays and Saturdays, they also attempt to buy beer. It is also showed that these shoppers typically did their weekly grocery shopping on Saturdays and only bought a few items on Thursdays. The grocery chain used this newly discovered information in various ways to increase revenue for instance; they moved the beers closer to the diapers and, they could make sure beer and diapers were sold at full price on Thursdays.

Data mining consists of five major elements: Extract, transform, and load transaction data onto the data warehouse system. Store and manage the data in a multidimensional database system. Provide data access to business analysts and information technology professionals. Analyze the data by application software. Present the data in a useful format, such as a graph or table.

Data Mining Goal Data mining application goals are predictions, and it allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships by exploration to estimate the expected result(s).

Examples The National Basketball Association (NBA) is exploring a data mining application that can combine and analyze the records of basketball games by using advanced Scout software. This program helps to find patterns derived from game statistics, images, and the movements of the players themselves.

3 Steps Data Mining Process Stage 1: Exploration. This stage usually starts with data preparation which may involve cleaning data, data transformations, selecting subsets of records Stage 2: Model building and validation. This stage involves considering various models and choosing the best one based on their predictive performance Stage 3: Deployment. That final stage involves using the model selected as best in the previous stage and applying it to new data in order to generate predictions or estimates of the expected outcome

Some of the tools used for data mining are: Artificial neural networks - Non-linear predictive models that learn through training and resemble biological neural networks in structure. Decision trees - Tree-shaped structures that represent sets of decisions. These decisions generate rules for the classification of a dataset. Rule induction - The extraction of useful if-then rules from data based on statistical significance. Genetic algorithms - Optimization techniques based on the concepts of genetic combination, mutation, and natural selection. Nearest neighbor - A classification technique that classifies each record based on the records most similar to it in an historical database.

Reasons for the growing popularity of Data Mining Growing Data Volume Limitations of Human Analysis Cost and efficiency compared to other Statistical data applications

ADVANTAGES OF DATA MINING Data mining has been used in area of science and engineering, such as bioinformatics, genetics, medicine, education, agricultural, electrical power engineering, and law enforcement

ADVANTAGES OF DATA MINING In the field of human genetics, usage of data mining attempts to find out how the changes in an individual's DNA sequence affect the risk of developing common diseases such as cancer.

ADVANTAGES OF DATA MINING Law enforcement: Data mining can aid law enforcers in identifying criminal suspects. Electrical power Engineering: data mining techniques have been used for condition monitoring of high voltage electrical equipment to obtain valuable information on the insulation's health status of the equipment

DISADVANTAGES OF DATA MINING Privacy Issues: For example, according to the Washington Post, in 1998, CVS had sold their patient s prescription purchases to a different company American Express also sold their customers credit card purchases to another company.

DISADVANTAGES OF DATA MINING Cont Security issues: Although companies have a lot of personal information about us available online, they do not have sufficient security systems in place to protect that information. Misuse of information: Some of the companies will answer your phone call based on your purchase history. If you have spent a lot of money or bought a lot of products from one company, your call will be answered really soon. So you should not think that your call is really being answered in the order in which it was received.

References: http://en.wikipedia.org/wiki/data_mining http://www.anderson.ucla.edu/faculty/jason.frand/teacher/technologies/p alace/datamining.htm/data_mining.htm, http://www.anderseon.ucla.edu/faculty/jason.frand/teacher/technologies/ palace. Palace, Bill. Data Mining. http://www.niehs.nih.gov/research/resource/databases/gac/guid.cfm