Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing



Similar documents
Introduction to Data Mining

Data Mining + Business Intelligence. Integration, Design and Implementation

Introduction to Data Mining

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Database Marketing, Business Intelligence and Knowledge Discovery

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Information Management course

Data Warehousing and Data Mining in Business Applications

Chapter ML:XI. XI. Cluster Analysis

Data Mining Solutions for the Business Environment

What is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO

Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction

Data Mining: An Introduction

A Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1

A Review of Data Mining Techniques

Introduction. A. Bellaachia Page: 1

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

Data Mining Part 5. Prediction

Data Mining Fundamentals

from Larson Text By Susan Miertschin

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

Federico Rajola. Customer Relationship. Management in the. Financial Industry. Organizational Processes and. Technology Innovation.

Data Mining Algorithms Part 1. Dejan Sarka

TIETS34 Seminar: Data Mining on Biometric identification

Foundations of Artificial Intelligence. Introduction to Data Mining

The University of Jordan

DATA MINING AND WAREHOUSING CONCEPTS

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

Data Mining Analytics for Business Intelligence and Decision Support

An Overview of Database management System, Data warehousing and Data Mining

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

An Overview of Knowledge Discovery Database and Data mining Techniques

not possible or was possible at a high cost for collecting the data.

Overview. Background. Data Mining Analytics for Business Intelligence and Decision Support

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Data Warehousing and Data Mining

DBTech Pro Workshop. Knowledge Discovery from Databases (KDD) Including Data Warehousing and Data Mining. Georgios Evangelidis

An Introduction to Advanced Analytics and Data Mining

Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Rhodes University COMPUTER SCIENCE HONOURS PROJECT Literature Review. Data Mining with Oracle 10g using Clustering and Classification Algorithms

BUSINESS ANALYTICS. Overview. Lecture 0. Information Systems and Machine Learning Lab. University of Hildesheim. Germany

Data Mining for Fun and Profit

Statistics for BIG data

Data Mining Applications in Higher Education

Discovering, Not Finding. Practical Data Mining for Practitioners: Level II. Advanced Data Mining for Researchers : Level III

Machine Learning with MATLAB David Willingham Application Engineer

Importance or the Role of Data Warehousing and Data Mining in Business Applications

Pentaho Data Mining Last Modified on January 22, 2007

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Data Mining and Machine Learning in Bioinformatics

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Title. Introduction to Data Mining. Dr Arulsivanathan Naidoo Statistics South Africa. OECD Conference Cape Town 8-10 December 2010.

SPATIAL DATA CLASSIFICATION AND DATA MINING

RESEARCH PAPERS FACULTY OF MATERIALS SCIENCE AND TECHNOLOGY IN TRNAVA SLOVAK UNIVERSITY OF TECHNOLOGY IN BRATISLAVA

An Introduction to Data Mining

Data Mining Techniques in CRM

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI

Data Mining Techniques and Opportunities for Taxation Agencies

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Using Data Mining for Mobile Communication Clustering and Characterization

EFFECTIVE USE OF THE KDD PROCESS AND DATA MINING FOR COMPUTER PERFORMANCE PROFESSIONALS

Hexaware E-book on Predictive Analytics

Data Warehousing and Data Mining

Azure Machine Learning, SQL Data Mining and R

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?

Foundations of Business Intelligence: Databases and Information Management

Integrated Data Mining and Knowledge Discovery Techniques in ERP

Fluency With Information Technology CSE100/IMT100

How Organisations Are Using Data Mining Techniques To Gain a Competitive Advantage John Spooner SAS UK

Data Mining Techniques

Data Analysis. Management Information Systems 13

BIG DATA What it is and how to use?

Introduction to Data Mining Principles

Data Mining: Overview. What is Data Mining?

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

Data Mining for Successful Healthcare Organizations

Data Mining with SQL Server Data Tools

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Use of Data Mining in the field of Library and Information Science : An Overview

ISSN: (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

Data Mining System, Functionalities and Applications: A Radical Review

DATA MINING - SELECTED TOPICS

Maschinelles Lernen mit MATLAB

Foundations of Business Intelligence: Databases and Information Management

TIM 50 - Business Information Systems

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

HiBench Installation. Sunil Raiyani, Jayam Modi

Knowledge Acquisition in Databases

Transcription:

Introduction to Data Mining and Machine Learning Techniques Iza Moise, Evangelos Pournaras, Dirk Helbing Iza Moise, Evangelos Pournaras, Dirk Helbing 1

Overview Main principles of data mining Definition Steps of a data mining process Supervised vs. unsupervised data mining Applications Data mining functionalities Iza Moise, Evangelos Pournaras, Dirk Helbing 2

Definition Data mining is the automated process of discovering interesting (non-trivial, previously unknown, insightful and potentially useful) information or patterns, as well as descriptive, understandable, and predictive models from large-scale data. Also known as: Knowledge discovery in databases (KDD) Data analysis Information harvesting Business intelligence Iza Moise, Evangelos Pournaras, Dirk Helbing 3

Goals search consistent patterns and/or systemic relationships between data validate the findings by applying the detected patterns to new subsets of data Iza Moise, Evangelos Pournaras, Dirk Helbing 4

Data Mining is... k-means clustering decision trees neural networks Bayesian networks Iza Moise, Evangelos Pournaras, Dirk Helbing 5

Data Mining is not... Data warehousing SQL / Ad Hoc Queries / Reporting Software Agents Online Analytical Processing (OLAP) Data Visualization Iza Moise, Evangelos Pournaras, Dirk Helbing 6

Knowledge Discovery Process Iza Moise, Evangelos Pournaras, Dirk Helbing 7

Steps of a data mining process 1. Exploration 2. Model building and validation 3. Deployment Iza Moise, Evangelos Pournaras, Dirk Helbing 8

Exploration data preparation data cleaning, data transformation etc. elaborate exploratory analyses using a wide variety of graphical and statistical methods, depending on the nature of the analytic problem determine the general nature of models that can be taken into account in the next stage Iza Moise, Evangelos Pournaras, Dirk Helbing 9

Model building and validation, and Deployment Model building and validation: consider various models and choose the best one validate the model on existing data Deployment: apply the model to new data Iza Moise, Evangelos Pournaras, Dirk Helbing 10

Model building and validation, and Deployment Model building and validation: consider various models and choose the best one validate the model on existing data Deployment: apply the model to new data Iza Moise, Evangelos Pournaras, Dirk Helbing 10

Model building and validation, and Deployment Model building and validation: consider various models and choose the best one validate the model on existing data Deployment: apply the model to new data Iza Moise, Evangelos Pournaras, Dirk Helbing 10

Two styles of data mining: Predictive and Descriptive 1. Predictive Predict the value of a specific attribute (target or dependent) based on the value of other attributes (explanatory) 2. Descriptive Derive patterns that summarise the relationships between data Iza Moise, Evangelos Pournaras, Dirk Helbing 11

Supervised vs. Unsupervised Supervised predictive or directed useful when you have a specific target value to predict about your data Classification, Regression, Anomaly Detection Unsupervised descriptive or undirected finds hidden structure and relation within the data Clustering, Association, Feature Extraction Iza Moise, Evangelos Pournaras, Dirk Helbing 12

Supervised Data Mining a pre-specified target variable explanatory variables one (or more) dependent variables the algorithm is given many training data where the target is known the algorithm identifies the new data points that match the model of each target value Iza Moise, Evangelos Pournaras, Dirk Helbing 13

Unsupervised Data Mining determine the existence of classes or clusters in the data exploratory analysis all variable are treated in the same way Iza Moise, Evangelos Pournaras, Dirk Helbing 14

Overview Main principles of data mining Definition Steps of a data mining process Supervised vs. unsupervised data mining Applications Data mining functionalities Iza Moise, Evangelos Pournaras, Dirk Helbing 15

Fielded Applications Web mining PageRank, Search Engine, Recommenders, Social Networks Screening Images Diagnosis Marketing and Sales Market basket analysis, Direct marketing Biology, biomedicine, astronomy, chemistry Iza Moise, Evangelos Pournaras, Dirk Helbing 16

Beers and Diapers Iza Moise, Evangelos Pournaras, Dirk Helbing 17

Overview Main principles of data mining Definition Steps of a data mining process Supervised vs. unsupervised data mining Applications Data mining functionalities Iza Moise, Evangelos Pournaras, Dirk Helbing 18

1. Supervised Data Mining Classification Regression Outlier detection Frequent pattern mining 2. Unsupervised Data Mining Clustering Feature Extraction definition real use-cases method pros and cons Iza Moise, Evangelos Pournaras, Dirk Helbing 19

1. Supervised Data Mining Classification Regression Outlier detection Frequent pattern mining 2. Unsupervised Data Mining Clustering Feature Extraction definition real use-cases method pros and cons Iza Moise, Evangelos Pournaras, Dirk Helbing 19