INTRODUCTION TO MACHINE LEARNING 3RD EDITION



Similar documents
Obtaining Value from Big Data

Machine Learning. CS494/594, Fall :10 AM 12:25 PM Claxton 205. Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004

MA2823: Foundations of Machine Learning

Lecture Slides for INTRODUCTION TO. ETHEM ALPAYDIN The MIT Press, Lab Class and literature. Friday, , Harburger Schloßstr.

Machine Learning Introduction

Machine Learning and Statistics: What s the Connection?

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Introduction to Machine Learning Using Python. Vikram Kamath

Machine Learning: Overview

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Data Mining Techniques in CRM

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Introduction to Learning & Decision Trees

Learning is a very general term denoting the way in which agents:

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

Introduction to Data Mining

Introduction to Pattern Recognition

What is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO

CSE 517A MACHINE LEARNING INTRODUCTION

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

CS 6220: Data Mining Techniques Course Project Description

Foundations of Business Intelligence: Databases and Information Management

Data Mining: Overview. What is Data Mining?

Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

Data Warehousing and Data Mining for improvement of Customs Administration in India. Lessons learnt overseas for implementation in India

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Data Mining Applications in Manufacturing

: Introduction to Machine Learning Dr. Rita Osadchy

Data Mining Solutions for the Business Environment

Data Mining System, Functionalities and Applications: A Radical Review

Title. Introduction to Data Mining. Dr Arulsivanathan Naidoo Statistics South Africa. OECD Conference Cape Town 8-10 December 2010.

Management Decision Making. Hadi Hosseini CS 330 David R. Cheriton School of Computer Science University of Waterloo July 14, 2011

Data Mining for Fun and Profit

Research-based Learning (RbL) in Computing Courses for Senior Engineering Students

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Data, Measurements, Features

Introduction to Data Mining

TIETS34 Seminar: Data Mining on Biometric identification

Limitations of Human Vision. What is computer vision? What is computer vision (cont d)?

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

NEURAL NETWORKS IN DATA MINING

Learning outcomes. Knowledge and understanding. Competence and skills

Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal

Doctor of Philosophy in Computer Science

Using Artificial Intelligence to Manage Big Data for Litigation

Data Mining: Motivations and Concepts

Information Management course

B2B opportunity predictiona Big Data and Advanced. Analytics Approach. Insert

Big Data Analytics for SCADA

Discovering, Not Finding. Practical Data Mining for Practitioners: Level II. Advanced Data Mining for Researchers : Level III

1 What is Machine Learning?

MBA Data Mining & Knowledge Discovery

ADVANCED MACHINE LEARNING. Introduction

Maschinelles Lernen mit MATLAB

Data Mining + Business Intelligence. Integration, Design and Implementation

TDA and Machine Learning: Better Together

In this presentation, you will be introduced to data mining and the relationship with meaningful use.

CREDIT CARD FRAUD DETECTION SYSTEM USING GENETIC ALGORITHM

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

Applications of Deep Learning to the GEOINT mission. June 2015

DATA MINING TECHNIQUES AND APPLICATIONS

Machine Learning for Data Science (CS4786) Lecture 1

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Foundations of Artificial Intelligence. Introduction to Data Mining

Steven C.H. Hoi School of Information Systems Singapore Management University

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Digital and Big Data Opportunities in Credit Risk. Banking Congress Warsaw, October 2015

8. Machine Learning Applied Artificial Intelligence

MS1b Statistical Data Mining

Machine Learning using MapReduce

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining

Knowledge Discovery from Data Bases Proposal for a MAP-I UC

Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

Azure Machine Learning, SQL Data Mining and R

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?

TNS EX A MINE BehaviourForecast Predictive Analytics for CRM. TNS Infratest Applied Marketing Science

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

Business Intelligence and Decision Support Systems

Synthetic Aperture Radar: Principles and Applications of AI in Automatic Target Recognition

from Larson Text By Susan Miertschin

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Introduction to Data Mining

Course 395: Machine Learning

Master's projects at ITMO University. Daniil Chivilikhin PhD ITMO University

Introduction. A. Bellaachia Page: 1

Transcription:

ETHEM ALPAYDIN The MIT Press, 2014 Lecture Slides for INTRODUCTION TO MACHINE LEARNING 3RD EDITION alpaydin@boun.edu.tr http://www.cmpe.boun.edu.tr/~ethem/i2ml3e

CHAPTER 1: INTRODUCTION

Big Data 3 Widespread use of personal computers and wireless communication leads to big data We are both producers and consumers of data Data is not random, it has structure, e.g., customer behavior We need big theory to extract that structure from data for (a) Understanding the process (b) Making predictions for the future

Why Learn? 4 Machine learning is programming computers to optimize a performance criterion using example data or past experience. There is no need to learn to calculate payroll Learning is used when: Human expertise does not exist (navigating on Mars), Humans are unable to explain their expertise (speech recognition) Solution changes in time (routing on a computer network) Solution needs to be adapted to particular cases (user biometrics)

5 What We Talk About When We Talk About Learning Learning general models from a data of particular examples Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce. Example in retail: Customer transactions to consumer behavior: People who bought Blink also bought Outliers (www.amazon.com) Build a model that is a good and useful approximation to the data.

Data Mining 6 Retail: Market basket analysis, Customer relationship management (CRM) Finance: Credit scoring, fraud detection Manufacturing: Control, robotics, troubleshooting Medicine: Medical diagnosis Telecommunications: Spam filters, intrusion detection Bioinformatics: Motifs, alignment Web mining: Search engines...

What is Machine Learning? 7 Optimize a performance criterion using example data or past experience. Role of Statistics: Inference from a sample Role of Computer science: Efficient algorithms to Solve the optimization problem Representing and evaluating the model for inference

Applications 8 Association Supervised Learning Classification Regression Unsupervised Learning Reinforcement Learning

Learning Associations 9 Basket analysis: P (Y X ) probability that somebody who buys X also buys Y where X and Y are products/services. Example: P ( chips beer ) = 0.7

Classification 10 Example: Credit scoring Differentiating between low-risk and high-risk customers from their income and savings Discriminant: IF income > θ 1 AND savings > θ 2 THEN low-risk ELSE high-risk

Classification: Applications 11 Aka Pattern recognition Face recognition: Pose, lighting, occlusion (glasses, beard), make-up, hair style Character recognition: Different handwriting styles. Speech recognition: Temporal dependency. Medical diagnosis: From symptoms to illnesses Biometrics: Recognition/authentication using physical and/or behavioral characteristics: Face, iris, signature, etc Outlier/novelty detection:

Face Recognition 12 Training examples of a person Test images ORL dataset, AT&T Laboratories, Cambridge UK

Regression Example: Price of a used car x : car attributes y = wx+w 0 y : price y = g (x q ) g ( ) model, q parameters 13

Regression Applications 14 Navigating a car: Angle of the steering Kinematics of a robot arm (x,y) α 1 = g 1 (x,y) α 2 = g 2 (x,y) α 2 α 1 Response surface design

Supervised Learning: Uses 15 Prediction of future cases: Use the rule to predict the output for future inputs Knowledge extraction: The rule is easy to understand Compression: The rule is simpler than the data it explains Outlier detection: Exceptions that are not covered by the rule, e.g., fraud

Unsupervised Learning 16 Learning what normally happens No output Clustering: Grouping similar instances Example applications Customer segmentation in CRM Image compression: Color quantization Bioinformatics: Learning motifs

Reinforcement Learning 17 Learning a policy: A sequence of outputs No supervised output but delayed reward Credit assignment problem Game playing Robot in a maze Multiple agents, partial observability,...

Resources: Datasets 18 UCI Repository: http://www.ics.uci.edu/~mlearn/mlrepository.html Statlib: http://lib.stat.cmu.edu/

Resources: Journals 19 Journal of Machine Learning Research www.jmlr.org Machine Learning Neural Computation Neural Networks IEEE Trans on Neural Networks and Learning Systems IEEE Trans on Pattern Analysis and Machine Intelligence Journals on Statistics/Data Mining/Signal Processing/Natural Language Processing/Bioinformatics/...

Resources: Conferences 20 International Conference on Machine Learning (ICML) European Conference on Machine Learning (ECML) Neural Information Processing Systems (NIPS) Uncertainty in Artificial Intelligence (UAI) Computational Learning Theory (COLT) International Conference on Artificial Neural Networks (ICANN) International Conference on AI & Statistics (AISTATS) International Conference on Pattern Recognition (ICPR)...