INTRODUCTION TO MACHINE LEARNING

Similar documents
INTRODUCTION TO MACHINE LEARNING 3RD EDITION

Obtaining Value from Big Data

MA2823: Foundations of Machine Learning

Machine Learning. CS494/594, Fall :10 AM 12:25 PM Claxton 205. Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Lecture Slides for INTRODUCTION TO. ETHEM ALPAYDIN The MIT Press, Lab Class and literature. Friday, , Harburger Schloßstr.

Machine Learning Introduction

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Introduction to Learning & Decision Trees

Machine Learning: Overview

Learning is a very general term denoting the way in which agents:

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Data, Measurements, Features

Introduction to Pattern Recognition

Title. Introduction to Data Mining. Dr Arulsivanathan Naidoo Statistics South Africa. OECD Conference Cape Town 8-10 December 2010.

TIETS34 Seminar: Data Mining on Biometric identification

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal

Machine Learning using MapReduce

Machine Learning for Data Science (CS4786) Lecture 1

Information Management course

Ming-Wei Chang. Machine learning and its applications to natural language processing, information retrieval and data mining.

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

Chapter 12 Discovering New Knowledge Data Mining

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Knowledge-based systems and the need for learning

OUTLIER ANALYSIS. Data Mining 1

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

Introduction to Machine Learning Using Python. Vikram Kamath

Attribution. Modified from Stuart Russell s slides (Berkeley) Parts of the slides are inspired by Dan Klein s lecture material for CS 188 (Berkeley)

UK PhD Centre for Financial Computing

Doctor of Philosophy in Computer Science

Eastern Washington University Department of Computer Science. Questionnaire for Prospective Masters in Computer Science Students

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

: Introduction to Machine Learning Dr. Rita Osadchy

Data Analytics at NICTA. Stephen Hardy National ICT Australia (NICTA)

Machine Learning with MATLAB David Willingham Application Engineer

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Steven C.H. Hoi School of Information Systems Singapore Management University

Machine Learning and Statistics: What s the Connection?

Management Decision Making. Hadi Hosseini CS 330 David R. Cheriton School of Computer Science University of Waterloo July 14, 2011

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

Master of Science in Computer Science

What is Learning? CS 391L: Machine Learning Introduction. Raymond J. Mooney. Classification. Problem Solving / Planning / Control

The Data Mining Process

Role Description. Position of a Data Scientist Machine Learning at Fractal Analytics

Principles of Dat Da a t Mining Pham Tho Hoan hoanpt@hnue.edu.v hoanpt@hnue.edu. n

Computer Information Systems

Classification algorithm in Data mining: An Overview

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)

Bowdoin Computer Science

Introduction to Data Mining

How To Create A Text Classification System For Spam Filtering

Non-negative Matrix Factorization (NMF) in Semi-supervised Learning Reducing Dimension and Maintaining Meaning

Department of CSE. Jaypee University of Information Technology, Waknaghat. Course Curricula

An Overview of Knowledge Discovery Database and Data mining Techniques

Azure Machine Learning, SQL Data Mining and R

Machine Learning. CS 188: Artificial Intelligence Naïve Bayes. Example: Digit Recognition. Other Classification Tasks

A.I. in health informatics lecture 1 introduction & stuff kevin small & byron wallace

Course Requirements for the Ph.D., M.S. and Certificate Programs

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Course Requirements for the Ph.D., M.S. and Certificate Programs

Structure of Postgraduate Programs (2005)

CSE 517A MACHINE LEARNING INTRODUCTION

Course 395: Machine Learning

Data Mining and Machine Learning in Bioinformatics

ADVANCED MACHINE LEARNING. Introduction

Principles of Data Mining by Hand&Mannila&Smyth

Challenges and Opportunities in Data Mining: Personalization

6.2.8 Neural networks for data mining

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

Big learning: challenges and opportunities

Lecture 6. Artificial Neural Networks

Learning to Process Natural Language in Big Data Environment

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Introduction to Data Mining

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

Lecture 1: Introduction to Reinforcement Learning

Data mining for prediction

Finance & CS. Banks. Consumer/Personal Loans. Deposits vs Loans. Credit Cards. Commercial/Business Loans

Supervised Learning (Big Data Analytics)

Master s Program in Information Systems

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Can we Analyze all the Big Data we Collect?

The Need for Training in Big Data: Experiences and Case Studies

Data Mining System, Functionalities and Applications: A Radical Review

MACHINE LEARNING BASICS WITH R

Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

1 What is Machine Learning?

Pattern-Aided Regression Modelling and Prediction Model Analysis

Statistics Graduate Courses

The University of Jordan

Simple and efficient online algorithms for real world applications

Method of Combining the Degrees of Similarity in Handwritten Signature Authentication Using Neural Networks

Data Mining Fundamentals

Transcription:

Why are you here? What is Machine Learning? Why are you taking this course? INTRODUCTION TO MACHINE LEARNING David Kauchak CS 451 Fall 2013 What topics would you like to see covered? Machine Learning is Machine Learning is Machine learning, a branch of artificial intelligence, concerns the construction and study of systems that can learn from data. Machine learning is programming computers to optimize a performance criterion using example data or past experience. -- Ethem Alpaydin The goal of machine learning is to develop methods that can automatically detect patterns in data, and then to use the uncovered patterns to predict future data or other outcomes of interest. -- Kevin P. Murphy The field of pattern recognition is concerned with the automatic discovery of regularities in data through the use of computer algorithms and with the use of these regularities to take actions. -- Christopher M. Bishop 1

Machine Learning is Machine Learning is Machine learning is about predicting the future based on the past. -- Hal Daume III Machine learning is about predicting the future based on the past. -- Hal Daume III past Training learn model/ predictor future Testing model/ predictor predict Machine Learning, aka data mining: machine learning applied to databases, i.e. collections of data inference and/or estimation in statistics pattern recognition in engineering signal processing in electrical engineering induction optimization Goals of the course: Learn about Different machine learning problems Common techniques/tools used theoretical understanding practical implementation Proper experimentation and evaluation Dealing with large (huge) data sets Parallelization frameworks Programming tools 2

Goals of the course Administrative Course page: http://www.cs.middlebury.edu/~dkauchak/classes/cs451/ go/cs451 Assignments Weekly Mostly programming (Java, mostly) Some written/write-up Generally due Friday evenings Two exams Late Policy Be able to laugh at these signs (or at least know why one might) Honor code Course expectations 400-level course Plan to stay busy! Machine learning problems What high-level machine learning problems have you seen or heard of before? Applied class, so lots of programming Machine learning involves math 3

4

Supervised learning Supervised learning 1 3 ed 1 3 model/ predictor 4 4 5 5 Supervised learning: given ed Supervised learning: given ed Supervised learning Supervised learning: classification apple model/ predictor predicted apple Classification: a finite set of s banana banana Supervised learning: learn to predict new example Supervised learning: given ed 5

Classification Example Classification Applications Face recognition Differentiate between low-risk and high-risk customers from their income and savings Character recognition Spam detection Medical diagnosis: From symptoms to illnesses Biometrics: Recognition/authentication using physical and/ or behavioral characteristics: Face, iris, signature, etc... Supervised learning: regression Regression Example -4.5 Price of a used car 10.1 3.2 Regression: is real-valued x : car attributes (e.g. mileage) y : price y = wx+w 0 4.3 Supervised learning: given ed 24 6

Regression Applications Supervised learning: ranking Economics/Finance: predict the value of a stock Epidemiology Car/plane navigation: angle of the steering wheel, acceleration, Temporal trends: weather over time 1 4 2 3 Ranking: is a ranking Supervised learning: given ed Ranking example Given a query and a set of web pages, rank them according to relevance Ranking Applications User preference, e.g. Netflix My List -- movie queue ranking itunes flight search (search in general) reranking N-best output lists 7

Unsupervised learning Unsupervised learning applications learn clusters/groups without any customer segmentation (i.e. grouping) image compression bioinformatics: learn motifs Unupervised learning: given data, i.e., but no s Reinforcement learning Reinforcement learning example left, right, straight, left, left, left, straight GOOD Backgammon left, straight, straight, left, right, straight, straight BAD WIN! left, right, straight, left, left, left, straight left, straight, straight, left, right, straight, straight 18.5 Given a sequence of /states and a reward after completing that sequence, learn to predict the action to take in for an individual example/state -3 LOSE! Given sequences of moves and whether or not the player won at the end, learn to make good moves 8

Reinforcement learning example Other learning variations What data is available: n Supervised, unsupervised, reinforcement learning n semi-supervised, active learning, How are we getting the data: n online vs. offline learning http://www.youtube.com/watch?v=vcdxqn0fcne Type of model: n generative vs. discriminative n parametric vs. non-parametric 9