Machine Learning CS 6830. Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu



Similar documents
Learning is a very general term denoting the way in which agents:

CSE 517A MACHINE LEARNING INTRODUCTION

Machine Learning Introduction

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Introduction to Data Mining

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Machine Learning: Overview

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Introduction to Pattern Recognition

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

: Introduction to Machine Learning Dr. Rita Osadchy

Information Management course

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Machine learning for algo trading

MA2823: Foundations of Machine Learning

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

DATA MINING TECHNIQUES AND APPLICATIONS

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

What is Learning? CS 391L: Machine Learning Introduction. Raymond J. Mooney. Classification. Problem Solving / Planning / Control

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

Supervised and unsupervised learning - 1

Machine Learning. CS494/594, Fall :10 AM 12:25 PM Claxton 205. Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004

Machine Learning.

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Machine Learning for Data Science (CS4786) Lecture 1

Data Mining Part 5. Prediction

Machine Learning using MapReduce

Social Media Mining. Data Mining Essentials

Introduction to Machine Learning Using Python. Vikram Kamath

ADVANCED MACHINE LEARNING. Introduction

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal

Machine Learning with MATLAB David Willingham Application Engineer

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

Classification algorithm in Data mining: An Overview

Data, Measurements, Features

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

Learning outcomes. Knowledge and understanding. Competence and skills

Practical Introduction to Machine Learning and Optimization. Alessio Signorini

The Data Mining Process

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

Foundations of Artificial Intelligence. Introduction to Data Mining

An Introduction to Data Mining

8. Machine Learning Applied Artificial Intelligence

Machine Learning and Statistics: What s the Connection?

Machine Learning Capacity and Performance Analysis and R

not possible or was possible at a high cost for collecting the data.

Steven C.H. Hoi School of Information Systems Singapore Management University

Defending Networks with Incomplete Information: A Machine Learning Approach. Alexandre

COMP 590: Artificial Intelligence

Data Mining: Overview. What is Data Mining?

Office: LSK 5045 Begin subject: [ISOM3360]...

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Data Mining and Machine Learning in Bioinformatics

EVALUATION OF NEURAL NETWORK BASED CLASSIFICATION SYSTEMS FOR CLINICAL CANCER DATA CLASSIFICATION

Introduction to Learning & Decision Trees

Introduction to Data Mining

Using multiple models: Bagging, Boosting, Ensembles, Forests

Maschinelles Lernen mit MATLAB

Knowledge-based systems and the need for learning

MS1b Statistical Data Mining

Applications of Deep Learning to the GEOINT mission. June 2015

Attribution. Modified from Stuart Russell s slides (Berkeley) Parts of the slides are inspired by Dan Klein s lecture material for CS 188 (Berkeley)

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

MACHINE LEARNING IN HIGH ENERGY PHYSICS

Lecture 1: Introduction to Reinforcement Learning

Sanjeev Kumar. contribute

MACHINE LEARNING BASICS WITH R

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Reference Books. Data Mining. Supervised vs. Unsupervised Learning. Classification: Definition. Classification k-nearest neighbors

Data Mining for Fun and Profit

from Larson Text By Susan Miertschin

Université de Montpellier 2 Hugo Alatrista-Salas : hugo.alatrista-salas@teledetection.fr

Machine Learning. CS 188: Artificial Intelligence Naïve Bayes. Example: Digit Recognition. Other Classification Tasks

Data Mining Analytics for Business Intelligence and Decision Support

Chapter 12 Discovering New Knowledge Data Mining

Predictive Dynamix Inc

1 What is Machine Learning?

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

COLLEGE OF SCIENCE. John D. Hromi Center for Quality and Applied Statistics

Introduction to Artificial Intelligence G51IAI. An Introduction to Data Mining

How To Cluster

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

TD-Gammon, A Self-Teaching Backgammon Program, Achieves Master-Level Play

Impelling Heart Attack Prediction System using Data Mining and Artificial Neural Network

Pentaho Data Mining Last Modified on January 22, 2007

Azure Machine Learning, SQL Data Mining and R

Feed-Forward mapping networks KAIST 바이오및뇌공학과 정재승

Predictive Data modeling for health care: Comparative performance study of different prediction models

Obtaining Value from Big Data

Distance Metric Learning in Data Mining (Part I) Fei Wang and Jimeng Sun IBM TJ Watson Research Center

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics

Big Data Analytics CSCI 4030

Analysis Tools and Libraries for BigData

Introduction. A. Bellaachia Page: 1

T : Classification as Spam or Ham using Naive Bayes Classifier. Santosh Tirunagari :

Visualization of Breast Cancer Data by SOM Component Planes

BIDM Project. Predicting the contract type for IT/ITES outsourcing contracts

Transcription:

Machine Learning CS 6830 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu

What is Learning? Merriam-Webster: learn = to acquire knowledge, understanding, or skill by study, instruction, or experience. What (tasks) do we learn: read, translate, write, speak. walk, play backgammon, ride bikes, drive cars, fly helicopters. categorize email, recognize faces, diagnose diseases, Why do we learn? to improve performance on a given task. 2

What is Learning?? 6 1 2 3 5 4 Class C 1 Class C 2 3

What is Learning?? 7 6 1 2 3 5 4 Class C 1 Class C 2 4

What is Learning?? 8 6 1 2 3 5 7 4 Class C 1 Class C 2 5

What is Learning?? 9 6 8 3 7 4 1 2 5 Class C 1 Class C 2 6

What is Learning? 6 8 3 9 7 4 1 2 5 Class C 1 Class C 2 7

What is Machine Learning? Machine Learning = constructing computer programs that automatically improve with experience: Supervised Learning i.e. learning from labeled examples: Classification Regression Unsupervised Learning i.e. learning from unlabeled examples: Clustering. Dimensionality reduction (vizualization). Density estimation. Reinforcement Learning i.e. learning with delayed feedback. 8

Supervised Learning Task = learn a function f : X T that maps input instances x X to output targets t T: Classification: The output t T is one of a finite set of discrete categories. Regression: The output t T is continuous, or has a continuous component. Supervision = set of training examples: (x 1,t 1 ), (x 2,t 2 ), (x n,t n ) 9

Classification vs. Regression 1 1 0-1 x 0-1 0 1 I 0 1 x 10

Classification: Junk Email Filtering [Sahami, Dumais & Heckerman, AAAI 98] From: Tammy Jordan jordant@oak.cats.ohiou.edu Subject: Spring 2015 Course -------------------------------------- CS690: Machine Learning Instructor: Razvan Bunescu Email: bunescu@ohio.edu Time and Location: Tue, Thu 9:00 AM, ARC 101 Website: http://ace.cs.ohio.edu/~razvan/courses/ml6830 Course description: Machine Learning is concerned with the design and analysis of algorithms that enable computers to automatically find patterns in the data. This introductory course will give an overview From: UK National Lottery edreyes@uknational.co.uk Subject: Award Winning Notice -------------------------------------- UK NATIONAL LOTTERY. GOVERNMENT ACCREDITED LICENSED LOTTERY. REGISTERED UNDER THE UNITED KINGDOM DATA PROTECTION ACT; We happily announce to you the draws of ( UK NATIONAL LOTTERY PROMOTION ) International programs held in London, England Your email address attached to ticket number :3456 with serial number : 7576/06 drew the lucky number 4-2-274, which subsequently won you the lottery in the first category Email filtering: Provide emails labeled as {Spam, Ham}. Train Naïve Bayes model to discriminate between the two. 11

Classification: Routing in Wireless Sensor Networks [Wang, Martonosi & Peh, SECON 06] Link quality prediction: Provide a set of training links: received signal strength, send/forward buffer sizes node depth from base station, forward/backward probability o LQI = Link Quality Indication, binarized as {Good, Bad} - Train Decision Trees model to predict LQ using runtime features. 12

Classification: Handwritten Zip Code Recognition [Le Cun et al., Neural Computation 89] Handwritten digit recognition: Provide images of handwritten digits, labeled as {0, 1,, 9}. Train Neural Network model to recognize digits from input images. 13

Classification: Medical Diagnosis Cancer diagnosis from gene expression signatures: Create database of gene expression profiles (X) from tissues of known cancer status (Y): Human accute leukemia dataset: http://www.broadinstitute.org/cgi-bin/cancer/datasets.cgi Colon cancer microarray data: [Krishnapuram et al., GENSIPS 02] http://microarray.princeton.edu/oncology Train Logistic Regression / SVM / RVM model to classify the gene expression of a tissue of unknown cancer status. 14

Classification: Other Examples Handwritten letter recognition Face recognition Credit card applications/transactions Recommender systems: books, music, Fraud detection in e-commerce Worm detection in network packets 15

Regression: Examples 1. Stock market prediction: Use the current stock market conditions (x X) to predict tomorrow s value of a particular stock (t T). 2. Oil price, GDP, income prediction. 3. Chemical processes: Predict the yield in a chemical process based on the concentrations of reactants, temperature and pressure. Algorithms: Linear Regression, Neural Networks, Support Vector Machines, 16

Unsupervised Learning: Hierarchical Clustering Pan Troglodytes Homo Sapiens 17

100 80 60 40 20 similarity scale 0 18

Unsupervised Learning: Clustering Partition unlabeled examples into disjoint clusters such that: Examples in the same cluster are very similar. Examples in different clusters are very different. Need to provide: - number of clusters (k = 2) - similarity measure (Euclidean) 19

Unsupervised Learning: Clustering Partition unlabeled examples into disjoint clusters such that: Examples in the same cluster are very similar. Examples in different clusters are very different. Need to provide: - number of clusters (k = 2) - similarity measure (Euclidean) 20

Unsupervised Learning: Dimensionality Reduction Manifold Learning: Data lies on a low-dimensional manifold embedded in a highdimensional space. Useful for feature extraction and visualization. 21

Reinforcement Learning Interaction between agent and environment modeled as a sequence of actions & states: Learn policy for mapping states to actions in order to maximize a reward. Reward given at the end state => delayed reward. States may be only partially observable. Trade-off between exploration and exploitation. Examples: Backgammon [Tesauro, CACM 95]. Aerobatic helicopter flight [Abbeel, NIPS 07]. 22

Reinforcement Learning: TD-Gammon Learn to play Backgammon: Immediate reward: +100 if win -100 if lose 0 for all other states [Tesauro, CACM 95] Temporal Difference Learning with a Multilayer Perceptron. Trained by playing 1.5 million games against itself. Played competitively against top-ranked players in international tournaments. 23

Relevant Disciplines Mathematics: Probability & Statistics Information Theory Linear Algebra Optimization Algorithms: Computational Complexity Artificial Intelligence Psychology Neurobiology Philosophy 24