Foundations of Machine Learning and Data Science

Similar documents
Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)

Interactive Machine Learning. Maria-Florina Balcan

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Data, Measurements, Features

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning

MACHINE LEARNING BASICS WITH R

CLASSIFYING NETWORK TRAFFIC IN THE BIG DATA ERA

Machine Learning and Statistics: What s the Connection?

Machine Learning.

: Introduction to Machine Learning Dr. Rita Osadchy

Computer-Based Text- and Data Analysis Technologies and Applications. Mark Cieliebak

Learning is a very general term denoting the way in which agents:

Big Data Analytics CSCI 4030

CSE 517A MACHINE LEARNING INTRODUCTION

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Data Mining and Machine Learning in Bioinformatics

Active Learning with Boosting for Spam Detection

Machine Learning: Overview

What is Artificial Intelligence?

Big learning: challenges and opportunities

Boosting.

Machine Learning Introduction

Information Management course

Neural Networks and Support Vector Machines

Big Data - Lecture 1 Optimization reminders

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

Steven C.H. Hoi School of Information Systems Singapore Management University

Azure Machine Learning, SQL Data Mining and R

Defending Networks with Incomplete Information: A Machine Learning Approach. Alexandre

Semi-Supervised and Unsupervised Machine Learning. Novel Strategies

CSC384 Intro to Artificial Intelligence

Support Vector Machine. Tutorial. (and Statistical Learning Theory)

MA2823: Foundations of Machine Learning

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012

Semi-Supervised Support Vector Machines and Application to Spam Filtering

Machine learning for algo trading

Text Analysis for Big Data. Magnus Sahlgren

Introduction to Data Mining

Data Mining - Evaluation of Classifiers

Machine Learning Capacity and Performance Analysis and R

Unsupervised Data Mining (Clustering)

1 What is Machine Learning?

Industry 4.0 and Big Data

Introduction to Data Mining

A1 Introduction to Data exploration and Machine Learning

Deep Learning for Big Data

The Discipline of Machine Learning

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Machine Learning using MapReduce

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

CI6227: Data Mining. Lesson 11b: Ensemble Learning. Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore.

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier

COMPSCI 760 S2 C 2014 Machine Learning and Data Mining Computer Science Department

Big Data Classification: Problems and Challenges in Network Intrusion Prediction with Machine Learning

Spam detection with data mining method:

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Anomaly detection. Problem motivation. Machine Learning

Introduction to Machine Learning Using Python. Vikram Kamath

Knowledge Discovery from patents using KMX Text Analytics

Obtaining Value from Big Data

Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal

Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. ~ Spring~r

ADVANCED MACHINE LEARNING. Introduction

Exploring Big Data in Social Networks

An Overview of Computational Advertising

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

CS144R/244R Network Design Project on Software Defined Networking for Computing

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Dan French Founder & CEO, Consider Solutions

Challenges of Cloud Scale Natural Language Processing

Steven C.H. Hoi. School of Computer Engineering Nanyang Technological University Singapore

Data Mining Practical Machine Learning Tools and Techniques

Data Mining: Introduction. Lecture Notes for Chapter 1. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler

Classification and Prediction

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

Computer Forensics Application. ebay-uab Collaborative Research: Product Image Analysis for Authorship Identification

Data Quality Mining: Employing Classifiers for Assuring consistent Datasets

Course 395: Machine Learning

LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING. ----Changsheng Liu

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

An Introduction to Machine Learning and Natural Language Processing Tools

Masters in Information Technology

Making Sense of the Mayhem: Machine Learning and March Madness

Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment

Supervised Learning (Big Data Analytics)

Big Data Analytics. Genoveva Vargas-Solar French Council of Scientific Research, LIG & LAFMIA Labs

NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE. Venu Govindaraju

School of Computer Science

MACHINE LEARNING & INTRUSION DETECTION: HYPE OR REALITY?

Transcription:

Foundations of Machine Learning and Data Science Lecture 1, September 9, 2015 Maria-Florina (Nina) Balcan

Course Staff Instructors: Nina Balcan http://www.cs.cmu.edu/~ninamf Avrim Blum http://www.cs.cmu.edu/~avrim TAs: Sarah Allen http://www.cs.cmu.edu/~srallen Nika Haghtalab http://www.cs.cmu.edu/~nhaghtal

On the board Lectures in general Ocasionally, will use slides

Image Classification Document Categorization Machine Learning Speech Recognition Protein Classification Branch Prediction Fraud Detection Spam Detection Playing Games Computational Advertising 4

Machine Learning is Changing the World Machine learning is the hot new thing (John Hennessy, President, Stanford) A breakthrough in machine learning would be worth ten Microsofts (Bill Gates, Microsoft) Web rankings today are mostly a matter of machine learning (Prabhakar Raghavan, VP Engineering at Google)

The COOLEST TOPIC IN SCIENCE A breakthrough in machine learning would be worth ten Microsofts (Bill Gates, Chairman, Microsoft) Machine learning is the next Internet (Tony Tether, Director, DARPA) Machine learning is the hot new thing (John Hennessy, President, Stanford) Web rankings today are mostly a matter of machine learning (Prabhakar Raghavan, Dir. Research, Yahoo) Machine learning is going to result in a real revolution (Greg Papadopoulos, CTO, Sun) Machine learning is today s discontinuity (Jerry Yang, CEO, Yahoo)

This course: foundations of Machine Learning and Data Science A 2 Â

Goals of Machine Learning Theory Develop and analyze models to understand: what kinds of tasks we can hope to learn, and from what kind of data what types of guarantees might we hope to achieve prove guarantees for practically successful algs (when will they succeed, how long will they take?) develop new algs that provably meet desired criteria (potentially within new learning paradigms) Interesting connections to other areas including: Algorithms Optimization Probability & Statistics Game Theory Complexity Theory Information Theory

Example: Supervised Classification Decide which emails are spam and which are important. Not spam Supervised classification spam Goal: use emails seen so far to produce good prediction rule for future data. 9

Example: Supervised Classification Represent each message by features. (e.g., keywords, spelling, etc.) example label Reasonable RULES: Predict SPAM if unknown AND (money OR pills) Predict SPAM if 2money + 3pills 5 known > 0 + + - - - - - Linearly separable 10

Two Main Aspects of Supervised Learning Algorithm Design. How to optimize? Automatically generate rules that do well on observed data. Confidence Bounds, Generalization Guarantees, Sample Complexity Confidence for rule effectiveness on future data. Well understood for passive supervised learning. 11

Using Unlabeled Data and Interaction for Learning Application Areas Search/Information Retrieval Computer Vision Spam Detection Computational Biology Robotics Medical Diagnosis

Massive Amounts of Raw Data Only a tiny fraction can be annotated by human experts. Protein sequences Billions of webpages Images 13

Semi-Supervised Learning raw data face not face Labeled data Expert Labeler Classifier 14

Active Learning raw data face not face Expert Labeler O O O Classifier 15

Other Protocols for Supervised Learning Semi-Supervised Learning Using cheap unlabeled data in addition to labeled data. Active Learning The algorithm interactively asks for labels of informative examples. Theoretical understanding entirely lacking 10 years ago. Lots of progress recently. We will cover some of these. 16

Distributed Learning Many ML problems today involve massive amounts of data distributed across multiple locations. Often would like low error hypothesis wrt the overall distrib.

Distributed Learning Data distributed across multiple locations. E.g., medical data

Distributed Learning Data distributed across multiple locations. E.g., scientific data

Distributed Learning Data distributed across multiple locations. Each has a piece of the overall data pie. To learn over the combined D, must communicate. Important question: how much communication? Plus, privacy & incentives.

The World is Changing Machine Learning New approaches. E.g., Semi-supervised learning Interactive learning Distributed learning Multi-task/transfer learning Deep Learning Never ending learning Many competing resources & constraints. E.g., Computational efficiency (noise tolerant algos) Human labeling effort Statistical efficiency Communication Privacy/Incentives

Structure of the Class Basic Learning Paradigm: Passive Supervised Learning Basic models: PAC, SLT. Simple algos and hardness results for supervised learning. Standard Sample Complexity Results (VC dimension) Modern Sample Complexity Results Rademacher Complexity; localization Weak-learning vs. Strong-learning Classic, state of the art algorithms: AdaBoost and SVM (kernel based mehtods). Margin analysis of Boosting and SVM

Structure of the Class Other Learning Paradigms Incorporating Unlabeled Data in the Learning Process. Incorporating Interaction in the Learning Process: Active Learning More general types of Interaction Distributed Learning. Transfer learning/multi-task learning/life-long learning. Deep Learning. Foundations and algorithms for constraints/externalities. E.g., privacy, limited memory, and communication.

Structure of the Class Other Topics. Methods for summarizing and making sense of massive datasets including: unsupervised learning. spectral, combinatorial techniques. streaming algorithms. Online Learning, Optimization, and Game Theory connections to Boosting

Admin Course web page: http://www.cs.cmu.edu/~ninamf/courses/806/10-806-index.html Two grading schemes: 1) Project Oriented. - Project [60%] - Take-home final [10%] - Hwks + grading [30%] 2) Homework Oriented. - Hwk +grading [60%] - Take-home final [10%] - Project [30%]

Admin Course web page: http://www.cs.cmu.edu/~ninamf/courses/806/10-806-index.html 1) Project Oriented. - Project [60%] explore a theoretical or empirical question; write-up --- ideally aim for a conference submission! Small groups OK. - Take-home final [10%] - Hwks + grading [30%]

Admin Course web page: http://www.cs.cmu.edu/~ninamf/courses/806/10-806-index.html 2) Homework Oriented. - Hwk +grading [60%] - Take-home final [10%] - Project [30%] read a couple of papers and explain the idea.

On the board Lectures in general Ocasionally, will use slides