EECS 445: Introduction to Machine Learning Winter 2015

Similar documents
CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

Learning outcomes. Knowledge and understanding. Competence and skills

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

Faculty of Science School of Mathematics and Statistics

MS1b Statistical Data Mining

CSCI-599 DATA MINING AND STATISTICAL INFERENCE

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Introduction to Data Science: CptS Syllabus First Offering: Fall 2015

An Introduction to Data Mining

CSci 538 Articial Intelligence (Machine Learning and Data Analysis)

CS 2302 Data Structures Spring 2015

Machine Learning Introduction

Office: LSK 5045 Begin subject: [ISOM3360]...

ADVANCED MACHINE LEARNING. Introduction

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

BUDT 758B-0501: Big Data Analytics (Fall 2015) Decisions, Operations & Information Technologies Robert H. Smith School of Business

CS Data Science and Visualization Spring 2016

Course Syllabus. Purposes of Course:

Syllabus for MATH 191 MATH 191 Topics in Data Science: Algorithms and Mathematical Foundations Department of Mathematics, UCLA Fall Quarter 2015

HT2015: SC4 Statistical Data Mining and Machine Learning

Machine Learning.

Lecture/Recitation Topic SMA 5303 L1 Sampling and statistical distributions

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

: Introduction to Machine Learning Dr. Rita Osadchy

INFO 3130 Management Information Systems Spring 2016

Course Evaluation Methods

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague.

Finance 471: DERIVATIVE SECURITIES Fall 2015 Prof. Liang Ma University of South Carolina, Moore School of Business

PTE505: Inverse Modeling for Subsurface Flow Data Integration (3 Units)

Statistics Graduate Courses

College Algebra MATH 1111/11

AMIS 7640 Data Mining for Business Intelligence

Machine Learning for Data Science (CS4786) Lecture 1

Management Science 250: Mathematical Methods for Business Analysis Three Semester Hours

FI 630 Financial Management I

DSBA/MBAD 6211 Advanced Business Analytics UNC Charlotte Fall 2015

Prairie View A&M University P.O. Box 519 Mail Stop 2510 Prairie View, TX 77446

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

AMIS 7640 Data Mining for Business Intelligence

UNIVERSITY OF MICHIGAN SCHOOL OF INFORMATION SI301: Models of Social Information Processing Syllabus

Elementary Business Statistics (STA f309) MTWTh 10:00-12:00, UTC Summer 2012

Brown University Department of Economics Spring 2015 ECON 1620-S01 Introduction to Econometrics Course Syllabus

Gustavus Adolphus College Department of Economics and Management E/M : MARKETING M/T/W/F 11:30AM 12:20AM, BH 301, SPRING 2016

NORTHWESTERN UNIVERSITY Department of Statistics. Fall 2012 Statistics 210 Professor Savage INTRODUCTORY STATISTICS FOR THE SOCIAL SCIENCES

Introduction to Information Technology ITP 101x (4 Units)

PART 1: INSTRUCTOR INFORMATION, COURSE DESCRIPTION AND TEACHING METHODS

Learning is a very general term denoting the way in which agents:

Categorical Data Analysis

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

How To Get A Computer Science Degree

Department of Accounting ACC Fundamentals of Financial Accounting Syllabus

MATH 1900, ANALYTIC GEOMETRY AND CALCULUS II SYLLABUS

Class Syllabus. Department of Business Administration & Management Information Systems. Texas A&M University Commerce

CRJ 460/560 Survey of Technology and Crime FSC 450/550 Computer Forensics Spring 2005

CRN: STAT / CRN / INFO 4300 CRN

IN THE CITY OF NEW YORK Decision Risk and Operations. Advanced Business Analytics Fall 2015

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

Course of Study for the Robotics Ph.D. Program

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

Machine Learning and Statistics: What s the Connection?

Basic Components of an LP:

Statistics W4240: Data Mining Columbia University Spring, 2014

Remote Sensing for Geographical Analysis

MSCA Introduction to Statistical Concepts

KENNESAW STATE UNIVERSITY GRADUATE COURSE PROPOSAL OR REVISION, Cover Sheet (10/02/2002)

Data Mining Carnegie Mellon University Mini 2, Fall Syllabus

Bayesian networks - Time-series models - Apache Spark & Scala

Phone: (773) Spring Office hours: MW 7:30-8:20 and 11:00-12:20, T 7:30-7:50 and 9:55-12:15

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

Winter 2016 Course Timetable. Legend: TIME: M = Monday T = Tuesday W = Wednesday R = Thursday F = Friday BREATH: M = Methodology: RA = Research Area

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

NEURAL NETWORKS A Comprehensive Foundation

Course Description This course will change the way you think about data and its role in business.

Alabama Department of Postsecondary Education. Representing The Alabama Community College System

PADM 596: Research Methods for Public Managers Spring 2016

GEOG 5200S Elements of Cartography : Serving the Community Through Cartography Spring 2015

MATH 2412 PRECALCULUS SPRING 2015 Synonym 26044, Section 011 MW 12:00-1:45, EVC 8106

Principles of Data Mining by Hand&Mannila&Smyth

A1 Introduction to Data exploration and Machine Learning

Data Mining and Business Intelligence CIT-6-DMB. Faculty of Business 2011/2012. Level 6

MGMT 280 Impact Investing Ed Quevedo

Statistics 3202 Introduction to Statistical Inference for Data Analytics 4-semester-hour course

Azure Machine Learning, SQL Data Mining and R

MSCA Introduction to Statistical Concepts

Ordinary Differential Equations

Information and Decision Sciences (IDS)

COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK DEPARTMENT OF INDUSTRIAL ENGINEERING AND OPERATIONS RESEARCH

CEDAR CREST COLLEGE Psychological Assessment, PSY Spring Dr. Diane M. Moyer dmmoyer@cedarcrest.edu Office: Curtis 123

Class #6: Non-linear classification. ML4Bio 2012 February 17 th, 2012 Quaid Morris

University of Florida ADV 3502, Section 7E39 Advertising Sales Summer C 2016

Ordinary Differential Equations

PSY 311: Research Methods in Psychology I (FALL 2011) Course Syllabus

Transcription:

Instructor: Prof. Jenna Wiens Office: 3609 BBB wiensj@umich.edu EECS 445: Introduction to Machine Learning Winter 2015 Graduate Student Instructor: Srayan Datta Office: 3349 North Quad (**office hours location 3941 BBB**) srayand@umich.edu Course Information: Lectures Monday & Wednesday, 1:30pm-3:00pm, 1010 DOW Discussions Friday 11:00am-12:00am, 1010 DOW Course Materials & Textbook Course materials will be posted on the course CTools site (https://ctools.umich.edu/portal) Recommended Textbooks (optional): Chris Bishop, Pattern Recognition and Machine Learning, Springer, 2007. Kevin Murphy, Machine Learning: A Probabilistic Perspective, MIT Press, 2012. Course Description The course is a programming-focused introduction to Machine Learning. Increasingly, extracting value from data is an important contributor to the global economy across a range of industries. The field of Machine Learning provides the theoretical underpinnings for dataanalysis as well as, more broadly, for modern artificial intelligence approaches to building artificial agents that interact with data; it has had a major impact on many real-world applications. The course will emphasize understanding the foundational algorithms and tricks of the trade through implementation and basic-theoretical analysis. On the implementation side, the emphasis will be on practical applications of machine learning to computer vision, data mining, speech recognition, text processing, bioinformatics, and robot perception and control. Real data sets will be used whenever feasible to encourage understanding of practical issues. On the theoretical side, the course will give a undergraduate-level introduction to the foundations of machine learning topics including regression, classification, kernel methods, regularization, neural networks, graphical models, and unsupervised learning. EECS 445: Introduction to Machine Learning Page! 1

Prerequisites EECS 281 In addition, students should have familiarity with linear algebra (MATH 217, MATH 417) and probability (EECS 401). Office Hours GSI: 3941 BBB Wednesday 3:00pm-4:30pm, Thursdays 3:00pm-4:30pm Instructor: 3609 BBB, Tuesdays 10:30am-Noon, or by appointment Course Tools Information about the course including assignments and supplementary readings will be posted on CTools (https://ctools.umich.edu/portal). You are expected to check the site frequently, although you usually will be automatically notified by e-mail when new materials are posted. We will be using Piazza for class discussion. Find our class page at: https:// piazza.com/umich/winter2015/eecs445001w15/home Grading Exams Midterm on March 11th (20%) Final exam during finals weeks (35%) Homework 4 Problem Sets (10%) 3 Mini-Projects (30%) In-Class Problems (4%) Course Evaluation (1%) There will be four problem sets and three mini-projects assigned over the course of the semester to strengthen understanding of fundamental concepts and provide an opportunity for hands-on learning using real datasets. In-class problems will be handed out and solved during lectures, graded for effort. Submitting homework: Bi-weekly homework assignments are due Fridays at 9am on the dates noted in the course schedule. Scanned copies of your homework should be submitted via CTools. Grading policy: Solutions for problem sets will be posted exactly three days after the due date at 9am. Students will be responsible for grading their own problem sets. Scanned copies of the graded/corrected assignments are due at the same time as the next assignment. The goal behind this policy is for students to take an active role in addressing their own misconceptions and in evaluating their own performance. Mini-projects will be graded by the course staff. Late submission policy for homework and projects: You can be up to three days late, automatically losing 10% for each 24 hour period starting immediately (e.g., 1 min late means 10% off, 25 hours late means 20% off, and so on). No submissions will be accepted after the three days unless accompanied with a note from the Dean. EECS 445: Introduction to Machine Learning Page! 2

Honor Code The Honor Code outlines certain standards of ethical conduct for persons associated with the College of Engineering at the University of Michigan. The policies of the Honor Code apply to graduate and undergraduate students, faculty members, and administrators. Read about the UM Honor Code here: (http://www.crlt.umich.edu/faculty/honor.html). There is also an Engineering Honor Code: (http://www.engin.umich.edu/students/honorcode/code/). In this class, as in many others at the University, you will be expected to include and sign the Honor Pledge on each assignment you submit. The Honor Pledge is as follows: I have neither given nor received unauthorized aid on this assignment, nor have I concealed any violations of the Honor Code. The Honor code is based on these tenets: o Engineers must possess personal integrity both as students and as professionals. They must be honorable people to ensure safety, health, fairness, and the proper use of available resources in their undertakings. o Students in the College of Engineering community are honorable and trustworthy persons. o The students, faculty members, and administrators of the College of Engineering trust each other to uphold the principles of the Honor Code. They are jointly responsible for precautions against violations of its policies. o It is dishonorable for students to receive credit for work that is not the result of their own efforts. Among other things, the Honor Code forbids plagiarism. To plagiarize is to use another person's ideas, writings, etc. as one's own, without crediting the other person. Thus, you must credit information obtained from other sources, including web sites, e-mail or other written communications, conversations, articles, books, etc. On team assignments, the co-authors listed on the submission should include only those team members who have contributed their fair share to the assignment. If you allow a teammate's name to appear on an assignment to which he/she has not contributed fairly, then you are violating the Honor Code. Handling Data with Integrity You may not falsify or misrepresent methods, data, results, or conclusions, regardless of their source. Unfair Advantage You may not possess, look at, use, or in any way derive advantage from the solutions of homework, exams or papers prepared in prior years, whether these solutions were former students work products or solutions that have been made available by University of Michigan faculty or on the internet, unless this section s faculty expressly allows the use of such materials. EECS 445: Introduction to Machine Learning Page! 3

Disability Policy If you have any disability as defined under the Americans with Disabilities Act that might interfere with your ability to participate in class, or to turn in assignments on time or in the form required, please contact your instructor and the Office of Students with Disabilities at the start of the term so that arrangements can be made to accommodate you Tentative Course Schedule (Subject to Change) If you have any disability as defined under the Americans with Disabilities Act that might inte Lecture Topics Covered 01/07 Introduction Machine Learning: What & Why? Linear Classification 01/09 PS1 - Out 01/12 Supervised Learning Learning Linear Classifiers, Perceptron Algorithm 01/14 Supervised Learning Linear Classifiers Non-Separable Case, Gradient Descent 01/21 Supervised Learning Linear Regression - Empirical Risk & Least Squares, Regularization 01/23 PS1 - Due, Project1-Out 01/26 Supervised Learning Support Vector Machines; Primal Formulation; Geometric Margin 01/28 Supervised Learning Dual Formulation; Kernels 02/02 Supervised Learning Feature Construction; Selection: Filter, Wrapper, Embedded Methods 02/04 Performance Evaluation Confusion Matrices; AUROC; F-score; Calibration 02/06 Project1- Due, PS2 - Out 02/09 Supervised Learning Decision Trees; Entropy 02/11 Ensemble Methods Bagging; Random Forest 02/16 Ensemble Methods Boosting; Adaboost 02/18 Recommender Problems Collaborative Filtering 02/20 PS2- Due, Project 2 - Out 02/23 Unsupervised Learning Introduction to Clustering; K-means; Hierarchical Clustering 02/25 Unsupervised Learning Spectral Clustering; Clustering as a Graph Cut Problem; Graph Laplacian 03/06 Project 2 - Due, PS3 - Out EECS 445: Introduction to Machine Learning Page! 4

Lecture Topics Covered 03/09 Midterm Review 03/11 Midterm 03/16 Generative Models Gaussian Mixture Model; EM1 03/18 Generative Models EM2 03/20 PS3 - Due, Project 3 Out 03/23 Generative Models Bayesian Nets 03/25 Generative Models Hidden Markov Models; Inference Problems; Viterbi and Forward-backward algorithms 03/30 Generative Models Hidden Markov Models; Baum-Welch; Model Selection 04/01 Graphical Models Latent Dirichlet Allocation; Applications & Extensions 04/03 Project 3 Due, PS4 - Out 04/06 Special Topics Deep Learning, Multi-Layer Neural Nets 1 04/08 Special Topics Deep Learning, Multi-Layer Neural Nets 2 04/13 Special Topics Reinforcement Learning 04/15 Applications Examples from Research & Industry 04/17 PS4 - Due 04/20 Final Review EECS 445: Introduction to Machine Learning Page! 5