Machine Learning: Algorithms and Applications

Similar documents
CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

CSci 538 Articial Intelligence (Machine Learning and Data Analysis)

Faculty of Science School of Mathematics and Statistics

Learning outcomes. Knowledge and understanding. Competence and skills

CSCI-599 DATA MINING AND STATISTICAL INFERENCE

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

Machine Learning Introduction

Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)

MA2823: Foundations of Machine Learning

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

An Introduction to Data Mining

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

ADVANCED MACHINE LEARNING. Introduction

8. Machine Learning Applied Artificial Intelligence

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

TDS - Socio-Environmental Data Science

Machine learning for algo trading

Introduction to Machine Learning Using Python. Vikram Kamath

: Introduction to Machine Learning Dr. Rita Osadchy

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

MS1b Statistical Data Mining

Lecture/Recitation Topic SMA 5303 L1 Sampling and statistical distributions

Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100

Office: LSK 5045 Begin subject: [ISOM3360]...

Knowledge Discovery from Data Bases Proposal for a MAP-I UC

Learning is a very general term denoting the way in which agents:

PTE505: Inverse Modeling for Subsurface Flow Data Integration (3 Units)

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Machine Learning.

SYLLABUS MAC 1105 COLLEGE ALGEBRA Spring 2011 Tuesday & Thursday 12:30 p.m. 1:45 p.m.

Course 395: Machine Learning

Masters in Information Technology

Statistics 3202 Introduction to Statistical Inference for Data Analytics 4-semester-hour course

1. Classification problems

Software Engineering and Service Design: courses in ITMO University

College of Health and Human Services. Fall Syllabus

Syllabus for ECON 106G Introduction to Game Theory. Spring 2014, Department of Economics UCLA

COLLEGE OF SCIENCE. John D. Hromi Center for Quality and Applied Statistics

Register for CONNECT using the code with your book and this course access information:

Principles of Data Mining by Hand&Mannila&Smyth

Supervised Learning (Big Data Analytics)

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Final Project Report

Management 525 Marketing Analytics 1. Course Description and Objectives

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

How To Get A Masters Degree In Logistics And Supply Chain Management

CSE 427 CLOUD COMPUTING WITH BIG DATA APPLICATIONS

Machine Learning and Statistics: What s the Connection?

Data Mining and Business Intelligence CIT-6-DMB. Faculty of Business 2011/2012. Level 6

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Computer Science Theory. From the course description:

Online Course Syllabus CS406/BA406 Managing Web Technologies. Important Notes:

ECO 250 Economics and Business Statistics I Fall 2009 Online Course

Syllabus for MATH 191 MATH 191 Topics in Data Science: Algorithms and Mathematical Foundations Department of Mathematics, UCLA Fall Quarter 2015

PSG College of Technology, Coimbatore Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

BIOM611 Biological Data Analysis

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics

Before you begin to adapt your course for online learning, consider the following questions:

Data Mining Carnegie Mellon University Mini 2, Fall Syllabus

Lecture: Mon 13:30 14:50 Fri 9:00-10:20 ( LTH, Lift 27-28) Lab: Fri 12:00-12:50 (Rm. 4116)

T : Classification as Spam or Ham using Naive Bayes Classifier. Santosh Tirunagari :

Lake-Sumter Community College Course Syllabus. STA 2023 Course Title: Elementary Statistics I. Contact Information: Office Hours:

CS Data Science and Visualization Spring 2016

CS 6220: Data Mining Techniques Course Project Description

Core Curriculum to the Course:

Syllabus. HMI 7437: Data Warehousing and Data/Text Mining for Healthcare

Bayesian networks - Time-series models - Apache Spark & Scala

STAT 121 Hybrid SUMMER 2014 Introduction to Statistics for the Social Sciences Session I: May 27 th July 3 rd

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology

An Augmented Normalization Mechanism for Capacity Planning & Modelling Elegant Approach with Artificial Intelligence

DEPARTMENT OF INFORMATION SCIENCE. INFO221 Application Software Development COURSE OUTLINE

Machine Learning. CS494/594, Fall :10 AM 12:25 PM Claxton 205. Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004

Predictive Data Mining in Very Large Data Sets: A Demonstration and Comparison Under Model Ensemble

Study Plan for the Master Degree In Industrial Engineering / Management. (Thesis Track)

Introduction to Data Mining

DATA MINING TECHNIQUES AND APPLICATIONS

Undergraduate Course Syllabus

Web Document Clustering

Scalable Developments for Big Data Analytics in Remote Sensing

New Ensemble Combination Scheme

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

SYLLABUS Fall 2013 MATH 115 ELEMENTARY STATISTICS. Class Section Name (on WileyPlus):

Business Intelligence Integration. Joel Da Costa, Takudzwa Mabande, Richard Migwalla Antoine Bagula, Joseph Balikuddembe

MACHINE LEARNING BASICS WITH R

Guidelines for Independent Study

MSIS 635 Session 1 Health Information Analytics Spring 2014

Information and Decision Sciences (IDS)

DATA MINING FOR BUSINESS INTELLIGENCE. Data Mining For Business Intelligence: MIS 382N.9/MKT 382 Professor Maytal Saar-Tsechansky

International Financial Accounting (IFA)

Lecture 6: Logistic Regression

NORTHWESTERN UNIVERSITY Department of Statistics. Fall 2012 Statistics 210 Professor Savage INTRODUCTORY STATISTICS FOR THE SOCIAL SCIENCES

Business Intelligence. Data Mining and Optimization for Decision Making

Online Course Syllabus MT201 College Algebra. Important Notes:

Governors State University College of Business and Public Administration. Course: STAT Statistics for Management I (Online Course)

Analysis Tools and Libraries for BigData

Transcription:

Machine Learning: Algorithms and Applications Floriano Zini Free University of Bozen-Bolzano Faculty of Computer Science Academic Year 2011-2012 Contacts Floriano Zini Room 2.18 (POS) floriano.zini@unibz.it Office hours Wednesday, 14:00-16:00 by prior arrangement via email Course web site http://www.inf.unibz.it/~zini/ml 1

Course structure Credits: 4 Lectures: 24 hours Labs: 12 hours Assessment Project in small teams (max 2 people) [40 % of mark] Final written exam [60 % of mark] Timetable Lectures: Mondays, 10:30-12:30, E 411 Labs: Mondays, 15:00-16:00, E 331 Starting on Monday, 05/03/2012 Please check the course webpage and/or online timetable for variations No lecture/lab on 09/04/2012 à10/4/2012, 17:00-20:00 E 531?? 30/4/2012 à 2/5/2012, 16:00-19:00 E 531?? 2

Goals of the course Introduce the fundamental principles and concepts of Machine Learning Present the principal algorithms and techniques focusing on strengths an weaknesses appropriateness for learning problems Teach how to design, implement, and evaluate ML algorithms Present practical applications of Machine Learning How? Frontal lectures Labs Exercises (fundamental for the written exam!) Java will be used by the students to implement simple ML algorithms presented in the lectures NetBeans (http://netbeans.org/) is the suggested IDE for development Weka (http://www.cs.waikato.ac.nz/ml/weka/) will be used to explore the application of different ML algorithms Homework Studying the slides is not enough The students should also Study on the suggested book Do the proposed exercises Read the proposed scientific articles 3

Syllabus Introduction to Machine Learning What is ML, supervised, unsupervised, and reinforcement learning Classification and regression Bayesian (probabilistic) learning, model probability estimation, dimensionality reduction, nonparametric methods, linear discrimination, artificial neural networks, genetic algorithms Design and analysis of ML experiments Strategies, guidelines for good design, cross validation, performance measurement, hypothesis testing, algorithm comparison Advanced algorithms Kernel Machines, Hidden Markov Models, Graphical models Reinforcement Learning Course material Course slides Available at http://www.inf.unibz.it/~zini/ml the day after the lecture Textbooks Ethem Alpaydin, Introduction to Machine Learning, 2 nd edition, MIT Press 2010 Main text book, copies are available in the library Most of the slides are taken from those provided by Alpaydin Tom Mitchell, Machine Learning, McGraw-Hill, 1997. A milestone book, copies are available in the library Selected scientific papers Available at http://www.inf.unibz.it/~zini/ml the day after the lecture Software tools Java NetBeans, Weka Datasets UC Irvine Machine Learning Repository (http://archive.ics.uci.edu/ml/) 4

Assessment Project (P) maximum 12 points In a team of maximum 2 students Choose a ML technique introduced in the course to implement a learning system for an interesting task, and test it on some dataset(s) Implementation in Java Written exam (E) maximum 18 points Questions and small exercises similar to those done in the lab Positively assessed (i.e., at least 7 points) project work is required for taking the written exam!! A positive mark of the project work counts for 3 regular exam sessions Final grade (G) G = P + E To pass: 7 points of (P) and 11 points of (E) Project - proposal Freely propose the task, the ML algorithm, and the dataset(s) for your project work The project proposal should (be reasonably specific!) be 1-2 pages long describe the task you want to solve (i.e., application scenario) indicate the learning algorithm you intend to apply specify what are the input/output of the learning system, what kind of representation is used for the input/output indicate the dataset(s) to be used Please send me by e-mail your proposal, including students names and ids The deadline for proposal submission is??? (check course webpage) I will revise the proposal and the students might be asked to improve it 5

Project - requirements The deliverables of the project work include The source codes (in a zip file) The instruction (i.e., readme) file that describes in details how to install/ compile/run the project program (and additional soft. pack. used) The project report (PDF file) that describes The problem definition (i.e., which task the learning system solves) The ML algorithm implemented and the datasets used The system functions (and how to use these functions) The experimental results of the systems performance on the used datasets The structure of the source code (and the role of each class and the main methods) The major problems/issues occurred in the implementation, and how you solved them Your discussions/findings/conclusions, and possible future improvements Project - assessment The project should be delivered by email to me by??? (check course webpage) The project work will be evaluated on The complexity/difficulty of the task you have solved, and the quality of your solution The evidence of the experimental results, and your discussions/arguments on these results Quality of the report The system implementation (e.g., functions, easy to use, etc.) If you reuse/extend/exploit existing codes/packages, you must explicitly and clearly describe in the report (and mention in the presentation) what you have reused/extended/exploited; and what are your own codes 6