CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.


 Edmund Cole
 2 years ago
 Views:
Transcription
1 Lecture 1 Machine Learning Milos Hauskrecht 539 Sennott Square, x5 Administration Instructor: Milos Hauskrecht 539 Sennott Square, x5 TA: Zhipeng (Patrick) Luo 59 Sennott Square Office hours: TBA 1
2 Administration Study material Handouts, your notes and course readings Primary textbook: Chris. Bishop. Pattern Recognition and Machine Learning. Springer,. Administration Study material Other books: K. Murphy. Machine Learning: A probabilistic perspective, MIT Press, 1. J. Han, M. Kamber. Data Mining. Morgan Kauffman, 11. Friedman, Hastie, Tibshirani. Elements of statistical learning. Springer, nd edition, 11. Koller, Friedman. Probabilistic graphical models. MIT Press, 9. Duda, Hart, Stork. Pattern classification. nd edition. J Wiley and Sons,. T. Mitchell. Machine Learning. McGraw Hill, 1997.
3 Administration Homeworks: weekly Programming tool: Matlab (CSSD machines and labs) Matlab Tutorial: next week Exams: Midterm (March) Final (April 1317) Final project: Written report + Oral presentation (April ) Lectures: Attendance and Activity Tentative topics Introduction to Machine Learning Density estimation. Supervised Learning. Linear models for regression and classification. Multilayer neural networks. Support vector machines. Kernel methods. Unsupervised Learning. Learning Bayesian networks. Latent variable models. Expectation maximization. Clustering 3
4 Tentative topics (cont) Dimensionality reduction. Feature extraction. Principal component analysis (PCA) Ensemble methods. Mixture models. Bagging and boosting. Reinforcement learning Machine Learning The field of machine learning studies the design of computer programs (agents) capable of learning from past experience or adapting to changes in the environment The need for building agents capable of learning is everywhere predictions in medicine, text and web page classification, speech recognition, image/text retrieval, commercial software
5 Learning Learning process: Learner (a computer program) processes data D representing past experiences and tries to either develop an appropriate response to future data, or describe in some meaningful way the data seen Example: Learner sees a set of patient cases (patient records) with corresponding diagnoses. It can either try: to predict the presence of a disease for future patients describe the dependencies between diseases, symptoms Types of learning Supervised learning Learning mapping between input x and desired output y Teacher gives me y s for the learning purposes Unsupervised learning Learning relations between data components No specific outputs given by a teacher Reinforcement learning Learning mapping between input x and desired output y Critic does not give me y s but instead a signal (reinforcement) of how good my answer was Other types of learning: Concept learning, Active learning, Transfer learning, Deep learning 5
6 Supervised learning Data: D { d1, d,.., dn} a set of n examples d i xi, yi is input vector, and y is desired output (given by a teacher) x i Objective: learn the mapping f : X Y s.t. yi f ( xi ) for all i 1,.., n Two types of problems: Regression: X discrete or continuous Y is continuous Classification: X discrete or continuous Y is discrete Supervised learning examples Regression: Y is continuous Debt/equity Earnings Future product orders company stock price Classification: Y is discrete Label 3 Handwritten digit (array of,1s)
7 Unsupervised learning Data: D { d1, d,.., dn} di xi vector of values No target value (output) y Objective: learn relations between samples, components of samples Types of problems: Clustering Group together similar examples, e.g. patient cases Density estimation Model probabilistically the population of samples Unsupervised learning example Clustering. Group together similar examples di x i
8 Unsupervised learning example Clustering. Group together similar examples di x i Unsupervised learning example Density estimation. We want to build the probability model P(x) of a population from which we draw examples d x i i
9 Unsupervised learning. Density estimation A probability density of a point in the two dimensional space Model used here: Mixture of Gaussians Reinforcement learning We want to learn: f : X Y We see samples of x but not y Instead of y we get a feedback (reinforcement) from a critic about how good our output was input sample Learner output reinforcement Critic The goal is to select outputs that lead to the best reinforcement 9
10 Learning: first look Assume we see examples of pairs (x, y) in D and we want to learn the mapping f : X Y to predict y for some future x We get the data D  what should we do? 1 y x Learning: first look Problem: many possible functions f : X Y exists for representing the mapping between x and y Which one to choose? Many examples still unseen! y x 1
11 Learning: first look Solution: make an assumption about the model, say, f ( x) ax b y x Learning: first look Choosing a parametric model or a set of models is not enough Still too many functions f ( x) ax b One for every pair of parameters a, b y x 11
12 Fitting the data to the model We want the best set of model parameters Objective: Find parameters that: reduce the misfit between the model M and observed data D Or, (in other words) explain the data the best Objective function: Error function: Measures the misfit between D and M Examples of error functions: n Average Square Error 1 ( yi f ( xi )) n i 1 n Average misclassification error 1 1 Average # of misclassified cases n i 1 y f ( x ) i i Fitting the data to the model Linear regression problem Minimizes the squared error function for the linear model n minimizes 1 ( yi f ( xi )) n i 1 1 y x 1
13 Learning: summary Three basic steps: Select a model or a set of models (with parameters) E.g. f ( x) ax b Select the error function to be optimized n E.g. 1 ( yi f ( xi )) n i 1 Find the set of parameters optimizing the error function The model and parameters with the smallest error represent the best fit of the model to the data But there are problems one must be careful about Learning Problem We fit the model based on past examples observed in D But ultimately we are interested in learning the mapping that performs well on the whole population of examples Training data: Data used to fit the parameters of the model Training error: n 1 Error ( D, f ) ( yi f ( xi )) n i 1 True (generalization) error (over the whole population): E ( x, y )[( y f ( x)) ] Mean squared error Training error tries to approximate the true error!!!! Does a good training error imply a good generalization error? 13
14 Overfitting Assume we have a set of 1 points and we consider polynomial functions as our possible models Overfitting Fitting a linear function with the square error Error is nonzero
15 Overfitting Linear vs. cubic polynomial Higher order polynomial leads to a better fit, smaller error Overfitting Is it always good to minimize the error of the observed data?
16 Overfitting For 1 data points, the degree 9 polynomial gives a perfect fit (Lagrange interpolation). Error is zero. Is it always good to minimize the training error? Overfitting For 1 data points, degree 9 polynomial gives a perfect fit (Lagrange interpolation). Error is zero. Is it always good to minimize the training error? NO!! More important: How do we perform on the unseen data?
17 Overfitting Situation when the training error is low and the generalization error is high. Causes of the phenomenon: Model with a large number of parameters (degrees of freedom) Small data size (as compared to the complexity of the model) How to evaluate the learner s performance? Generalization error is the true error for the population of examples we would like to optimize E ( x, y )[( y f ( x)) ] But it cannot be computed exactly Sample mean only approximates the true mean Optimizing (mean) training error can lead to the overfit, i.e. training error may not reflect properly the generalization error 1 ( yi f ( xi )) n i 1,.. n So how to test the generalization error? 17
18 How to evaluate the learner s performance? Generalization error is the true error for the population of examples we would like to optimize Sample mean only approximates it Two ways to assess the generalization error is: Theoretical: Law of Large numbers statistical bounds on the difference between the true and sample mean errors Practical: Use a separate data set with m data samples to test the model (Mean) test error 1 ( y j f ( x j )) m j 1,.. m Testing of learning models Simple holdout method Divide the data to the training and test data Dataset Training set Testing set Evaluate Learn (fit) Predictive model Typically /3 training and 1/3 testing 1
19 Testing of models Data set Training set Test set case control case control Learn on the training set The model Evaluate on the test set Basic experimental setup to test the learner s performance 1. Take a dataset D and divide it into: Training data set Testing data set. Use the training set and your favorite ML algorithm to train the learner 3. Test (evaluate) the learner on the testing data set The results on the testing set can be used to compare different learners powered with different models and learning algorithms 19
20 A learning system: basic cycle 1. Data: D { d1, d,.., dn}. Model selection: Select a model or a set of models (with parameters) E.g. y ax b 3. Choose the objective function n 1 Squared error ( yi f ( xi )) n i 1. Learning: Find the set of parameters optimizing the error function The model and parameters with the smallest error 5. Testing: Apply the learned model to new data E.g. predict ys for new inputs x using learned f (x) Evaluate on the test data
CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht milos@cs.pitt.edu 539 Sennott
More information203.4770: Introduction to Machine Learning Dr. Rita Osadchy
203.4770: Introduction to Machine Learning Dr. Rita Osadchy 1 Outline 1. About the Course 2. What is Machine Learning? 3. Types of problems and Situations 4. ML Example 2 About the course Course Homepage:
More informationC19 Machine Learning
C9 Machine Learning 8 Lectures Hilary Term 25 2 Tutorial Sheets A. Zisserman Overview: Supervised classification perceptron, support vector machine, loss functions, kernels, random forests, neural networks
More informationBIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics http://www.ccmb.med.umich.edu/node/1376
Course Director: Dr. Kayvan Najarian (DCM&B, kayvan@umich.edu) Lectures: Labs: Mondays and Wednesdays 9:00 AM 10:30 AM Rm. 2065 Palmer Commons Bldg. Wednesdays 10:30 AM 11:30 AM (alternate weeks) Rm.
More informationEECS 445: Introduction to Machine Learning Winter 2015
Instructor: Prof. Jenna Wiens Office: 3609 BBB wiensj@umich.edu EECS 445: Introduction to Machine Learning Winter 2015 Graduate Student Instructor: Srayan Datta Office: 3349 North Quad (**office hours
More informationIntroduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research mohri@cims.nyu.edu
Introduction to Machine Learning Lecture 1 Mehryar Mohri Courant Institute and Google Research mohri@cims.nyu.edu Introduction Logistics Prerequisites: basics concepts needed in probability and statistics
More informationIntroduction to Machine Learning. Speaker: Harry Chao Advisor: J.J. Ding Date: 1/27/2011
Introduction to Machine Learning Speaker: Harry Chao Advisor: J.J. Ding Date: 1/27/2011 1 Outline 1. What is machine learning? 2. The basic of machine learning 3. Principles and effects of machine learning
More informationMachine Learning. CUNY Graduate Center, Spring 2013. Professor Liang Huang. huang@cs.qc.cuny.edu
Machine Learning CUNY Graduate Center, Spring 2013 Professor Liang Huang huang@cs.qc.cuny.edu http://acl.cs.qc.edu/~lhuang/teaching/machinelearning Logistics Lectures M 9:3011:30 am Room 4419 Personnel
More informationGovernment of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence
Government of Russian Federation Federal State Autonomous Educational Institution of High Professional Education National Research University «Higher School of Economics» Faculty of Computer Science School
More informationDesigning a learning system
Lecture Designing a learning system Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square, x48845 http://.cs.pitt.edu/~milos/courses/cs750/ Design of a learning system (first vie) Application or Testing
More informationFaculty of Science School of Mathematics and Statistics
Faculty of Science School of Mathematics and Statistics MATH5836 Data Mining and its Business Applications Semester 1, 2014 CRICOS Provider No: 00098G MATH5836 Course Outline Information about the course
More informationCSCI599 DATA MINING AND STATISTICAL INFERENCE
CSCI599 DATA MINING AND STATISTICAL INFERENCE Course Information Course ID and title: CSCI599 Data Mining and Statistical Inference Semester and day/time/location: Spring 2013/ Mon/Wed 3:304:50pm Instructor:
More informationPrinciples of Dat Da a t Mining Pham Tho Hoan hoanpt@hnue.edu.v hoanpt@hnue.edu. n
Principles of Data Mining Pham Tho Hoan hoanpt@hnue.edu.vn References [1] David Hand, Heikki Mannila and Padhraic Smyth, Principles of Data Mining, MIT press, 2002 [2] Jiawei Han and Micheline Kamber,
More informationADVANCED MACHINE LEARNING. Introduction
1 1 Introduction Lecturer: Prof. Aude Billard (aude.billard@epfl.ch) Teaching Assistants: Guillaume de Chambrier, Nadia Figueroa, Denys Lamotte, Nicola Sommer 2 2 Course Format Alternate between: Lectures
More informationCPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015
CPSC 340: Machine Learning and Data Mining Mark Schmidt University of British Columbia Fall 2015 Outline 1) Intro to Machine Learning and Data Mining: Big data phenomenon and types of data. Definitions
More informationIntroduction to machine learning and pattern recognition Lecture 1 Coryn BailerJones
Introduction to machine learning and pattern recognition Lecture 1 Coryn BailerJones http://www.mpia.de/homes/calj/mlpr_mpia2008.html 1 1 What is machine learning? Data description and interpretation
More informationMachine Learning: Statistical Methods  I
Machine Learning: Statistical Methods  I Introduction  15.04.2009 Stefan Roth, 15.04.2009 Department of Computer Science GRIS Machine Learning I Lecturer: Prof. Stefan Roth, Ph.D.
More information270107  MD  Data Mining
Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 015 70  FIB  Barcelona School of Informatics 715  EIO  Department of Statistics and Operations Research 73  CS  Department of
More informationHT2015: SC4 Statistical Data Mining and Machine Learning
HT2015: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Bayesian Nonparametrics Parametric vs Nonparametric
More informationClassifiers & Classification
Classifiers & Classification Forsyth & Ponce Computer Vision A Modern Approach chapter 22 Pattern Classification Duda, Hart and Stork School of Computer Science & Statistics Trinity College Dublin Dublin
More informationIntroduction to Machine Learning
Introduction to Machine Learning Brown University CSCI 1950F, Spring 2012 Instructor: Erik Sudderth Graduate TAs: Dae Il Kim & Ben Swanson Head Undergraduate TA: William Allen Undergraduate TAs: Soravit
More informationLearning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal
Learning Example Chapter 18: Learning from Examples 22c:145 An emergency room in a hospital measures 17 variables (e.g., blood pressure, age, etc) of newly admitted patients. A decision is needed: whether
More informationIntroduction to Machine Learning Using Python. Vikram Kamath
Introduction to Machine Learning Using Python Vikram Kamath Contents: 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. Introduction/Definition Where and Why ML is used Types of Learning Supervised Learning Linear Regression
More informationMaschinelles Lernen mit MATLAB
Maschinelles Lernen mit MATLAB Jérémy Huard Applikationsingenieur The MathWorks GmbH 2015 The MathWorks, Inc. 1 Machine Learning is Everywhere Image Recognition Speech Recognition Stock Prediction Medical
More informationData Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
More informationSupervised and unsupervised learning  1
Chapter 3 Supervised and unsupervised learning  1 3.1 Introduction The science of learning plays a key role in the field of statistics, data mining, artificial intelligence, intersecting with areas in
More informationData, Measurements, Features
Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are
More informationMachine Learning. 01  Introduction
Machine Learning 01  Introduction Machine learning course One lecture (Wednesday, 9:30, 346) and one exercise (Monday, 17:15, 203). Oral exam, 20 minutes, 5 credit points. Some basic mathematical knowledge
More informationMachine Learning for Data Science (CS4786) Lecture 1
Machine Learning for Data Science (CS4786) Lecture 1 TuTh 10:10 to 11:25 AM Hollister B14 Instructors : Lillian Lee and Karthik Sridharan ROUGH DETAILS ABOUT THE COURSE Diagnostic assignment 0 is out:
More informationAn Introduction to Data Mining
An Introduction to Intel Beijing wei.heng@intel.com January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
More informationMachine Learning and Data Mining. Fundamentals, robotics, recognition
Machine Learning and Data Mining Fundamentals, robotics, recognition Machine Learning, Data Mining, Knowledge Discovery in Data Bases Their mutual relations Data Mining, Knowledge Discovery in Databases,
More informationData Mining Chapter 6: Models and Patterns Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University
Data Mining Chapter 6: Models and Patterns Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Models vs. Patterns Models A model is a high level, global description of a
More informationReference Books. Data Mining. Supervised vs. Unsupervised Learning. Classification: Definition. Classification knearest neighbors
Classification knearest neighbors Data Mining Dr. Engin YILDIZTEPE Reference Books Han, J., Kamber, M., Pei, J., (2011). Data Mining: Concepts and Techniques. Third edition. San Francisco: Morgan Kaufmann
More informationCS 207  Data Science and Visualization Spring 2016
CS 207  Data Science and Visualization Spring 2016 Professor: Sorelle Friedler sorelle@cs.haverford.edu An introduction to techniques for the automated and humanassisted analysis of data sets. These
More informationSyllabus for MATH 191 MATH 191 Topics in Data Science: Algorithms and Mathematical Foundations Department of Mathematics, UCLA Fall Quarter 2015
Syllabus for MATH 191 MATH 191 Topics in Data Science: Algorithms and Mathematical Foundations Department of Mathematics, UCLA Fall Quarter 2015 Lecture: MWF: 1:001:50pm, GEOLOGY 4645 Instructor: Mihai
More informationNew Ensemble Combination Scheme
New Ensemble Combination Scheme Namhyoung Kim, Youngdoo Son, and Jaewook Lee, Member, IEEE Abstract Recently many statistical learning techniques are successfully developed and used in several areas However,
More informationMachine learning for algo trading
Machine learning for algo trading An introduction for nonmathematicians Dr. Aly Kassam Overview High level introduction to machine learning A machine learning bestiary What has all this got to do with
More informationCOLLEGE OF SCIENCE. John D. Hromi Center for Quality and Applied Statistics
ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE John D. Hromi Center for Quality and Applied Statistics NEW (or REVISED) COURSE: COSSTAT747 Principles of Statistical Data Mining
More informationSupervised Learning (Big Data Analytics)
Supervised Learning (Big Data Analytics) Vibhav Gogate Department of Computer Science The University of Texas at Dallas Practical advice Goal of Big Data Analytics Uncover patterns in Data. Can be used
More informationMS1b Statistical Data Mining
MS1b Statistical Data Mining Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Administrivia and Introduction Course Structure Syllabus Introduction to
More informationKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 11 Sajjad Haider Fall 2013 1 Supervised Learning Process Data Collection/Preparation Data Cleaning Discretization Supervised/Unuspervised Identification of right
More informationIntroduction to Machine Learning. Rob Schapire Princeton University. schapire
Introduction to Machine Learning Rob Schapire Princeton University www.cs.princeton.edu/ schapire Machine Learning studies how to automatically learn to make accurate predictions based on past observations
More information6.2.8 Neural networks for data mining
6.2.8 Neural networks for data mining Walter Kosters 1 In many application areas neural networks are known to be valuable tools. This also holds for data mining. In this chapter we discuss the use of neural
More informationStatistics W4240: Data Mining Columbia University Spring, 2014
Statistics W4240: Data Mining Columbia University Spring, 2014 Version: January 30, 2014. The syllabus is subject to change, so look for the version with the most recent date. Course Description Massive
More informationMachine Learning CS 6830. Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu
Machine Learning CS 6830 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu What is Learning? MerriamWebster: learn = to acquire knowledge, understanding, or skill
More information10601. Machine Learning. http://www.cs.cmu.edu/afs/cs/academic/class/10601f10/index.html
10601 Machine Learning http://www.cs.cmu.edu/afs/cs/academic/class/10601f10/index.html Course data All uptodate info is on the course web page: http://www.cs.cmu.edu/afs/cs/academic/class/10601f10/index.html
More informationNeural Networks. CAP5610 Machine Learning Instructor: GuoJun Qi
Neural Networks CAP5610 Machine Learning Instructor: GuoJun Qi Recap: linear classifier Logistic regression Maximizing the posterior distribution of class Y conditional on the input vector X Support vector
More informationStatistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
More informationSupport Vector Machines with Clustering for Training with Very Large Datasets
Support Vector Machines with Clustering for Training with Very Large Datasets Theodoros Evgeniou Technology Management INSEAD Bd de Constance, Fontainebleau 77300, France theodoros.evgeniou@insead.fr Massimiliano
More informationToday. COM 578 Empirical Methods in Machine Learning and Data Mining. Staff, Office Hours, Topics. Dull organizational stuff.
COM 578 Empirical Methods in Machine Learning and Data Mining Rich Caruana http://www.cs cs.cornell.edu/courses/cs578/2007fa Today Dull organizational stuff Course Summary Grading Office hours Homework
More informationData Mining  Evaluation of Classifiers
Data Mining  Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
More informationIntroduction to Artificial Neural Networks. Introduction to Artificial Neural Networks
Introduction to Artificial Neural Networks v.3 August Michel Verleysen Introduction  Introduction to Artificial Neural Networks p Why ANNs? p Biological inspiration p Some examples of problems p Historical
More informationData Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification
More informationMA2823: Foundations of Machine Learning
MA2823: Foundations of Machine Learning École Centrale Paris Fall 2015 ChloéAgathe Azencot Centre for Computational Biology, Mines ParisTech chloe agathe.azencott@mines paristech.fr TAs: Jiaqian Yu jiaqian.yu@centralesupelec.fr
More informationKATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics
ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM KATE GLEASON COLLEGE OF ENGINEERING John D. Hromi Center for Quality and Applied Statistics NEW (or REVISED) COURSE (KGCOE CQAS 747 Principles of
More informationCSci 538 Articial Intelligence (Machine Learning and Data Analysis)
CSci 538 Articial Intelligence (Machine Learning and Data Analysis) Course Syllabus Fall 2015 Instructor Derek Harter, Ph.D., Associate Professor Department of Computer Science Texas A&M University  Commerce
More informationIntroduction to Machine Learning
Introduction to Machine Learning Prof. Alexander Ihler Prof. Max Welling icamp Tutorial July 22 What is machine learning? The ability of a machine to improve its performance based on previous results:
More informationClass Overview and General Introduction to Machine Learning
Class Overview and General Introduction to Machine Learning Piyush Rai www.cs.utah.edu/~piyush CS5350/6350: Machine Learning August 23, 2011 (CS5350/6350) Intro to ML August 23, 2011 1 / 25 Course Logistics
More informationSearch Taxonomy. Web Search. Search Engine Optimization. Information Retrieval
Information Retrieval INFO 4300 / CS 4300! Retrieval models Older models» Boolean retrieval» Vector Space model Probabilistic Models» BM25» Language models Web search» Learning to Rank Search Taxonomy!
More informationARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)
ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING) Gabriela Ochoa http://www.cs.stir.ac.uk/~goc/ OUTLINE Preliminaries Classification and Clustering Applications
More informationFMA901F: Machine Learning Lecture 1: Introduction and Overview. Cristian Sminchisescu
FMA901F: Machine Learning Lecture 1: Introduction and Overview Cristian Sminchisescu Machine Learning Course Lecturer Prof. Cristian Sminchisescu (about me in 2 lines: PhD Applied Math & CS, INRIA, 99
More informationMachine Learning Capacity and Performance Analysis and R
Machine Learning and R May 3, 11 30 25 15 10 5 25 15 10 5 30 25 15 10 5 0 2 4 6 8 101214161822 0 2 4 6 8 101214161822 0 2 4 6 8 101214161822 100 80 60 40 100 80 60 40 100 80 60 40 30 25 15 10 5 25 15 10
More informationMACHINE LEARNING IN HIGH ENERGY PHYSICS
MACHINE LEARNING IN HIGH ENERGY PHYSICS LECTURE #1 Alex Rogozhnikov, 2015 INTRO NOTES 4 days two lectures, two practice seminars every day this is introductory track to machine learning kaggle competition!
More informationSupport Vector Machine (SVM)
Support Vector Machine (SVM) CE725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Margin concept HardMargin SVM SoftMargin SVM Dual Problems of HardMargin
More informationIntroduction to Machine Learning
Introduction to Machine Learning Isabelle Guyon isabelle@clopinet.com What is Machine Learning? Learning algorithm Trained machine TRAINING DATA Answer Query What for? Classification Time series prediction
More informationClassification: Basic Concepts, Decision Trees, and Model Evaluation. General Approach for Building Classification Model
10 10 Classification: Basic Concepts, Decision Trees, and Model Evaluation Dr. Hui Xiong Rutgers University Introduction to Data Mining 1//009 1 General Approach for Building Classification Model Tid Attrib1
More informationPattern Recognition: An Overview. Prof. Richard Zanibbi
Pattern Recognition: An Overview Prof. Richard Zanibbi Pattern Recognition (One) Definition The identification of implicit objects, types or relationships in raw data by an animal or machine i.e. recognizing
More informationConcepts in Machine Learning, Unsupervised Learning & Astronomy Applications
Data Mining In Modern Astronomy Sky Surveys: Concepts in Machine Learning, Unsupervised Learning & Astronomy Applications ChingWa Yip cwyip@pha.jhu.edu; Bloomberg 518 Human are Great Pattern Recognizers
More informationKnearestneighbor: an introduction to machine learning
Knearestneighbor: an introduction to machine learning Xiaojin Zhu jerryzhu@cs.wisc.edu Computer Sciences Department University of Wisconsin, Madison slide 1 Outline Types of learning Classification:
More informationMachine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer
Machine Learning Chapter 18, 21 Some material adopted from notes by Chuck Dyer What is learning? Learning denotes changes in a system that... enable a system to do the same task more efficiently the next
More informationMACHINE LEARNING AN INTRODUCTION
AN INTRODUCTION JOSEFIN ROSÉN, SENIOR ANALYTICAL EXPERT, SAS INSTITUTE JOSEFIN.ROSEN@SAS.COM TWITTER: @ROSENJOSEFIN AGENDA What is machine learning? When, where and how is machine learning used? Exemple
More informationIntroduction to Pattern Recognition
Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)
More informationData Mining Practical Machine Learning Tools and Techniques
Ensemble learning Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 8 of Data Mining by I. H. Witten, E. Frank and M. A. Hall Combining multiple models Bagging The basic idea
More informationLecture 9: Introduction to Pattern Analysis
Lecture 9: Introduction to Pattern Analysis g Features, patterns and classifiers g Components of a PR system g An example g Probability definitions g Bayes Theorem g Gaussian densities Features, patterns
More informationMachine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)
Machine Learning Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos) What Is Machine Learning? A computer program is said to learn from experience E with respect to some class of
More informationDetection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup
Network Anomaly Detection A Machine Learning Perspective Dhruba Kumar Bhattacharyya Jugal Kumar KaKta»C) CRC Press J Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor
More informationIntroduction to Machine Learning
Introduction to Machine Learning CS 590 and STAT 598A, Spring 2010 Instructor: S.V. N. Vishwanathan (email: vishy) http://www.stat.purdue.edu/~vishy/introml/introml.html January 12, 2010 S.V. N. Vishwanathan
More informationMachine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague. http://ida.felk.cvut.
Machine Learning and Data Analysis overview Jiří Kléma Department of Cybernetics, Czech Technical University in Prague http://ida.felk.cvut.cz psyllabus Lecture Lecturer Content 1. J. Kléma Introduction,
More informationLearning outcomes. Knowledge and understanding. Competence and skills
Syllabus Master s Programme in Statistics and Data Mining 120 ECTS Credits Aim The rapid growth of databases provides scientists and business people with vast new resources. This programme meets the challenges
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
More informationEnsemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 20150305
Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 20150305 Roman Kern (KTI, TU Graz) Ensemble Methods 20150305 1 / 38 Outline 1 Introduction 2 Classification
More informationA Simple Introduction to Support Vector Machines
A Simple Introduction to Support Vector Machines Martin Law Lecture for CSE 802 Department of Computer Science and Engineering Michigan State University Outline A brief history of SVM Largemargin linear
More informationDATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
More informationBig Data Analytics for SCADA
ENERGY Big Data Analytics for SCADA Machine Learning Models for Fault Detection and Turbine Performance Elizabeth Traiger, Ph.D., M.Sc. 14 April 2016 1 SAFER, SMARTER, GREENER Points to Convey Big Data
More informationPattern Analysis. Logistic Regression. 12. Mai 2009. Joachim Hornegger. Chair of Pattern Recognition Erlangen University
Pattern Analysis Logistic Regression 12. Mai 2009 Joachim Hornegger Chair of Pattern Recognition Erlangen University Pattern Analysis 2 / 43 1 Logistic Regression Posteriors and the Logistic Function Decision
More informationClassification algorithm in Data mining: An Overview
Classification algorithm in Data mining: An Overview S.Neelamegam #1, Dr.E.Ramaraj *2 #1 M.phil Scholar, Department of Computer Science and Engineering, Alagappa University, Karaikudi. *2 Professor, Department
More informationNovember 28 th, Carlos Guestrin. Lower dimensional projections
PCA Machine Learning 070/578 Carlos Guestrin Carnegie Mellon University November 28 th, 2007 Lower dimensional projections Rather than picking a subset of the features, we can new features that are combinations
More informationRegression Using Support Vector Machines: Basic Foundations
Regression Using Support Vector Machines: Basic Foundations Technical Report December 2004 Aly Farag and Refaat M Mohamed Computer Vision and Image Processing Laboratory Electrical and Computer Engineering
More informationMachine Learning and Statistics: What s the Connection?
Machine Learning and Statistics: What s the Connection? Institute for Adaptive and Neural Computation School of Informatics, University of Edinburgh, UK August 2006 Outline The roots of machine learning
More informationCOMP 551 Applied Machine Learning Lecture 6: Performance evaluation. Model assessment and selection.
COMP 551 Applied Machine Learning Lecture 6: Performance evaluation. Model assessment and selection. Instructor: (jpineau@cs.mcgill.ca) Class web page: www.cs.mcgill.ca/~jpineau/comp551 Unless otherwise
More informationLearning outcomes. Knowledge and understanding. Ability and Competences. Evaluation capability and scientific approach
Syllabus Master s Programme in Statistics and Data Mining 120 ECTS Credits Aim The rapid growth of databases provides scientists and business people with vast new resources. This programme meets the challenges
More informationLecture/Recitation Topic SMA 5303 L1 Sampling and statistical distributions
SMA 50: Statistical Learning and Data Mining in Bioinformatics (also listed as 5.077: Statistical Learning and Data Mining ()) Spring Term (Feb May 200) Faculty: Professor Roy Welsch Wed 0 Feb 7:008:0
More informationMachine Learning  Spring 2012 Problem Set 3
10701 Machine Learning  Spring 2012 Problem Set 3 Out: February 29th, 1:30pm In: March 19h, 1:30pm TA: HaiSon Le (hple@cs.cmu.edu) School Of Computer Science, Carnegie Mellon University Homework will
More informationClassification Problems
Classification Read Chapter 4 in the text by Bishop, except omit Sections 4.1.6, 4.1.7, 4.2.4, 4.3.3, 4.3.5, 4.3.6, 4.4, and 4.5. Also, review sections 1.5.1, 1.5.2, 1.5.3, and 1.5.4. Classification Problems
More informationArtificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence
Artificial Neural Networks and Support Vector Machines CS 486/686: Introduction to Artificial Intelligence 1 Outline What is a Neural Network?  Perceptron learners  Multilayer networks What is a Support
More information1 Review of Least Squares Solutions to Overdetermined Systems
cs4: introduction to numerical analysis /9/0 Lecture 7: Rectangular Systems and Numerical Integration Instructor: Professor Amos Ron Scribes: Mark Cowlishaw, Nathanael Fillmore Review of Least Squares
More informationKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Evaluating the Accuracy of a Classifier Holdout, random subsampling, crossvalidation, and the bootstrap are common techniques for
More informationPredictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD
Predictive Analytics Techniques: What to Use For Your Big Data March 26, 2014 Fern Halper, PhD Presenter Proven Performance Since 1995 TDWI helps business and IT professionals gain insight about data warehousing,
More informationInformation Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)
More informationSupport Vector Machines Explained
March 1, 2009 Support Vector Machines Explained Tristan Fletcher www.cs.ucl.ac.uk/staff/t.fletcher/ Introduction This document has been written in an attempt to make the Support Vector Machines (SVM),
More information