An Introduction to Statistical Machine Learning - Overview -

Similar documents
Statistical Machine Learning from Data

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

Principles of Data Mining by Hand&Mannila&Smyth

NEURAL NETWORKS A Comprehensive Foundation

Neural Networks and Support Vector Machines

A Simple Introduction to Support Vector Machines

COLLEGE OF SCIENCE. John D. Hromi Center for Quality and Applied Statistics

Class #6: Non-linear classification. ML4Bio 2012 February 17 th, 2012 Quaid Morris

HT2015: SC4 Statistical Data Mining and Machine Learning

Learning to Process Natural Language in Big Data Environment

Data Mining Practical Machine Learning Tools and Techniques

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

MS1b Statistical Data Mining

Application of Event Based Decision Tree and Ensemble of Data Driven Methods for Maintenance Action Recommendation

Scalable Developments for Big Data Analytics in Remote Sensing

Machine Learning Introduction

An Overview of Knowledge Discovery Database and Data mining Techniques

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

L25: Ensemble learning

Understanding Network Traffic An Introduction to Machine Learning in Networking

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Towards better accuracy for Spam predictions

CS 688 Pattern Recognition Lecture 4. Linear Models for Classification

Car Insurance. Prvák, Tomi, Havri

: Introduction to Machine Learning Dr. Rita Osadchy

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague.

STA 4273H: Statistical Machine Learning

Supervised Learning (Big Data Analytics)

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

Evaluation of Machine Learning Techniques for Green Energy Prediction

Analysis of kiva.com Microlending Service! Hoda Eydgahi Julia Ma Andy Bardagjy December 9, 2010 MAS.622j

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

A Learning Based Method for Super-Resolution of Low Resolution Images

Machine Learning with MATLAB David Willingham Application Engineer

Machine learning for algo trading

A Neural Support Vector Network Architecture with Adaptive Kernels. 1 Introduction. 2 Support Vector Machines and Motivations

An Introduction to Data Mining

Introduction to Machine Learning CMU-10701

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

Classification algorithm in Data mining: An Overview

High-Performance Signature Recognition Method using SVM

MA2823: Foundations of Machine Learning

Introduction to nonparametric regression: Least squares vs. Nearest neighbors

Support Vector Machine (SVM)

Novelty Detection in image recognition using IRF Neural Networks properties

Local features and matching. Image classification & object localization

Predict Influencers in the Social Network

Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing Classifier

The Artificial Prediction Market

ADVANCED MACHINE LEARNING. Introduction

HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION

Chapter 4: Artificial Neural Networks

An Introduction to Machine Learning

LCs for Binary Classification

Statistical Machine Learning

Support Vector Machine. Tutorial. (and Statistical Learning Theory)

Semi-Supervised Support Vector Machines and Application to Spam Filtering

Signature Segmentation from Machine Printed Documents using Conditional Random Field

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set

Acknowledgments. Data Mining with Regression. Data Mining Context. Overview. Colleagues

CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES

Chapter 6. The stacking ensemble approach

Lecture/Recitation Topic SMA 5303 L1 Sampling and statistical distributions

Classification Techniques for Remote Sensing

CSci 538 Articial Intelligence (Machine Learning and Data Analysis)

Simple and efficient online algorithms for real world applications

Statistics for BIG data

Operations Research and Knowledge Modeling in Data Mining

DATA MINING TECHNIQUES AND APPLICATIONS

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Intrusion Detection via Machine Learning for SCADA System Protection

One-Class Classifiers: A Review and Analysis of Suitability in the Context of Mobile-Masquerader Detection

FOREX TRADING PREDICTION USING LINEAR REGRESSION LINE, ARTIFICIAL NEURAL NETWORK AND DYNAMIC TIME WARPING ALGORITHMS

E-commerce Transaction Anomaly Classification

Machine Learning in Spam Filtering

Data Mining: An Overview. David Madigan

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Employer Health Insurance Premium Prediction Elliott Lui

Machine Learning.

Annotated bibliographies for presentations in MUMT 611, Winter 2006

Evaluation of Feature Selection Methods for Predictive Modeling Using Neural Networks in Credits Scoring

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Classification of Bad Accounts in Credit Card Industry

A Health Degree Evaluation Algorithm for Equipment Based on Fuzzy Sets and the Improved SVM

Writer Identification for Smart Meeting Room Systems

Steven C.H. Hoi. School of Computer Engineering Nanyang Technological University Singapore

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

Statistical Models in Data Mining

Training Methods for Adaptive Boosting of Neural Networks for Character Recognition

Big Data Analytics and Optimization

COMP 598 Applied Machine Learning Lecture 21: Parallelization methods for large-scale machine learning! Big Data by the numbers

Novelty Detection: A Review Part 1: Statistical Approaches

SURVEY OF TEXT CLASSIFICATION ALGORITHMS FOR SPAM FILTERING

Bayesian Machine Learning (ML): Modeling And Inference in Big Data. Zhuhua Cai Google, Rice University

Football Match Winner Prediction

The Data Mining Process

Dimension Reduction. Wei-Ta Chu 2014/10/22. Multimedia Content Analysis, CSIE, CCU

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang

Advanced Signal Processing and Digital Noise Reduction

Transcription:

An Introduction to Statistical Machine Learning - Overview - Samy Bengio bengio@idiap.ch Dalle Molle Institute for Perceptual Artificial Intelligence (IDIAP) CP 592, rue du Simplon 4 1920 Martigny, Switzerland http://www.idiap.ch/~bengio IDIAP - January 30, 2003

Outline 1. Introduction to Machine Learning 2. Introduction to the Statistical Learning Theory 3. Classical Models 4. Artificial Neural Networks and Gradient Descent 5. Gaussian Mixture Models and Expectation-Maximization 6. How to Program in Torch 7. Hidden Markov Models 8. Support Vector Machines 9. Ensemble Models Introduction to Machine Learning 2

Outline 10. Advanced Topics (a) Decision Trees (b) Feature Selection (c) Parameter Sharing (d) Identity Verification (e) Kalman Filters (f) Particle Filters Introduction to Machine Learning 3

Documentation Machine learning library: www.torch.ch Journals: Journal of Machine Learning Research Neural Computation IEEE Transactions on Neural Networks Conferences: NIPS: Neural Information Processing Systems COLT: Computational Learning Theory ICML: International Conference on Machine Learning ICANN & ESANN: 2 European conferences Books: Bishop, C. Neural Networks for Pattern Recognition, 1995. Vapnik, V. The Nature of Statistical Learning Theory, 1995. Introduction to Machine Learning 4

Documentation Search engines: NIPS online: http://nips.djvuzone.org NEC: http://citeseer.nj.nec.com/cs Other lecture notes: (some are in french...) Bengio, Y.: http://www.iro.umontreal.ca/ bengioy/ift6266/ Kegl, B.: http://www.iro.umontreal.ca/ kegl/ift6266/ Jordan, M.: http://www.cs.berkeley.edu/ jordan/courses/294-fall98/ Introduction to Machine Learning 5

Introduction to Machine Learning 1. What is Machine Learning? 2. Why is it difficult? 3. Basic Principles (a) Occam s Razor (b) Learning as a Search Problem 4. Types of Problems (a) Regression (b) Classification (c) Density Estimation 5. Applications 6. Documentation Introduction to Machine Learning 6

Statistical Learning Theory 1. The Data 2. The Function Space 3. The Loss Function 4. The Risk and the Empirical Risk 5. The Training Error 6. The Capacity 7. The Bias-Variance Dilemma 8. Regularization 9. Estimation of the Risk 10. Model Selection 11. Methodology Introduction to Machine Learning 7

Classical Models 1. Parametric or Not? 2. Histograms 3. Problem: Curse of Dimensionality 4. K Nearest Neighbors 5. Parzen Windows 6. Maximum Likelihood Approach 7. Bayes Decision and Bayes Classifiers 8. K-Means Introduction to Machine Learning 8

Artificial Neural Networks and Gradient Descent 1. Artificial Neural Networks 2. Multi Layer Perceptrons 3. Gradient Descent 4. ANN for Classification 5. Tricks of the Trade 6. Other ANN Models Introduction to Machine Learning 9

Gaussian Mixture Models and EM 1. Reminder: Basics on Probabilities 2. What is a GMM 3. Basics of EM 4. Convergence of EM 5. EM for GMMs 6. Initialization Introduction to Machine Learning 10

Hidden Markov Models 1. Markov Models 2. Hidden Markov Models 3. HMMs as Generative Processes 4. Markovian Assumptions for HMMs 5. The Likelihood given an HMM 6. EM Training for HMM 7. The Most Likely Path in an HMM 8. HMMs for Speech Recognition Introduction to Machine Learning 11

Support Vector Machines 1. The aim of SVMs 2. Linear SVMs and soft margin 3. Solving the SVMs problem using a Lagrangian method 4. Kernel trick 5. Support Vector Regression Introduction to Machine Learning 12

Ensemble Models 1. Basics of Ensembles 2. Bagging 3. AdaBoost Introduction to Machine Learning 13

Decision Trees 1. General Model 2. Partition Function 3. Training 4. Regularization 5. Regression: CART Introduction to Machine Learning 14

Feature Selection 1. Why Should We Do Select Features? 2. Broad Classes of Feature Selection 3. Wrapper Methods 4. L autre methode! Introduction to Machine Learning 15

Advanced Topic: Parameter Sharing 1. Time Delay Neural Networks 2. LeNet for Images 3. Parameter Sharing for GMMs and HMMs Introduction to Machine Learning 16

Advanced Topic: Identity Verification 1. Speaker Verification (a) Text Dependent, Text Independent, Customized Password (b) Maximum Likelihood and MAP Adaptation 2. Face Verification (a) Localization (b) Verification 3. Fusion and Confidence Estimation 4. Joint Models Introduction to Machine Learning 17