Machine Learning and Statistics: What s the Connection?



Similar documents
Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction

Learning is a very general term denoting the way in which agents:

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

MA2823: Foundations of Machine Learning

Machine Learning Introduction

Principles of Dat Da a t Mining Pham Tho Hoan hoanpt@hnue.edu.v hoanpt@hnue.edu. n

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Machine Learning: Overview

Assessing Data Mining: The State of the Practice

Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

Learning outcomes. Knowledge and understanding. Competence and skills

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Data Mining and Machine Learning in Bioinformatics

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Machine Learning. CS494/594, Fall :10 AM 12:25 PM Claxton 205. Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1

Introduction to Learning & Decision Trees

Principles of Data Mining by Hand&Mannila&Smyth

Perspectives on Data Mining

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Supervised Learning (Big Data Analytics)

Statistical Models in Data Mining

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining

CSE 517A MACHINE LEARNING INTRODUCTION

8. Machine Learning Applied Artificial Intelligence

An Introduction to Data Mining

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Prediction of Heart Disease Using Naïve Bayes Algorithm

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

: Introduction to Machine Learning Dr. Rita Osadchy

A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks

Introduction. A. Bellaachia Page: 1

Data, Measurements, Features

Using Data Mining for Mobile Communication Clustering and Characterization

Dynamic Data in terms of Data Mining Streams

Information Management course

Doctor of Philosophy in Computer Science

Data Warehousing and Data Mining in Business Applications

HT2015: SC4 Statistical Data Mining and Machine Learning

MS1b Statistical Data Mining

Attribution. Modified from Stuart Russell s slides (Berkeley) Parts of the slides are inspired by Dan Klein s lecture material for CS 188 (Berkeley)

Statistics for BIG data

Introduction to Pattern Recognition

Obtaining Value from Big Data

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM

Bayesian networks - Time-series models - Apache Spark & Scala

Azure Machine Learning, SQL Data Mining and R

Machine Learning using MapReduce

6.2.8 Neural networks for data mining

Neural Networks for Machine Learning. Lecture 13a The ups and downs of backpropagation

INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR.

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

Data Mining Solutions for the Business Environment

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Introduction to Data Mining

Machine Learning Capacity and Performance Analysis and R

Lecture Slides for INTRODUCTION TO. ETHEM ALPAYDIN The MIT Press, Lab Class and literature. Friday, , Harburger Schloßstr.

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Using Artificial Intelligence to Manage Big Data for Litigation

Machine Learning.

Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept

No BI without Machine Learning

Knowledge Discovery from Data Bases Proposal for a MAP-I UC

Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

Title. Introduction to Data Mining. Dr Arulsivanathan Naidoo Statistics South Africa. OECD Conference Cape Town 8-10 December 2010.

Lecture 1: Introduction to Reinforcement Learning

Activity recognition in ADL settings. Ben Kröse

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

The Data Mining Process

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier

DATA MINING TECHNIQUES AND APPLICATIONS

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining. Data Analysis and Knowledge Discovery

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Classification algorithm in Data mining: An Overview

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

Hidden Markov Models in Bioinformatics. By Máthé Zoltán Kőrösi Zoltán 2006

Data Mining course Master in Information Technologies Enginyeria Informàtica Tomàs Aluja. LIAM EIO. UPC Lluis Belanche LSI. UPC

CPSC 340: Machine Learning and Data Mining. Mark Schmidt University of British Columbia Fall 2015

Course: Model, Learning, and Inference: Lecture 5

Introduction to Data Mining Techniques

Machine Learning Overview

What is Learning? CS 391L: Machine Learning Introduction. Raymond J. Mooney. Classification. Problem Solving / Planning / Control

Final Project Report

Data Mining - Evaluation of Classifiers

Defending Networks with Incomplete Information: A Machine Learning Approach. Alexandre

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague.

FOREX TRADING PREDICTION USING LINEAR REGRESSION LINE, ARTIFICIAL NEURAL NETWORK AND DYNAMIC TIME WARPING ALGORITHMS

Predict Influencers in the Social Network

Machine Learning for Cyber Security Intelligence

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

Search and Data Mining: Techniques. Text Mining Anya Yarygina Boris Novikov

Transcription:

Machine Learning and Statistics: What s the Connection? Institute for Adaptive and Neural Computation School of Informatics, University of Edinburgh, UK August 2006

Outline The roots of machine learning (ML) Problems addressed by ML Some examples Machine learning and statistics

What is Machine Learning? The goal of machine learning is to build computer systems that can adapt and learn from their experience. (Dietterich, 1999) Machine learning usually refers to changes in systems that perform tasks associated with artificial intelligence (AI). Such tasks involve recognition, diagnosis, planning, robot control, prediction, etc. (Nilsson, 1996) Some reasons for adaptation: Some tasks can be hard to define except via examples Adaptation can improve a human-built system, or track changes over time Goals can be autonomous machine performance, or enabling humans to learn from data (data mining)

Roots of Machine Learning Statistical pattern recognition, adaptive control theory (EE) Artificial Intelligence: e.g. discovering rules using decision trees, inductive logic programming Brain models, e.g. neural networks Psychological models Statistics

Problems Addressed by Machine Learning Supervised Learning model p(y x): regression, classification, etc Unsupervised Learning model p(x): not just clustering! Reinforcement Learning Markov decision processes, POMDPs, planning.

Supervised Learning: Examples Star/galaxy classification Robot arm inverse dynamics Fayyad, Weir, Djorgovski (1993) Vijayakumar and Schaal (2000)

Example: Harmonizing Chorales in the Style of J S Bach Bach chorale K54

Hidden Markov Model q q q q π 0 0 1 A A A 2 3.. q Τ y y y y 0 1 2 3 y Τ Visible states are the melody (quarter notes) Hidden states are the harmony (which chord)

A supervised learning task Trained using labelled melody/harmony data Task: find Viterbi alignment for harmony given melody (or sample from P(harmony melody)) Actually uses HMMs for three subtasks: harmonic skeleton, chord skeleton, ornamentation Online demo at http://www.anc.inf.ed.ac.uk/demos/hmmbach/ Moray Allan and (NIPS 2004) Bach chorale K54, HMM harmonization

Sprites: Learning about Objects from Images An unsupervised task, latent variable modelling 1 2 3 4 5 6

Sprite: a graphics term for a cardboard cutout model having a shape and an appearance Also need to cope with image transformations (e.g. object motion) foreground (original) mask (original) foreground (transformed) mask (transformed)

For a single object p(x) = J p j p(x T j ) j=1 J can be very large. Model fitted using EM Note that multiple objects combine by occlusion With L objects, would naively need to enumerate O(J L ) settings of the latent variables Jojic and Frey (2001): variational inference; Williams and Titsias (2003), greedy algorithm

Movie

Other Examples Automatic recommender systems (collaborative filtering) Bioinformatics (sequence data, microarrays, etc) Information retrieval (Google) (Neonatal) condition monitoring Fraud detection Topic models for documents Statistical natural language processing (e.g. parsing) : RL: TD-gammon, elevator dispatching

Some Theoretical Issues in Machine Learning Computational Learning Theory (COLT) community Uniform convergence bounds, VC dimension, learning curves PAC-Bayesian analysis etc Prediction using expert advice (Cesa-Bianchi and Lugosi); e.g. combining/selecting predictions from a pool of experts Algorithms for and analysis of approximate inference methods in probabilistic graphical models

Machine Learning and Statistics Statistics Machine Learning probabilistic (graphical) models Same models, but different problems? Not all machine learning methods are based on probabilisic models, e.g. SVMs, non-negative matrix factorization

Some Differences Statistics: focus on understanding data in terms of models Statistics: interpretability, hypothesis testing Machine Learning: greater focus on prediction Machine Learning: focus on the analysis of learning algorithms (not just large dataset issues)

Summary Rich variety of problems addressed by machine learning in supervised, unsupervised and reinforcement learning Some differences, but a nexus wrt probabilistic models, graphical models Bidirectional exchanges of ideas and methods As statisticians, what is your view?

Slide from Rob Tibshirani (early 1990s) NEURAL NETS STATISTICS network model weights parameters learning fitting generalization test set performance supervised learning regression/classification unsupervised learning density estimation optimal brain damage model selection large grant = $100,000 large grant= $10,000 nice place to have a meeting: nice place to have a meeting: Snowbird, Utah, French Alps Las Vegas in August

What is data mining? Data mining is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data in novel ways that are both understandable and useful to the data owner. Hand, Mannila, Smyth We are drowning in information, but starving for knowledge! Naisbett [Data mining is the] extraction of interesting (non-trivial, implicit, previously unknown and potentially useful) information or patterns from data in large databases. Han