Learning is a very general term denoting the way in which agents:



Similar documents
What is Learning? CS 391L: Machine Learning Introduction. Raymond J. Mooney. Classification. Problem Solving / Planning / Control

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Machine Learning: Overview

Machine Learning Introduction

Machine Learning and Data Mining. Fundamentals, robotics, recognition

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

Data Mining Part 5. Prediction

CSC384 Intro to Artificial Intelligence

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Machine Learning and Statistics: What s the Connection?

Course 395: Machine Learning

Data Mining - Evaluation of Classifiers

Machine Learning using MapReduce

Supervised Learning (Big Data Analytics)

Introduction to Data Mining

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

1 What is Machine Learning?

DATA MINING TECHNIQUES AND APPLICATIONS

CSE 517A MACHINE LEARNING INTRODUCTION

Data, Measurements, Features

An Introduction to Data Mining

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

Introduction to Machine Learning and Data Mining. Prof. Dr. Igor Trajkovski

Learning outcomes. Knowledge and understanding. Competence and skills

Principles of Data Mining by Hand&Mannila&Smyth

Introduction to Machine Learning Using Python. Vikram Kamath

Using Data Mining for Mobile Communication Clustering and Characterization

Reference Books. Data Mining. Supervised vs. Unsupervised Learning. Classification: Definition. Classification k-nearest neighbors

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

Classification algorithm in Data mining: An Overview

Machine Learning for Data Science (CS4786) Lecture 1

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

Social Media Mining. Data Mining Essentials

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

MA2823: Foundations of Machine Learning

Data Mining and Machine Learning in Bioinformatics

Course: Model, Learning, and Inference: Lecture 5

Intrusion Detection. Jeffrey J.P. Tsai. Imperial College Press. A Machine Learning Approach. Zhenwei Yu. University of Illinois, Chicago, USA

An Overview of Knowledge Discovery Database and Data mining Techniques

Supervised and unsupervised learning - 1

Course Outline Department of Computing Science Faculty of Science. COMP Applied Artificial Intelligence (3,1,0) Fall 2015

8. Machine Learning Applied Artificial Intelligence

Data Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland

Sanjeev Kumar. contribute

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague.

Neural Networks and Support Vector Machines

Machine learning for algo trading

Machine Learning. Mausam (based on slides by Tom Mitchell, Oren Etzioni and Pedro Domingos)

EXECUTIVE SUPPORT SYSTEMS (ESS) STRATEGIC INFORMATION SYSTEM DESIGNED FOR UNSTRUCTURED DECISION MAKING THROUGH ADVANCED GRAPHICS AND COMMUNICATIONS *

Introduction to Data Mining

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

DATA MINING IN FINANCE

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

: Introduction to Machine Learning Dr. Rita Osadchy

Introduction. A. Bellaachia Page: 1

Data Mining. Nonlinear Classification

Machine Learning.

Attribution. Modified from Stuart Russell s slides (Berkeley) Parts of the slides are inspired by Dan Klein s lecture material for CS 188 (Berkeley)

from Larson Text By Susan Miertschin

MACHINE LEARNING BASICS WITH R

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

COMP 590: Artificial Intelligence

Cell Phone based Activity Detection using Markov Logic Network

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

Doctor of Philosophy in Computer Science

Nine Common Types of Data Mining Techniques Used in Predictive Analytics

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Big Data Analytics CSCI 4030

Knowledge Discovery and Data Mining

Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100

Université de Montpellier 2 Hugo Alatrista-Salas : hugo.alatrista-salas@teledetection.fr

Comparison of K-means and Backpropagation Data Mining Algorithms

Machine Learning Capacity and Performance Analysis and R

Introduction to Learning & Decision Trees

Artificial Intelligence

Data Mining: Overview. What is Data Mining?

Knowledge-based systems and the need for learning

Improving Decision Making and Managing Knowledge

Machine Learning. CS494/594, Fall :10 AM 12:25 PM Claxton 205. Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004

How To Use Neural Networks In Data Mining

Experiments in Web Page Classification for Semantic Web

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Data Isn't Everything

Using Artificial Intelligence to Manage Big Data for Litigation

Projektgruppe. Categorization of text documents via classification

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) ( ) Roman Kern. KTI, TU Graz

Car Insurance. Prvák, Tomi, Havri

Statistics Graduate Courses

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

Machine Learning and Data Mining. Regression Problem. (adapted from) Prof. Alexander Ihler

Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification

Overview. Swarms in nature. Fish, birds, ants, termites, Introduction to swarm intelligence principles Particle Swarm Optimization (PSO)

Transcription:

What is learning? Learning is a very general term denoting the way in which agents: Acquire and organize knowledge (by building, modifying and organizing internal representations of some external reality); Discover new knowledge and theories (by creating hypotheses that explain some data or phenomena); Acquire skills (by gradually improving their motor or cognitive skills through repeated practice, sometimes involving little or no conscious thought). Learning results in changes in the agent that improve its competence and/or efficiency. IAT-811 Metacreation 10

What is learning? Two complementary dimensions of learning: Competence: A system is improving its competence if it learns to solve a broader class of problems, and to make fewer mistakes in problem solving. Efficiency: A system is improving its efficiency, if it learns to solve the problems from its area of competence faster or by using fewer resources. Can we do that for artificial agents? IAT-811 Metacreation 11 What is machine learning? Artificial Intelligence is the Science and Engineering that is concerned with the theory and practice of developing systems that exhibit the characteristics we associate with intelligence in human/animal behavior Machine Learning is the domain of Artificial Intelligence which is concerned with building adaptive computer systems that are able to improve their competence and/or efficiency through learning from input data or from their own problem solving experience. Machine learning theories and techniques draw inspiration from many disciplines: Artificial Intelligence, Data Mining, Probability and Statistics, Information theory, Numerical optimization, Computational complexity theory, Control theory (adaptive), Psychology (developmental, cognitive), Neurobiology, Linguistics, Philosophy,... IAT-811 Metacreation 12

Machine Learning Definition: Learning is any process by which a system improves performance from experience. (Herbert Simon) More formally: Improve on task, T, with respect to performance metric, P, based on experience, E What is the task? Classification: Assign object/event to one of a given finite set of categories. Problem solving / planning / control: Performing actions in an environment in order to achieve a goal. Clustering: Partition unlabeled examples into disjoint subsets of clusters, such that: Examples within a cluster are very similar Examples in different clusters are very different Suppervised Learning Unsuppervised Learning IAT-811 Metacreation 13 Supervised vs. Unsupervised Learning Supervised learning: Inputs Supervisor Learner Desired output Output Error Unsupervised learning: Inputs Learner Output IAT-811 Metacreation 14

Examples Classification: The task: assign object/event to one of a given finite set of categories. Examples: Medical diagnosis, Fraud detection in e-commerce, Worm detection in network packets, Spam filtering in email, Recommended articles in a newspaper, Recommended books, movies, music, or jokes, Financial investments, DNA sequences, Spoken words, Handwritten letters, Astronomical images, Rhythm selection, Human trained filters,... Problem Solving / Planning / Control The task: performing actions in an environment in order to achieve a goal. Examples: Playing checkers, chess, backgammon,..., Balancing a pole, controlling a mobile robot,..., Driving a car or a jeep,..., Flying a plane, helicopter, or rocket,..., Controlling a character in a video game, Composing music in the style of Bach,..., Generating rythmic patterns,... Clustering: The task: creating classes for data and classifying them (Reducing the number of dimensions) Examples: Clustering news articles, Clustering sounds, Clustering images,... IAT-811 Metacreation 15 Defining a Learning System T: Playing checkers P: Percentages of games won against an arbitrary opponent E: Playing practice games against itself T: Recognizing hand-written words P: Percentage of words correctly classified E: Database of human-labeled images of handwritten words T: Driving on four-lane highways using vision sensors P: Average distance traveled before a human-judged error E: A sequence of images and steering commands recorded while observing a human driver. T: Categorize email messages as spam or legitimate. P: Percentage of email messages correctly classified. E: Database of emails, some with human-given labels IAT-811 Metacreation 16

Designing a Learning System The design process involves: Choosing exactly what is too be learned, i.e. the target function: learning can be viewed as using direct or indirect experience to approximate a chosen target function. Choosing how to represent the target function. Choosing the training experience (in the supervised case) or the input data (in the unsupervised case) Choosing a learning algorithm to infer the target function from the experience: function approximation can be viewed as a search through a space of hypotheses (representations of functions) for one that best fits a set of training data. Different learning methods assume different hypothesis spaces (representation languages) and/or employ different search techniques. IAT-811 Metacreation 17 Various Function Representations Numerical functions Linear regression Neural networks Support vector machines Symbolic functions Decision trees Rules in propositional logic or in first-order predicate logic Instance-based functions Nearest-neighbor Case-based Probabilistic Graphical Models Naïve Bayes Bayesian networks Hidden-Markov Models (HMMs) Probabilistic Context Free Grammars (PCFGs) Markov networks Agent(s) as a function: Reinforcement learning Learning to communicate Communicate to learn: distributed learning IAT-811 Metacreation 18

Various Search Algorithms Gradient descent Perceptron Backpropagation Dynamic Programming HMM Learning PCFG Learning Reinforcement learning Divide and Conquer Decision tree induction Rule learning Evolutionary Computation Genetic Algorithms (GAs) Genetic Programming (GP) Neuro-evolution IAT-811 Metacreation 19 Evaluation of Learning Systems Assessing solutions' correctness or quality: classification accuracy, goal achievement, number and relevance of clusters,... Experimental evaluation: Conduct controlled cross-validation experiments to compare various methods on a variety of benchmark datasets. Gather data on their performance, e.g. test accuracy, training-time, testing-time. Analyze differences for statistical significance. Theoretical evaluation: Analyze algorithms mathematically and prove theorems about their: Computational complexity Ability to fit training data Sample complexity (number of training examples needed to learn an accurate function) IAT-811 Metacreation 20

Nothing that is worth knowing can be taught. Oscar Wilde IAT-811 Metacreation 21