Object Recognition. Lecture 11, April 20 th, Lexing Xie. EE4830 Digital Image Processing

Similar documents
Local features and matching. Image classification & object localization

: Introduction to Machine Learning Dr. Rita Osadchy

Machine Learning. CUNY Graduate Center, Spring Professor Liang Huang.

Lecture 9: Introduction to Pattern Analysis

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow

Introduction to Pattern Recognition

Linear Threshold Units

Lecture 2: The SVM classifier

Probabilistic Latent Semantic Analysis (plsa)

Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang

Novelty Detection in image recognition using IRF Neural Networks properties

Artificial Neural Networks and Support Vector Machines. CS 486/686: Introduction to Artificial Intelligence

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

Big Data Analytics CSCI 4030

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

Search Taxonomy. Web Search. Search Engine Optimization. Information Retrieval

Limitations of Human Vision. What is computer vision? What is computer vision (cont d)?

MACHINE LEARNING IN HIGH ENERGY PHYSICS

Classification algorithm in Data mining: An Overview

The Delicate Art of Flower Classification

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Neural Network based Vehicle Classification for Intelligent Traffic Control

Data, Measurements, Features

Social Media Mining. Data Mining Essentials

Content-Based Recommendation

Signature Segmentation from Machine Printed Documents using Conditional Random Field

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall

Introduction to Machine Learning Using Python. Vikram Kamath

The Scientific Data Mining Process

Machine Learning Logistic Regression

Analecta Vol. 8, No. 2 ISSN

Application of Face Recognition to Person Matching in Trains

Machine Learning and Pattern Recognition Logistic Regression

Learning is a very general term denoting the way in which agents:

View-Invariant Dynamic Texture Recognition using a Bag of Dynamical Systems

ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)

ADVANCED MACHINE LEARNING. Introduction

Predict Influencers in the Social Network

Taking Inverse Graphics Seriously

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

UNIVERSITY OF OSLO. Faculty of Mathematics and Natural Sciences

Classification of Fingerprints. Sarat C. Dass Department of Statistics & Probability

Android Ros Application

Neural Networks and Support Vector Machines

Image Classification for Dogs and Cats

Logistic Regression. Vibhav Gogate The University of Texas at Dallas. Some Slides from Carlos Guestrin, Luke Zettlemoyer and Dan Weld.

A Learning Based Method for Super-Resolution of Low Resolution Images

CSCI567 Machine Learning (Fall 2014)

Semi-Supervised Support Vector Machines and Application to Spam Filtering

Analysis of kiva.com Microlending Service! Hoda Eydgahi Julia Ma Andy Bardagjy December 9, 2010 MAS.622j

Machine Learning in Computer Vision A Tutorial. Ajay Joshi, Anoop Cherian and Ravishankar Shivalingam Dept. of Computer Science, UMN

Machine Learning: Statistical Methods - I

Statistical Machine Learning

Feature Tracking and Optical Flow

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

High-dimensional labeled data analysis with Gabriel graphs

Classifying Manipulation Primitives from Visual Data

Object Recognition and Template Matching

Environmental Remote Sensing GEOG 2021

FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM

High-Performance Signature Recognition Method using SVM

Assessing Data Mining: The State of the Practice

Tracking in flussi video 3D. Ing. Samuele Salti

Machine Learning CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science

CSE 473: Artificial Intelligence Autumn 2010

Machine Learning and Statistics: What s the Connection?

MA2823: Foundations of Machine Learning

Object Recognition. Selim Aksoy. Bilkent University

Online Learning for Offroad Robots: Using Spatial Label Propagation to Learn Long Range Traversability

Machine Learning Introduction

Tracking and Recognition in Sports Videos

Supporting Online Material for

Colorado School of Mines Computer Vision Professor William Hoff

LIST OF CONTENTS CHAPTER CONTENT PAGE DECLARATION DEDICATION ACKNOWLEDGEMENTS ABSTRACT ABSTRAK

Face Recognition For Remote Database Backup System

Efficient Attendance Management: A Face Recognition Approach

Machine Learning. Chapter 18, 21. Some material adopted from notes by Chuck Dyer

The Implementation of Face Security for Authentication Implemented on Mobile Phone

SURVEY OF TEXT CLASSIFICATION ALGORITHMS FOR SPAM FILTERING

An Introduction to Neural Networks

Steven C.H. Hoi School of Information Systems Singapore Management University

Image Processing Based Automatic Visual Inspection System for PCBs

A Partially Supervised Metric Multidimensional Scaling Algorithm for Textual Data Visualization

Blood Vessel Classification into Arteries and Veins in Retinal Images

Jiří Matas. Hough Transform

Pixels Description of scene contents. Rob Fergus (NYU) Antonio Torralba (MIT) Yair Weiss (Hebrew U.) William T. Freeman (MIT) Banksy, 2006

Part-Based Recognition

Machine Learning in Spam Filtering

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.

Data Mining Techniques for Prognosis in Pancreatic Cancer

Robotics. Chapter 25. Chapter 25 1

Simple and efficient online algorithms for real world applications

3 An Illustrative Example

MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos Iasonas.kokkinos@ecp.fr

CATEGORIZATION OF SIMILAR OBJECTS USING BAG OF VISUAL WORDS AND k NEAREST NEIGHBOUR CLASSIFIER

Programming Exercise 3: Multi-class Classification and Neural Networks

Transcription:

Object Recognition Lecture 11, April 20 th, 2009 Lexing Xie EE4830 Digital Image Processing http://www.ee.columbia.edu/~xlx/ee4830/ 1

Announcements 2 HW#5 due today HW#6 last HW of the semester Problems 1-3 (100 pts + bonus) Due May 5 th Covers object recognition and image compression Two written problems one programming with code skeleton given (rewards best recognition accuracy in class) Problems 4-5 (bonus points only) due May 7 th Covers image reconstruction (lecture 13) Two written problems

Roadmap to Date 3 Spatial Domain processing and enhancement Image Transform and Filtering Morphological Processing Image Segmentation Image Descriptors Applications: Image Restoration, Object recognition, Image Compression, Indexing and Retrieval, Reconstruction

Lecture Outline 4 Problem: object recognition from images. What and why Pattern recognition primer Object recognition in controlled environments State of the art object recognition systems

What is Object Recognition? 5 perceptible material thing Courtesy of http://people.csail.mit.edu/torralba/iccv2005/

What is Object Recognition? 6 Sensory data Color, texture, shape, motion, size, weight, smell, touch, sound, Descriptions toy, stuffed Pooh, a frontal, close-up shot of stuffed Pooh, ToysRus #345812, One of the fundamental problems of computer vision: + Apple

Why? 7 Science How do we recognize objects? Practice Robot navigation Medical diagnosis Security Industrial inspection and automation Human-computer interface Information retrieval

Applications of Object Recognition 8 Some images from http://www.cs.utexas.edu/~grauman/research/research.html

9

Lecture Outline 10 Object recognition: what and why Object recognition in controlled environments Distance-based classifiers generalized linear classifiers Neural networks Bayes classifiers Object recognition in practice General object recognition systems Summary

Objects as Vectors 11

image vector representation 12

pattern classifier from examples 13 goal: given x, infer y learning from examples: supervised learning given (x i, y i =f(x i )), i=1,,n for some unknown function f find a good approximation to f rules versus data encode human knowledge as rules e.g. the petal length and width of iris appropriate scenarios for supervised learning no human expert (predict strength to cure AIDS given new molecule structure) human can perform task but can t describe how they do it (e.g. handwriting recognition, object recognition) the desired function is changing constantly w.r.t. time, or user (stock trading decisions, user-specific spam filtering)

minimum distance classifier 14 step 1: calculate class prototypes as the means step 2: use the prototypes to classify a new example discriminant function f:

nearest neighbor classifier 15 steps: store all training examples classify a new example x? by finding the training example (x i, y i ) that s nearest to x? according to Euclidean distance, and copying the labels

nearest neighbor classifier 16 discriminant function f: gray area -1; white area +1 (implicit) decision boundaries form a subset of the Voronoi diagram of the training data each line segment is equidistant between two points comments conditioned on the distance metric prone to noisy, poorly scaled features can smooth the decision by looking at K-neighbors and vote good news: knn is universally consistent

linear classifier 17 two desirables explicit (linear) decision boundary use many training examples/prototypes but do not need to remember all

the perceptron algorithm 18 learning a linear classifier given training data (x i, y i ) and loss function find: weight vector [w; b] that minimizes expected loss on training data use hinge loss: start from initial weights w 0 compute gradient update repeat until convergence η : learning rate

computing the gradient 19 given compute gradient contribution from each training sample contribution from each dimension of each training sample η must decrease to zero in order to guarantee convergence. some algorithms (Newton s) can automatically select η. local minimum is the global minimum for hinge loss

are all linear classifiers created equal? 20 all of the separating hyper-planes have zero (hinge) loss the perceptron algorithm will stop as soon as may some hyper-planes more preferable to others

Support Vector Machines 21 Two key ideas: The best separating hyperplane has the largest margin. Class boundary can be linear in a higherdimensional space, e.g., generalized linear discriminant weighted (generalized) inner product with support vectors

Neural Networks 22

Neural Network Decision Boundaries 23 a single hidden layer, feed forward neural network is capable of approximating any continuous, multivariate function to any desired degree of accuracy and that failure to map a function arises from poor choice of network parameters, or an insufficient number of hidden neurons. [Cybenko 1989]

Digit Recognition with Neural Net 24 0 1 LeCun et al, 1992, 1998, http://yann.lecun.com/exdb/mnist/

probabilistic classifiers 25 what about probabilities p(x y) is usually easy to obtain from training data can we estimate p(y x)?

Bayes classifier 26

Bayes classifier for Gaussian classes 27

estimating the conditionals 28 how do we estimate p(x y) x 1, x 2,, x N discrete: count over observed samples to get the conditional histograms x 1, x 2,, x N continuous and conditionally Gaussian x scalar

29

Bayes classifier example 30

31

classification results 32

exercise 33 P(x ω j ) being 1-D Gaussians with identical covariance. 1) Towards which direction should the decision boundary move if p(ω 1 )>p(ω 2 )? Left/right/stay-put 2) What if there is a third Gaussian?

homework problem 2: classifying digits 34 instruction/sample code available load digits from the MNIST dataset baseline 1-NN classifier experiment/observe/improve k-nn, with k=3, 5 play with features space (PCA, ) Optionally experiment with other classifiers (SVM, neural net, ) compute error rate post examples that are correctly/incorrectly classified discuss what was tried and observed

Lecture Outline 35 object recognition: what and why object recognition as pattern classification general object recognition systems real-world challenges object recognition: a systems view current commercial systems survey of state-of the art demo websites

Object Recognition End-to-End 36 training testing Recognition Target Sensor(s) Pre-processing Feature Extraction train classifier images segments features Sensor(s) Pre-processing Feature Extraction Classification window parts or patches salient points pixel vector descriptors classes Postprocessing face? car? statue of liberty? spatial/temporal smoothing context, etc.

Object Recognition in Practice 37 Commercial object recognition Currently a $4 billion/year industry for inspection and assembly Almost entirely based on template matching Upcoming applications Mobile robots, toys, user interfaces Location recognition Digital camera panoramas, 3D scene modeling courtesy of David Lowe, website and CVPR 2003 Tutorial

Industrial Applications 38 http://www.cs.ubc.ca/spider/lowe/vision.html

http://www.appian-tech.com/ 39

http://www.sportvision.com/ 40

http://www.dipix.com/ 41

What to Recognize 42 Specific Wild card Tower Bridge The Stata Center Categories butterfly butterfly building building Kristen Grauman, http://www.cs.utexas.edu/~grauman/research/

Recognize Specific Objects (1) 43 Appearance Matching [Nayar, Murase et. al.]

44 PCA on the training set. Estimate parameters of a low-dimensional pose manifold with splines. Match new image to the closest point on the manifold.

Recognize Specific Objects (2) 45 Part-based approach Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging parameters select interest points that are stable extrema points across different scales. SIFT Features David Lowe, CVPR 2003 Tutorial

SIFT Descriptor 46 Thresholded image gradients are sampled over 16x16 array of locations in scale space (Gaussian-weighted). Create array of orientation histograms 8 orientations x 4x4 histogram array = 128 dimensions David Lowe, CVPR 2003 Tutorial

Object Category Recognition 47

Overview of object category recognition see iccv tutorial 48

Demos 49 Pittpatt http://demo.pittpatt.com/

It s not just vision Integrate with mobile sensor information (GPS, time, nearby object or people), calendar, schedule Infer semantically rich meta-data labels from joint sources. 10am 7 Sep 05 Australian park Jim, Jill nearby 50 two koalas seen on nat. park trip with Jim and Jill Jill and koala on nat. park trip John and his new car 4pm 8 Sep 05 Sydney 8pm 10 Oct 05 London 3pm 10 Sep 05 downloaded from http://... office parking lot car to consider purchasing http://www.cs.utexas.edu/~grauman/research/research.html

Summary 51 The object recognition problem Pattern classification primer Object recognition grown up Readings: G&W 12.1-12.2 Reference: Duda, Hart, Stork, Pattern Classification, 2 nd Ed. Next time: Image Compression Additional acknowledgements: Dan Ellis, EE6820 Slides; Duda, Hart& Stork, Pattern classificaion 2 nd Ed., David Claus and Christoph F. Eick: Nearest Neighbor Editing and Condensing Techniques