Motivation. Lecture 31: Object Recognition: SIFT Keys. Simple Example. Simple Example. Simple Example

Similar documents
Randomized Trees for Real-Time Keypoint Recognition

Build Panoramas on Android Phones

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow

siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service

Distinctive Image Features from Scale-Invariant Keypoints

Automatic georeferencing of imagery from high-resolution, low-altitude, low-cost aerial platforms

Recognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28

Cloud-Based Image Coding for Mobile Devices Toward Thousands to One Compression

Make and Model Recognition of Cars

Image Segmentation and Registration

Face Recognition in Low-resolution Images by Using Local Zernike Moments

Feature Tracking and Optical Flow

Android Ros Application

Stitching of X-ray Images

Probabilistic Latent Semantic Analysis (plsa)

MIFT: A Mirror Reflection Invariant Feature Descriptor

VEHICLE TRACKING USING FEATURE MATCHING AND KALMAN FILTERING

A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms

Object class recognition using unsupervised scale-invariant learning

Point Matching as a Classification Problem for Fast and Robust Object Pose Estimation

Local features and matching. Image classification & object localization

Automatic Grocery Shopping Assistant

Feature Point Selection using Structural Graph Matching for MLS based Image Registration

Convolution. 1D Formula: 2D Formula: Example on the web:

3D Model based Object Class Detection in An Arbitrary View

Camera geometry and image alignment

Taking Inverse Graphics Seriously

3D OBJECT MODELING AND RECOGNITION IN PHOTOGRAPHS AND VIDEO

TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service

MusicGuide: Album Reviews on the Go Serdar Sali

The use of computer vision technologies to augment human monitoring of secure computing facilities

A Comparative Study of SIFT and its Variants

Part-Based Recognition

VECTORAL IMAGING THE NEW DIRECTION IN AUTOMATED OPTICAL INSPECTION

An Intelligent Wellness Keeper for Food Nutrition with Graphical Icons

Unsupervised Footwear Impression Analysis and Retrieval from Crime Scene Data

Online Learning of Patch Perspective Rectification for Efficient Object Detection

Scale-Invariant Object Categorization using a Scale-Adaptive Mean-Shift Search

Admin stuff. 4 Image Pyramids. Spatial Domain. Projects. Fourier domain 2/26/2008. Fourier as a change of basis

A Learning Based Method for Super-Resolution of Low Resolution Images

Surgical Tools Recognition and Pupil Segmentation for Cataract Surgical Process Modeling

BRIEF: Binary Robust Independent Elementary Features

IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS

PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO

Jiří Matas. Hough Transform

GPS-aided Recognition-based User Tracking System with Augmented Reality in Extreme Large-scale Areas

AN EFFICIENT HYBRID REAL TIME FACE RECOGNITION ALGORITHM IN JAVA ENVIRONMENT ABSTRACT

Heat Kernel Signature

Image Stitching based on Feature Extraction Techniques: A Survey

Classification of Fingerprints. Sarat C. Dass Department of Statistics & Probability

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang

Image Gradients. Given a discrete image Á Òµ, consider the smoothed continuous image ܵ defined by

Edge detection. (Trucco, Chapt 4 AND Jain et al., Chapt 5) -Edges are significant local changes of intensity in an image.

Fast Matching of Binary Features

Traffic Flow Monitoring in Crowded Cities

CS 534: Computer Vision 3D Model-based recognition

SURF: Speeded Up Robust Features

FastKeypointRecognitioninTenLinesofCode

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report

FREAK: Fast Retina Keypoint

High Quality Image Magnification using Cross-Scale Self-Similarity

International Journal of Advanced Information in Arts, Science & Management Vol.2, No.2, December 2014

ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES

Footwear Print Retrieval System for Real Crime Scene Marks

Object Recognition and Template Matching

A Hardware Architecture for Scalespace Extrema Detection

Automated Location Matching in Movies

Cloud based object recognition: A system proposal

OBJECT TRACKING USING LOG-POLAR TRANSFORMATION

Edge-based Template Matching and Tracking for Perspectively Distorted Planar Objects

3D Object Recognition in Clutter with the Point Cloud Library

Image Classification for Dogs and Cats

Augmented Reality Tic-Tac-Toe

G E N E R A L A P P R O A CH: LO O K I N G F O R D O M I N A N T O R I E N T A T I O N I N I M A G E P A T C H E S

How To Write A Fast Binary Descriptor Based On Brief

EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM

HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT

Thèse de l Université de Lyon

Introduction to Computer Graphics

PCL Tutorial: The Point Cloud Library By Example. Jeff Delmerico. Vision and Perceptual Machines Lab 106 Davis Hall UB North Campus.

J. P. Oakley and R. T. Shann. Department of Electrical Engineering University of Manchester Manchester M13 9PL U.K. Abstract

An Experimental Comparison of Online Object Tracking Algorithms

Image Hallucination Using Neighbor Embedding over Visual Primitive Manifolds

Face detection is a process of localizing and extracting the face region from the

Signature Segmentation and Recognition from Scanned Documents

FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM

More Local Structure Information for Make-Model Recognition

Unsupervised Joint Alignment of Complex Images

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.

SWIGS: A Swift Guided Sampling Method

Tracking in flussi video 3D. Ing. Samuele Salti

Convolutional Feature Maps

Euler Vector: A Combinatorial Signature for Gray-Tone Images

Classifying Manipulation Primitives from Visual Data

A Performance Evaluation of Volumetric 3D Interest Point Detectors

Vision based Vehicle Tracking using a high angle camera

FPGA-BASED PARALLEL HARDWARE ARCHITECTURE FOR REAL-TIME OBJECT CLASSIFICATION. by Murad Mohammad Qasaimeh

Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition

Fully Automated and Stable Registration for Augmented Reality Applications

Visual Categorization with Bags of Keypoints

Transcription:

Lecture 31: Object Recognition: SIFT Keys Motivation Want to recognize a known objects from unknown viewpoints. find them in an image database of models Local Feature based Approaches Represent appearance of object by little intensity/feature patches. Try to match patches from object to image Geometrically consistent matches tell you the location and pose of the object Represent object by set of 11x11 intensity templates extracted around Harris corners. harris corners our object model Match patches to new image using NCC. Find matches consistent with affine transformation using RANSAC 1

Inlier matches let you solve for location and pose of object in the image. Problem with Using NCC to match intensity patches puts restrictions on the amount of overall rotation and scaling allowed between the model and the image appearance. model template ncc ncc ncc matches well no match no match More General : SIFT Keys SIFT Keys: General Idea Reliably extract same image points regardless of new magnification and rotation of the image. Normalize image patches, extract feature vector Match feature vectors using correlation David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, 60, 2 (2004), pp. 91-110. SIFT Keys: General Idea Want to detect/match same features regardless of Recall: Scale Space Basic idea: different scales are appropriate for describing different objects in the image, and we may not know the correct scale/size ahead of time. Translation : easy, almost every feature extraction and correlation matching algorithm in vision is translation invariant Rotation : harder. Guess a canonical orientation for each patch from local s Scaling : hardest of all. Create a multi-scale representation of the image and appeal to scale space theory to determine correct scale at each point. 2

Scale Selection Local Scale Space Maxima Lindeberg proposes that the natural scale for describing a feature is the scale at which a normalized derivative for detecting that feature achieves a local maximum both spatially and in scale. DnormL is the DoG operator, in this case. Laplacian operator. Example for blob detection Scale Recall: LoG Blob Finding LoG filter extrema locates blobs maxima = dark blobs on light background minima = light blobs on dark background Scale of blob (size ; radius in pixels) is determined by the sigma parameter of the LoG filter. Scale (sigma) Extrema in Space and Scale DOG level L-1 DOG level L DOG level L+1 Space LoG sigma = 2 LoG sigma = 10 Hint: when finding maxima or minima at level L, use DownSample or UpSample as necessary to make DOG images at level L-1 and L+1 the same size as L. SIFT Keys: General Idea Sift Key Steps Want to detect/match same features regardless of Translation : easy, almost every feature extraction and correlation matching algorithm in vision is translation invariant Rotation : harder. Guess a canonical orientation for each patch from local s Scaling : hardest of all. Create a multi-scale representation of the image and appeal to scale space theory to determine correct scale at each point. 3

Example Keypoint location = extrema location Keypoint scale is scale of the DOG image gaussian image (at closest scale, from pyramid) orientation weighted weighted orientation histogram. Each bucket contains sum of weighted s corresponding to angles that fall within that bucket..* = weighted by 2D gaussian kernel weighted orientation 36 buckets 10 degree range of angles in each bucket, i.e. 0 <=ang<10 : bucket 1 10<=ang<20 : bucket 2 20<=ang<30 : bucket 3 weighted weighted orientation histogram. There may be multiple orientations. peak 80% of peak value peak Second peak 80% of peak value orientation 20-30 degrees Orientation of keypoint is approximately 25 degrees In this case, generate duplicate keypoints, one with orientation at 25 degrees, one at 155 degrees. Design decision: you may want to limit number of possible multiple peaks to two. 4

Example of KeyPoint Detection SIFT Vector to make things more insensitive to changes in lighting or small changes in geometry, Lowe constructs feature vector from image s. Each keypoint has a center point (location), an orientation (rotation) and a radius (scale). At this point, we could try to correlate patches (after first normalizing to a canonical orientation and scale). SIFT Vector Sift Key Matching Model Verification Application: Object Recognition Compute SIFT keys of models and store in a database 5

Application: Object Recognition Application: Object Recognition For sets of 3 SIFT key matches, compute affine transformation and perform model verification. database of models find them in an image Application: Object Recognition Application: Landmark Recognition Note: since these are local, parts-based descriptors, they perform well even when some parts are missing (i.e. under occlusion). Application: Generating Panoramas Application: Generating Panoramas Brown and Lowe, ICCV03 Brown and Lowe, ICCV03 6

State of the Art Fully affine invariant local feature descriptors. State of the Art Affine invariant descriptors can handle larger changes in viewpoint. State of the Art For More Information SIFT keys David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, 60, 2 (2004), pp. 91-110. Affine local-feature methods 7