Object Recognition. Selim Aksoy. Bilkent University
|
|
|
- Joel O’Neal’
- 10 years ago
- Views:
Transcription
1 Image Classification and Object Recognition Selim Aksoy Department of Computer Engineering Bilkent University
2 Image classification Image (scene) classification is a fundamental problem in image understanding. Automatic techniques for associating scenes with semantic labels have a high potential for improving the performance of other computer vision applications such as browsing (natural grouping of images instead of clusters based only on low-level features), retrieval (filtering images in archives based on content), and object recognition (the probability of an unknown object/region that exhibits several local features of a ship actually being a ship can be increased if the scene context is known to be a coast with high confidence but can be decreased if no water related context is dominant in that scene). CS 484, Fall , Selim Aksoy 2
3 Image classification The image classification problem has two critical components: representing images and learning models for semantic categories using these representations. Early work used low-level global features extracted from the whole image or from a fixed spatial layout. More recent approaches exploit local statistics in images using patches extracted by interest point detectors. Other configurations that use regions and their spatial relationships are also proposed. CS 484, Fall , Selim Aksoy 3
4 Hierarchical image classification Images Others (Face/close-up) Indoor Outdoor Others (Face/close-up) City Landscape Others Sunset Mountain/forest Mountain Forest Hierarchy of 11 scene categories (Vailaya et al., Image classification for content-based indexing, IEEE Trans. Image Processing, 2001). CS 484, Fall , Selim Aksoy 4
5 Hierarchical image classification Image representation: Mean and std. dev. of LUV values in 10x10 blocks for indoor/outdoor classification. Edge direction histograms for city/landscape classification. Histograms of HSV and LUV values for sunset/mountain/forest classification. Classification: Class-conditional density estimation using vector quantization. Bayesian classification. CS 484, Fall , Selim Aksoy 5
6 Hierarchical image classification CS 484, Fall , Selim Aksoy 6
7 Hierarchical image classification CS 484, Fall , Selim Aksoy 7
8 Image classification using bag-of-words Caltech data set: 13 natural scene categories. IDIAP data set (left to right): mountain, forest, indoor, city-panorama, city-street. CS 484, Fall , Selim Aksoy 8
9 Image classification using bag-of-words Flowchart from Fei-Fei Li, Pietro Perona, A Bayesian hierarchical model for learning natural scene categories, IEEE CVPR, CS 484, Fall , Selim Aksoy 9
10 Image classification using bag-of-words A codebook obtained from 650 training examples from 13 categories. Image patches are detected by a sliding grid and random sampling of scales. CS 484, Fall , Selim Aksoy 10
11 CS 484, Fall , Selim Aksoy 11
12 CS 484, Fall , Selim Aksoy 12
13 Image classification using bag-of-words CS 484, Fall , Selim Aksoy 13
14 Image classification using bag-of-words Flowchart from Quelhas et al., A thousand words in a scene, IEEE Trans. PAMI, Probabilistic Latent Semantic Analysis (PLSA) is used to learn aspect models to capture co-occurrences of visterms (visual terms). Bag-of-visterms representation or the aspect parameters are given as input to Support Vector Machines for classification. CS 484, Fall , Selim Aksoy 14
15 Image classification using bag-of-words CS 484, Fall , Selim Aksoy 15
16 CS 484, Fall , Selim Aksoy 16
17 Image classification using bag-of-regions D. Gökalp, S. Aksoy, Scene classification using bag-of-regions representations, IEEE CVPR, Beyond Patches Workshop, Region segmentation Region clustering region codebook Above-below spatial relationships region pairs Statistical region selection: identify region types that are frequently found in a particular class of scenes but rarely exist in other classes, and consistently occur together in the same class of scenes. Bayesian scene classification using bag of individual regions, bag of region pairs. CS 484, Fall , Selim Aksoy 17
18 Image classification using bag-of-regions Examples for region clusters. Each row represents a different cluster. CS 484, Fall , Selim Aksoy 18
19 Image classification using bag-of-regions CS 484, Fall , Selim Aksoy 19
20 Image classification using bag-of-regions Examples for correctly classified scenes. Examples for wrongly classified scenes. CS 484, Fall , Selim Aksoy 20
21 Image classification using factor graphs Boutell et al., Scene Parsing Using Region-Based Generative Models, IEEE Trans. Multimedia, CS 484, Fall , Selim Aksoy 21
22 ICCV 2005 Beijing, Short Course, Oct 15 Recognizing and Learning Object Categories Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
23 Agenda Introduction Bag of words models Part-based models Discriminative methods Segmentation and recognition Conclusions CS 484, Fall , Selim Aksoy 23
24 perceptible vision material thing CS 484, Fall , Selim Aksoy 24
25 How many object categories are there? Biederman 1987 CS 484, Fall , Selim Aksoy 25
26 So what does object recognition involve? CS 484, Fall , Selim Aksoy 26
27 Verification: is that a bus? CS 484, Fall , Selim Aksoy 27
28 Detection: are there cars? CS 484, Fall , Selim Aksoy 28
29 Identification: is that a picture of Mao? CS 484, Fall , Selim Aksoy 29
30 Object categorization sky building flag banner bus face street lamp bus wall cars CS 484, Fall , Selim Aksoy 30
31 Scene and context categorization outdoor city traffic CS 484, Fall , Selim Aksoy 31
32 Challenges 1: view point variation Michelangelo CS 484, Fall , Selim Aksoy 32
33 Challenges 2: illumination slide credit: S. Ullman CS 484, Fall , Selim Aksoy 33
34 Challenges 3: occlusion Magritte, 1957 CS 484, Fall , Selim Aksoy 34
35 Challenges 4: scale CS 484, Fall , Selim Aksoy 35
36 Challenges 5: deformation Xu, Beihong 1943 CS 484, Fall , Selim Aksoy 36
37 Challenges 6: background clutter Klimt, 1913 CS 484, Fall , Selim Aksoy 37
38 History: single object recognition CS 484, Fall , Selim Aksoy 38
39 Challenges 7: intra-class variation CS 484, Fall , Selim Aksoy 39
40 History: early object categorization CS 484, Fall , Selim Aksoy 40
41 CS 484, Fall , Selim Aksoy 41
42 ANIMALS PLANTS OBJECTS INANIMATE.. VERTEBRATE NATURAL MAN-MADE MAMMALS BIRDS TAPIR BOAR GROUSE CAMERA CS 484, Fall , Selim Aksoy 42
43 CS 484, Fall , Selim Aksoy 43
44 Scenes, Objects, and Parts Scene Objects Parts E. Sudderth, A. Torralba, W. Freeman, A. Willsky. ICCV Features CS 484, Fall , Selim Aksoy 44
45 Object categorization: the statistical viewpoint p( zebra image ) vs. p( no zebra image) Bayes rule: p ( zebra image ) p ( image zebra ) p ( zebra ) = p( no zebra image) p( image no zebra) p( no zebra) posterior ratio likelihood ratio prior ratio CS 484, Fall , Selim Aksoy 45
46 Object categorization: the statistical viewpoint p( zebra image) p( image zebra) p( zebra) = p ( no zebra image ) p ( image no zebra ) p ( no zebra ) posterior ratio likelihood ratio prior ratio Discriminative methods model posterior Generative methods model likelihood and prior CS 484, Fall , Selim Aksoy 46
47 Discriminative Direct modeling of p( zebra image) p( no zebra image) Decision boundary Zebra Non-zebra CS 484, Fall , Selim Aksoy 47
48 Generative Model p( image zebra) and p( image no zebra) p ( image zebra ) Low High p( image no zebra) Middle MiddleLow CS 484, Fall , Selim Aksoy 48
49 Three main issues Representation How to represent an object category Learning How to form the classifier, given training data Recognition How the classifier is to be used on novel data CS 484, Fall , Selim Aksoy 49
50 Representation Generative / discriminative / hybrid CS 484, Fall , Selim Aksoy 50
51 Representation Generative / discriminative / hybrid Appearance only or location and appearance CS 484, Fall , Selim Aksoy 51
52 Representation Generative / discriminative / hybrid Appearance only or location and appearance Invariances View point Illumination Occlusion Scale Deformation Clutter etc. CS 484, Fall , Selim Aksoy 52
53 Representation Generative / discriminative / hybrid Appearance only or location and appearance invariances Part-based or global w/subwindow CS 484, Fall , Selim Aksoy 53
54 Representation Generative / discriminative / hybrid Appearance only or location and appearance invariances Parts or global w/subwindow Use set of features or each pixel in image CS 484, Fall , Selim Aksoy 54
55 Learning Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning CS 484, Fall , Selim Aksoy 55
56 Learning Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) Methods of training: generative vs. discriminative class densities p(x C 1 ) p(x C 2 ) posterior probabilities p(c 1 x) p(c 2 x) x x CS 484, Fall , Selim Aksoy 56
57 Learning Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) Level of supervision Manual segmentation; bounding box; image labels; noisy labels Contains a motorbike CS 484, Fall , Selim Aksoy 57
58 Learning Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) Level of supervision Manual segmentation; bounding box; image labels; noisy labels Batch/incremental (on category and image level; user-feedback ) CS 484, Fall , Selim Aksoy 58
59 Learning Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) Level of supervision Manual segmentation; bounding box; image labels; noisy labels Batch/incremental (on category and image level; user-feedback ) Training images: Issue of overfitting Negative images for discriminative methods CS 484, Fall , Selim Aksoy 59
60 Learning Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning) What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.) Level of supervision Manual segmentation; bounding box; image labels; noisy labels Batch/incremental (on category and image level; user-feedback ) Training images: Priors Issue of overfitting Negative images for discriminative methods CS 484, Fall , Selim Aksoy 60
61 Recognition Scale / orientation range to search over Speed CS 484, Fall , Selim Aksoy 61
62 Object recognition using context The relationships between objects and the scene setting can be characterized in five ways: Interposition: objects interrupt their background Support: objects tend to rest on surfaces Probability: objects tend to be found in some contexts but not others Position: given an object is probable in a scene, it is often found in some positions and not others Familiar size: objects have a limited set of size relations with other objects CS 484, Fall , Selim Aksoy 62
63 Object recognition using context An ideal setting where the objects are perfectly segmented, the regions are classified, and the objects labels are refined with respect to semantic context in the image. (Rabinovich et al. Objects in Context, ICCV, 2007) CS 484, Fall , Selim Aksoy 63
64 Object recognition using context First column: segmentation results; second column: classification without contextual constraints; third column: classification with co-occurrence contextual constraints implemented using a conditional random field model. (Rabinovich et al. Objects in Context, ICCV, 2007) CS 484, Fall , Selim Aksoy 64
65 Object recognition using context better Using above below inside around relationships as spatial constraints better better worse First column: input image; second column: classification with co-occurrence contextual constraints; third column: classification with spatial and co-occurrence contextual constraints. (Galleguillos et al. Object Categorization using Co-Occurrence, Location and Appearance, CVPR, 2008) CS 484, Fall , Selim Aksoy 65
66 Object recognition using context Geometric context: learn labeling of the image into geometric classes such as support/ground (green), vertical planar (green) and sky (blue). The arrows show the direction that the vertical planar surface is facing. (Hoiem et al., Geometric Context from a Single Image, ICCV, 2005) CS 484, Fall , Selim Aksoy 66
67 Object recognition using context (a) (b) (c) (d) Putting objects in perspective: Car (a) and pedestrian (b) detection by using only local information. Car (c) and pedestrian (d) detection by using context. (Hoiem et al. Putting Objects in Perspective, CVPR, 2006) CS 484, Fall , Selim Aksoy 67
68 Object recognition using context Contextual recognition by maximizing a scene probability function that incorporates outputs of individual object detectors (confidence values for assigned object labels) and pairwise interactions between objects (likelihood of pairwise spatial relationships). Left column: without using context; right column: with using context. (Firat Kalaycilar, An object recognition framework using contextual interactions among objects, M.S. Thesis, Bilkent University, 2009) CS 484, Fall , Selim Aksoy 68
Introduction. Selim Aksoy. Bilkent University [email protected]
Introduction Selim Aksoy Department of Computer Engineering Bilkent University [email protected] What is computer vision? What does it mean, to see? The plain man's answer (and Aristotle's, too)
3D Model based Object Class Detection in An Arbitrary View
3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/
Introduction to Pattern Recognition
Introduction to Pattern Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] CS 551, Spring 2009 CS 551, Spring 2009 c 2009, Selim Aksoy (Bilkent University)
Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011
Behavior Analysis in Crowded Environments XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011 Behavior Analysis in Sparse Scenes Zelnik-Manor & Irani CVPR
Semantic Recognition: Object Detection and Scene Segmentation
Semantic Recognition: Object Detection and Scene Segmentation Xuming He [email protected] Computer Vision Research Group NICTA Robotic Vision Summer School 2015 Acknowledgement: Slides from Fei-Fei
Probabilistic Latent Semantic Analysis (plsa)
Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg [email protected] www.multimedia-computing.{de,org} References
Content-Based Image Retrieval
Content-Based Image Retrieval Selim Aksoy Department of Computer Engineering Bilkent University [email protected] Image retrieval Searching a large database for images that match a query: What kind
Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.
Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table
Object Categorization using Co-Occurrence, Location and Appearance
Object Categorization using Co-Occurrence, Location and Appearance Carolina Galleguillos Andrew Rabinovich Serge Belongie Department of Computer Science and Engineering University of California, San Diego
Recognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28
Recognition Topics that we will try to cover: Indexing for fast retrieval (we still owe this one) History of recognition techniques Object classification Bag-of-words Spatial pyramids Neural Networks Object
Object class recognition using unsupervised scale-invariant learning
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of Technology Goal Recognition of object categories
Discovering objects and their location in images
Discovering objects and their location in images Josef Sivic Bryan C. Russell Alexei A. Efros Andrew Zisserman William T. Freeman Dept. of Engineering Science CS and AI Laboratory School of Computer Science
Discovering objects and their location in images
Discovering objects and their location in images Josef Sivic Bryan C. Russell Alexei A. Efros Andrew Zisserman William T. Freeman Dept. of Engineering Science CS and AI Laboratory School of Computer Science
High Level Describable Attributes for Predicting Aesthetics and Interestingness
High Level Describable Attributes for Predicting Aesthetics and Interestingness Sagnik Dhar Vicente Ordonez Tamara L Berg Stony Brook University Stony Brook, NY 11794, USA [email protected] Abstract
Local features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
Open issues and research trends in Content-based Image Retrieval
Open issues and research trends in Content-based Image Retrieval Raimondo Schettini DISCo Universita di Milano Bicocca [email protected] www.disco.unimib.it/schettini/ IEEE Signal Processing Society
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
Big Data: Image & Video Analytics
Big Data: Image & Video Analytics How it could support Archiving & Indexing & Searching Dieter Haas, IBM Deutschland GmbH The Big Data Wave 60% of internet traffic is multimedia content (images and videos)
Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall
Automatic Photo Quality Assessment Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall Estimating i the photorealism of images: Distinguishing i i paintings from photographs h Florin
Learning Motion Categories using both Semantic and Structural Information
Learning Motion Categories using both Semantic and Structural Information Shu-Fai Wong, Tae-Kyun Kim and Roberto Cipolla Department of Engineering, University of Cambridge, Cambridge, CB2 1PZ, UK {sfw26,
Jiří Matas. Hough Transform
Hough Transform Jiří Matas Center for Machine Perception Department of Cybernetics, Faculty of Electrical Engineering Czech Technical University, Prague Many slides thanks to Kristen Grauman and Bastian
Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite
Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite Philip Lenz 1 Andreas Geiger 2 Christoph Stiller 1 Raquel Urtasun 3 1 KARLSRUHE INSTITUTE OF TECHNOLOGY 2 MAX-PLANCK-INSTITUTE IS 3
CS 534: Computer Vision 3D Model-based recognition
CS 534: Computer Vision 3D Model-based recognition Ahmed Elgammal Dept of Computer Science CS 534 3D Model-based Vision - 1 High Level Vision Object Recognition: What it means? Two main recognition tasks:!
Tensor Methods for Machine Learning, Computer Vision, and Computer Graphics
Tensor Methods for Machine Learning, Computer Vision, and Computer Graphics Part I: Factorizations and Statistical Modeling/Inference Amnon Shashua School of Computer Science & Eng. The Hebrew University
Colorado School of Mines Computer Vision Professor William Hoff
Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Introduction to 2 What is? A process that produces from images of the external world a description
MusicGuide: Album Reviews on the Go Serdar Sali
MusicGuide: Album Reviews on the Go Serdar Sali Abstract The cameras on mobile phones have untapped potential as input devices. In this paper, we present MusicGuide, an application that can be used to
Principled Hybrids of Generative and Discriminative Models
Principled Hybrids of Generative and Discriminative Models Julia A. Lasserre University of Cambridge Cambridge, UK [email protected] Christopher M. Bishop Microsoft Research Cambridge, UK [email protected]
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
Tracking in flussi video 3D. Ing. Samuele Salti
Seminari XXIII ciclo Tracking in flussi video 3D Ing. Tutors: Prof. Tullio Salmon Cinotti Prof. Luigi Di Stefano The Tracking problem Detection Object model, Track initiation, Track termination, Tracking
LabelMe: Online Image Annotation and Applications
INVITED PAPER LabelMe: Online Image Annotation and Applications By developing a publicly available tool that allows users to use the Internet to quickly and easily annotate images, the authors were able
IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS
IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS Alexander Velizhev 1 (presenter) Roman Shapovalov 2 Konrad Schindler 3 1 Hexagon Technology Center, Heerbrugg, Switzerland 2 Graphics & Media
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles [email protected] and lunbo
Solving Big Data Problems in Computer Vision with MATLAB Loren Shure
Solving Big Data Problems in Computer Vision with MATLAB Loren Shure 2015 The MathWorks, Inc. 1 Why Are We Talking About Big Data? 100 hours of video uploaded to YouTube per minute 1 Explosive increase
Semantic Image Segmentation and Web-Supervised Visual Learning
Semantic Image Segmentation and Web-Supervised Visual Learning Florian Schroff Andrew Zisserman University of Oxford, UK Antonio Criminisi Microsoft Research Ltd, Cambridge, UK Outline Part I: Semantic
Part III: Machine Learning. CS 188: Artificial Intelligence. Machine Learning This Set of Slides. Parameter Estimation. Estimation: Smoothing
CS 188: Artificial Intelligence Lecture 20: Dynamic Bayes Nets, Naïve Bayes Pieter Abbeel UC Berkeley Slides adapted from Dan Klein. Part III: Machine Learning Up until now: how to reason in a model and
The Visual Internet of Things System Based on Depth Camera
The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
A Bayesian Framework for Unsupervised One-Shot Learning of Object Categories
A Bayesian Framework for Unsupervised One-Shot Learning of Object Categories Li Fei-Fei 1 Rob Fergus 1,2 Pietro Perona 1 1. California Institute of Technology 2. Oxford University Single Object Object
Part-Based Recognition
Part-Based Recognition Benedict Brown CS597D, Fall 2003 Princeton University CS 597D, Part-Based Recognition p. 1/32 Introduction Many objects are made up of parts It s presumably easier to identify simple
Multi-View Object Class Detection with a 3D Geometric Model
Multi-View Object Class Detection with a 3D Geometric Model Joerg Liebelt IW-SI, EADS Innovation Works D-81663 Munich, Germany [email protected] Cordelia Schmid LEAR, INRIA Grenoble F-38330 Montbonnot,
T O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN
T O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN Goal is to process 360 degree images and detect two object categories 1. Pedestrians,
A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA
A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA N. Zarrinpanjeh a, F. Dadrassjavan b, H. Fattahi c * a Islamic Azad University of Qazvin - [email protected]
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE. Venu Govindaraju
NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE Venu Govindaraju BIOMETRICS DOCUMENT ANALYSIS PATTERN RECOGNITION 8/24/2015 ICDAR- 2015 2 Towards a Globally Optimal Approach for Learning Deep Unsupervised
Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang
Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform
Mean-Shift Tracking with Random Sampling
1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of
Image Classification for Dogs and Cats
Image Classification for Dogs and Cats Bang Liu, Yan Liu Department of Electrical and Computer Engineering {bang3,yan10}@ualberta.ca Kai Zhou Department of Computing Science [email protected] Abstract
CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.
Lecture Machine Learning Milos Hauskrecht [email protected] 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht [email protected] 539 Sennott
Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006
Practical Tour of Visual tracking David Fleet and Allan Jepson January, 2006 Designing a Visual Tracker: What is the state? pose and motion (position, velocity, acceleration, ) shape (size, deformation,
Module 5. Deep Convnets for Local Recognition Joost van de Weijer 4 April 2016
Module 5 Deep Convnets for Local Recognition Joost van de Weijer 4 April 2016 Previously, end-to-end.. Dog Slide credit: Jose M 2 Previously, end-to-end.. Dog Learned Representation Slide credit: Jose
Feature Tracking and Optical Flow
02/09/12 Feature Tracking and Optical Flow Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Many slides adapted from Lana Lazebnik, Silvio Saverse, who in turn adapted slides from Steve
Object Class Recognition by Unsupervised Scale-Invariant Learning
Object Class Recognition by Unsupervised Scale-Invariant Learning R. Fergus 1 P. Perona 2 A. Zisserman 1 1 Dept. of Engineering Science 2 Dept. of Electrical Engineering University of Oxford California
Character Image Patterns as Big Data
22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,
Fast Matching of Binary Features
Fast Matching of Binary Features Marius Muja and David G. Lowe Laboratory for Computational Intelligence University of British Columbia, Vancouver, Canada {mariusm,lowe}@cs.ubc.ca Abstract There has been
Applications of Deep Learning to the GEOINT mission. June 2015
Applications of Deep Learning to the GEOINT mission June 2015 Overview Motivation Deep Learning Recap GEOINT applications: Imagery exploitation OSINT exploitation Geospatial and activity based analytics
An Automatic and Accurate Segmentation for High Resolution Satellite Image S.Saumya 1, D.V.Jiji Thanka Ligoshia 2
An Automatic and Accurate Segmentation for High Resolution Satellite Image S.Saumya 1, D.V.Jiji Thanka Ligoshia 2 Assistant Professor, Dept of ECE, Bethlahem Institute of Engineering, Karungal, Tamilnadu,
Introduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
Tracking Groups of Pedestrians in Video Sequences
Tracking Groups of Pedestrians in Video Sequences Jorge S. Marques Pedro M. Jorge Arnaldo J. Abrantes J. M. Lemos IST / ISR ISEL / IST ISEL INESC-ID / IST Lisbon, Portugal Lisbon, Portugal Lisbon, Portugal
Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance
2012 IEEE International Conference on Multimedia and Expo Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance Rogerio Feris, Sharath Pankanti IBM T. J. Watson Research Center
Machine Learning. CS 188: Artificial Intelligence Naïve Bayes. Example: Digit Recognition. Other Classification Tasks
CS 188: Artificial Intelligence Naïve Bayes Machine Learning Up until now: how use a model to make optimal decisions Machine learning: how to acquire a model from data / experience Learning parameters
MULTI-LEVEL SEMANTIC LABELING OF SKY/CLOUD IMAGES
MULTI-LEVEL SEMANTIC LABELING OF SKY/CLOUD IMAGES Soumyabrata Dev, Yee Hui Lee School of Electrical and Electronic Engineering Nanyang Technological University (NTU) Singapore 639798 Stefan Winkler Advanced
Clustering Connectionist and Statistical Language Processing
Clustering Connectionist and Statistical Language Processing Frank Keller [email protected] Computerlinguistik Universität des Saarlandes Clustering p.1/21 Overview clustering vs. classification supervised
Dissertation TOPIC MODELS FOR IMAGE RETRIEVAL ON LARGE-SCALE DATABASES. Eva Hörster
Dissertation TOPIC MODELS FOR IMAGE RETRIEVAL ON LARGE-SCALE DATABASES Eva Hörster Department of Computer Science University of Augsburg Adviser: Readers: Prof. Dr. Rainer Lienhart Prof. Dr. Rainer Lienhart
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals Chun Lei He, Ping Zhang, Jianxiong Dong, Ching Y. Suen, Tien D. Bui Centre for Pattern Recognition and Machine Intelligence,
Big Data: Rethinking Text Visualization
Big Data: Rethinking Text Visualization Dr. Anton Heijs [email protected] Treparel April 8, 2013 Abstract In this white paper we discuss text visualization approaches and how these are important
Robert Collins CSE598G. More on Mean-shift. R.Collins, CSE, PSU CSE598G Spring 2006
More on Mean-shift R.Collins, CSE, PSU Spring 2006 Recall: Kernel Density Estimation Given a set of data samples x i ; i=1...n Convolve with a kernel function H to generate a smooth function f(x) Equivalent
Pixels Description of scene contents. Rob Fergus (NYU) Antonio Torralba (MIT) Yair Weiss (Hebrew U.) William T. Freeman (MIT) Banksy, 2006
Object Recognition Large Image Databases and Small Codes for Object Recognition Pixels Description of scene contents Rob Fergus (NYU) Antonio Torralba (MIT) Yair Weiss (Hebrew U.) William T. Freeman (MIT)
Perception-based Design for Tele-presence
Perception-based Design for Tele-presence Santanu Chaudhury 1, Shantanu Ghosh 1,2, Amrita Basu 3, Brejesh Lall 1, Sumantra Dutta Roy 1, Lopamudra Choudhury 3, Prashanth R 1, Ashish Singh 1, and Amit Maniyar
Scale-Invariant Object Categorization using a Scale-Adaptive Mean-Shift Search
in DAGM 04 Pattern Recognition Symposium, Tübingen, Germany, Aug 2004. Scale-Invariant Object Categorization using a Scale-Adaptive Mean-Shift Search Bastian Leibe ½ and Bernt Schiele ½¾ ½ Perceptual Computing
Creating a Big Data Resource from the Faces of Wikipedia
Creating a Big Data Resource from the Faces of Wikipedia Md. Kamrul Hasan Génie informatique et génie logiciel École Polytechnique de Montreal Québec, Canada [email protected] Christopher J. Pal
Finding people in repeated shots of the same scene
Finding people in repeated shots of the same scene Josef Sivic 1 C. Lawrence Zitnick Richard Szeliski 1 University of Oxford Microsoft Research Abstract The goal of this work is to find all occurrences
Limitations of Human Vision. What is computer vision? What is computer vision (cont d)?
What is computer vision? Limitations of Human Vision Slide 1 Computer vision (image understanding) is a discipline that studies how to reconstruct, interpret and understand a 3D scene from its 2D images
Neural Network based Vehicle Classification for Intelligent Traffic Control
Neural Network based Vehicle Classification for Intelligent Traffic Control Saeid Fazli 1, Shahram Mohammadi 2, Morteza Rahmani 3 1,2,3 Electrical Engineering Department, Zanjan University, Zanjan, IRAN
Digital Image Fundamentals. Selim Aksoy Department of Computer Engineering Bilkent University [email protected]
Digital Image Fundamentals Selim Aksoy Department of Computer Engineering Bilkent University [email protected] Imaging process Light reaches surfaces in 3D. Surfaces reflect. Sensor element receives
FastKeypointRecognitioninTenLinesofCode
FastKeypointRecognitioninTenLinesofCode Mustafa Özuysal Pascal Fua Vincent Lepetit Computer Vision Laboratory École Polytechnique Fédérale de Lausanne(EPFL) 115 Lausanne, Switzerland Email: {Mustafa.Oezuysal,
9. Text & Documents. Visualizing and Searching Documents. Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08
9. Text & Documents Visualizing and Searching Documents Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08 Slide 1 / 37 Outline Characteristics of text data Detecting patterns SeeSoft
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS Norbert Buch 1, Mark Cracknell 2, James Orwell 1 and Sergio A. Velastin 1 1. Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE,
Combining Local Recognition Methods for Better Image Recognition
Combining Local Recognition Methods for Better Image Recognition Bart Lamiroy Patrick Gros y Sylvaine Picard INRIA CNRS DGA MOVI GRAVIR z IMAG INRIA RHÔNE-ALPES ZIRST 655, Av. de l Europe Montbonnot FRANCE
EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM
EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM Amol Ambardekar, Mircea Nicolescu, and George Bebis Department of Computer Science and Engineering University
Information Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli ([email protected])
Classifying Manipulation Primitives from Visual Data
Classifying Manipulation Primitives from Visual Data Sandy Huang and Dylan Hadfield-Menell Abstract One approach to learning from demonstrations in robotics is to make use of a classifier to predict if
Distributed forests for MapReduce-based machine learning
Distributed forests for MapReduce-based machine learning Ryoji Wakayama, Ryuei Murata, Akisato Kimura, Takayoshi Yamashita, Yuji Yamauchi, Hironobu Fujiyoshi Chubu University, Japan. NTT Communication
Detection and Recognition of Mixed Traffic for Driver Assistance System
Detection and Recognition of Mixed Traffic for Driver Assistance System Pradnya Meshram 1, Prof. S.S. Wankhede 2 1 Scholar, Department of Electronics Engineering, G.H.Raisoni College of Engineering, Digdoh
Localizing 3D cuboids in single-view images
Localizing 3D cuboids in single-view images Jianxiong Xiao Bryan C. Russell Antonio Torralba Massachusetts Institute of Technology University of Washington Abstract In this paper we seek to detect rectangular
Exploration and Visualization of Post-Market Data
Exploration and Visualization of Post-Market Data Jianying Hu, PhD Joint work with David Gotz, Shahram Ebadollahi, Jimeng Sun, Fei Wang, Marianthi Markatou Healthcare Analytics Research IBM T.J. Watson
Taking Inverse Graphics Seriously
CSC2535: 2013 Advanced Machine Learning Taking Inverse Graphics Seriously Geoffrey Hinton Department of Computer Science University of Toronto The representation used by the neural nets that work best
AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION
AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION Saurabh Asija 1, Rakesh Singh 2 1 Research Scholar (Computer Engineering Department), Punjabi University, Patiala. 2 Asst.
Determining optimal window size for texture feature extraction methods
IX Spanish Symposium on Pattern Recognition and Image Analysis, Castellon, Spain, May 2001, vol.2, 237-242, ISBN: 84-8021-351-5. Determining optimal window size for texture feature extraction methods Domènec
Collective Behavior Prediction in Social Media. Lei Tang Data Mining & Machine Learning Group Arizona State University
Collective Behavior Prediction in Social Media Lei Tang Data Mining & Machine Learning Group Arizona State University Social Media Landscape Social Network Content Sharing Social Media Blogs Wiki Forum
Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
