Introduction. Selim Aksoy. Bilkent University saksoy@cs.bilkent.edu.tr



Similar documents
Object Recognition. Selim Aksoy. Bilkent University

Colorado School of Mines Computer Vision Professor William Hoff

Linear Filtering Part II

Introduction to Pattern Recognition

Limitations of Human Vision. What is computer vision? What is computer vision (cont d)?

Content-Based Image Retrieval

Feature Tracking and Optical Flow

Course Overview. CSCI 480 Computer Graphics Lecture 1. Administrative Issues Modeling Animation Rendering OpenGL Programming [Angel Ch.

Virtual Environments - Basics -

Professor, D.Sc. (Tech.) Eugene Kovshov MSTU «STANKIN», Moscow, Russia

What makes Big Visual Data hard?

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network

Digital Image Fundamentals. Selim Aksoy Department of Computer Engineering Bilkent University

Big Data: Image & Video Analytics

Applying Deep Learning to Car Data Logging (CDL) and Driver Assessor (DA) October 22-Oct-15

Machine Learning for Data Science (CS4786) Lecture 1

Computer/IT Project LIST. Contact:

Digital Image Processing: Introduction

Segmentation & Clustering

Applications of Deep Learning to the GEOINT mission. June 2015

Computer Animation and Visualisation. Lecture 1. Introduction

Artificial Intelligence and Politecnico di Milano. Presented by Matteo Matteucci

Digital Image Processing EE368/CS232

Health Informatics and Artificial Intelligence: the next big thing in health/aged care

BACnet for Video Surveillance

Human behavior analysis from videos using optical flow

The Future of Mobile Robots In 2020, 26 Million Mobile Robots Will Enable Autonomy in Smart Factories, Unmanned Transportation, and Connected Homes

Introduction to Computer Graphics

Computer Science Electives and Clusters

Machine Intelligence. Department of Computer Science

3D Vision An enabling Technology for Advanced Driver Assistance and Autonomous Offroad Driving

Tracking of Small Unmanned Aerial Vehicles

ISSN: A Review: Image Retrieval Using Web Multimedia Mining

How To Get A Computer Engineering Degree

Technical Club: New Vision of Computing

Safe Robot Driving 1 Abstract 2 The need for 360 degree safeguarding

Open issues and research trends in Content-based Image Retrieval

Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006

TIETS34 Seminar: Data Mining on Biometric identification

Fact sheet. Tractable. Automating visual recognition tasks with Artificial Intelligence

Problem-Based Group Activities for a Sensation & Perception Course. David S. Kreiner. University of Central Missouri

Levels of Analysis and ACT-R

Machine Learning: Overview

Machine Learning. CS494/594, Fall :10 AM 12:25 PM Claxton 205. Slides adapted (and extended) from: ETHEM ALPAYDIN The MIT Press, 2004

What the Hell is Big Data?

CS 534: Computer Vision 3D Model-based recognition

Trimble Realworks Software

Applied Research Laboratory: Visualization, Information and Imaging Programs

A Simple Guide To Understanding 3D Scanning Technologies

EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM

NVIDIA AUTOMOTIVE. Driving Innovation

Neural Network based Vehicle Classification for Intelligent Traffic Control

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE.

Shape Ape. Low- Cost Cubing and Dimensioning

Object Tracking System Using Motion Detection

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

A Cognitive Approach to Vision for a Mobile Robot

An Approach for Utility Pole Recognition in Real Conditions

Pallas Ludens. We inject human intelligence precisely where automation fails. Jonas Andrulis, Daniel Kondermann

What is Artificial Intelligence?

Human Behavior Analysis in Intelligent Retail Environments

SURVEY REPORT DATA SCIENCE SOCIETY 2014

A Study on the Actual Implementation of the Design Methods of Ambiguous Optical Illusion Graphics

technical notes trimble realworks software

Environmental Remote Sensing GEOG 2021

Multimedia Technology Bachelor of Science

Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance

Speaker: Prof. Mubarak Shah, University of Central Florida. Title: Representing Human Actions as Motion Patterns

MusicGuide: Album Reviews on the Go Serdar Sali

Vision based Vehicle Tracking using a high angle camera

Advice for Students completing the B.S. degree in Computer Science based on Quarters How to Satisfy Computer Science Related Electives

CONTROL, COMMUNICATION & SIGNAL PROCESSING (CCSP)

Structure of Postgraduate Programs (2005)

Appendices master s degree programme Artificial Intelligence

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Chapter 6 - Enhancing Business Intelligence Using Information Systems

Edge tracking for motion segmentation and depth ordering

Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011

Intelligent Computing at LUT

How To Get A Computer Science Degree

Agreement on. Dual Degree Master Program in Computer Science KAIST. Technische Universität Berlin

3D Model based Object Class Detection in An Arbitrary View

Tracking And Object Classification For Automated Surveillance

Traffic Monitoring Systems. Technology and sensors

Analecta Vol. 8, No. 2 ISSN

Circle Object Recognition Based on Monocular Vision for Home Security Robot

Labnet Services Summary

2010 Master of Science Computer Science Department, University of Massachusetts Amherst

USING COMPUTER VISION IN SECURITY APPLICATIONS

3. NUMBER OF PARTICIPANTS TO BE ENROLLED

Intelligent Flexible Automation

Manjula Ambur NASA Langley Research Center April 2014

Transcription:

Introduction Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr

What is computer vision? What does it mean, to see? The plain man's answer (and Aristotle's, too) would be, to know what is where by looking. -- David Marr, Vision (1982) Automatic understanding of images and video Computing properties of the 3D world from visual data (measurement). Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities (perception and interpretation). Adapted from Trevor Darrell, UC Berkeley, Alyosha Efros, Carnegie Mellon CS 484, Spring 2012 2012, Selim Aksoy 2

Why study computer vision? As image sources multiply, so do applications Relieve humans of boring, easy tasks Enhance human abilities: human-computer interaction, visualization Perception for robotics / autonomous agents Organize and give access to visual content Goals of vision research: Give machines the ability to understand scenes. Aid understanding and modeling of human vision. Automate visual operations. Adapted from Trevor Darrell, UC Berkeley CS 484, Spring 2012 2012, Selim Aksoy 3

Why study computer vision? Personal photo albums Movies, news, sports Surveillance and security Medical and scientific images CS 484, Spring 2012 2012, Selim Aksoy 4

Related disciplines Graphics Image processing Artificial intelligence Computer vision Algorithms Machine learning Cognitive science CS 484, Spring 2012 2012, Selim Aksoy 5

Applications Medical image analysis Security Biometrics Surveillance Tracking Target recognition Remote sensing Robotics Industrial inspection, quality control Document analysis Multimedia Assisted living Human-computer interfaces CS 484, Spring 2012 2012, Selim Aksoy 6

Medical image analysis http://www.clarontech.com CS 484, Spring 2012 2012, Selim Aksoy 7

Medical image analysis http://www.clarontech.com CS 484, Spring 2012 2012, Selim Aksoy 8

Medical image analysis http://www.clarontech.com CS 484, Spring 2012 2012, Selim Aksoy 9

Medical image analysis 3D imaging: MRI, CT Image guided surgery Grimson et al., MIT Adapted from CSE 455, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 10

Medical image analysis Cancer detection and grading CS 484, Spring 2012 2012, Selim Aksoy 11

Medical image analysis Slice of lung Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 12

Medical image analysis CS 484, Spring 2012 2012, Selim Aksoy 13

Biometrics Adapted from Anil Jain, Michigan State CS 484, Spring 2012 2012, Selim Aksoy 14

Surveillance and tracking University of Central Florida, Computer Vision Lab CS 484, Spring 2012 2012, Selim Aksoy 15

Surveillance and tracking Adapted from Octavia Camps, Penn State CS 484, Spring 2012 2012, Selim Aksoy 16

Surveillance and tracking Adapted from Martial Hebert, CMU CS 484, Spring 2012 2012, Selim Aksoy 17

Surveillance and tracking Generating traffic patterns University of Central Florida, Computer Vision Lab CS 484, Spring 2012 2012, Selim Aksoy 18

Surveillance and tracking Tracking in UAV videos Adapted from Martial Hebert, CMU, and Masaharu Kobashi, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 19

Smart cars Adapted from CSE 455, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 20

Vehicle and pedestrian protection Lane departure warning, collision warning, traffic sign recognition, pedestrian recognition, blind spot warning http://www.mobileye-vision.com CS 484, Spring 2012 2012, Selim Aksoy 21

Forest fire monitoring system Early warning of forest fires Adapted from Enis Cetin, Bilkent University CS 484, Spring 2012 2012, Selim Aksoy 22

Robotics Adapted from CSE 455, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 23

Robotics Adapted from Steven Seitz, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 24

Autonomous navigation http://www.darpa.mil/grandchallenge/index.asp http://en.wikipedia.org/wiki/darpa_grand_challenge CS 484, Spring 2012 2012, Selim Aksoy 25

Autonomous navigation Michigan State University General Dynamics Robotics Systems http://www.gdrs.com CS 484, Spring 2012 2012, Selim Aksoy 26

Face detection and recognition Adapted from CSE 455, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 27

Industrial automation Automatic fruit sorting Color Vision Systems http://www.cvs.com.au CS 484, Spring 2012 2012, Selim Aksoy 28

Industrial automation Industrial robotics; bin picking http://www.braintech.com CS 484, Spring 2012 2012, Selim Aksoy 29

Postal service automation General Dynamics Robotics Systems http://www.gdrs.com CS 484, Spring 2012 2012, Selim Aksoy 30

Optical character recognition Digit recognition, AT&T labs http://www.research.att.com/~yann License place recognition Adapted from Steven Seitz, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 31

Document analysis Adapted from Shapiro and Stockman CS 484, Spring 2012 2012, Selim Aksoy 32

Document analysis Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 33

Sports video analysis Tennis review system http://www.hawkeyeinnovations.co.uk CS 484, Spring 2012 2012, Selim Aksoy 34

Scene classification CS 484, Spring 2012 2012, Selim Aksoy 35

Object recognition Adapted from Rob Fergus, MIT CS 484, Spring 2012 2012, Selim Aksoy 36

Object recognition Lincoln, Microsoft Research kooaba Situated search Yeh et al., MIT Google Goggles Bing Vision CS 484, Spring 2012 2012, Selim Aksoy 37

Land cover classification CS 484, Spring 2012 2012, Selim Aksoy 38

Land cover classification CS 484, Spring 2012 2012, Selim Aksoy 39

Object recognition CS 484, Spring 2012 2012, Selim Aksoy 40

Object recognition Recognition of buildings and building groups CS 484, Spring 2012 2012, Selim Aksoy 41

Organizing image archives Adapted from Pinar Duygulu, Bilkent University CS 484, Spring 2012 2012, Selim Aksoy 42

Photo tourism: exploring photo collections Building 3D scene models from individual photos Adapted from Steven Seitz, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 43

Photosynth CS 484, Spring 2012 2012, Selim Aksoy 44

Content-based retrieval Online shopping catalog search http://www.like.com CS 484, Spring 2012 2012, Selim Aksoy 45

3D scanning and reconstruction Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 46

Earth viewers (3D modeling) CS 484, Spring 2012 2012, Selim Aksoy 47

Motion capture Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 48

Visual effects Adapted from CSE 455, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 49

Motion capture Microsoft s XBox Kinect Adapted from CSE 455, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 50

Mozaic Adapted from David Forsyth, UC Berkeley CS 484, Spring 2012 2012, Selim Aksoy 51

Mozaic Adapted from David Forsyth, UC Berkeley CS 484, Spring 2012 2012, Selim Aksoy 52

Critical issues What information should be extracted? How can it be extracted? How should it be represented? How can it be used to aid analysis and understanding? CS 484, Spring 2012 2012, Selim Aksoy 53

Perception and grouping Subjective contours CS 484, Spring 2012 2012, Selim Aksoy 54

Perception and grouping Adapted from Gonzales and Woods CS 484, Spring 2012 2012, Selim Aksoy 55

Perception and grouping Müller-Lyer Illusion Adapted from Alyosha Efros, Carnegie Mellon CS 484, Spring 2012 2012, Selim Aksoy 56

Copyright A.Kitaoka 2003 CS 484, Spring 2012 2012, Selim Aksoy 58

Perception and grouping Occlusion Adapted from Michael Black, Brown University CS 484, Spring 2012 2012, Selim Aksoy 59

What the computer gets CS 484, Spring 2012 2012, Selim Aksoy 60

Challenges 1: view point variation Michelangelo 1475-1564 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2012 2012, Selim Aksoy 61

Challenges 2: illumination Adapted from Fei-Fei Li CS 484, Spring 2012 2012, Selim Aksoy 62

Challenges 3: occlusion Magritte, 1957 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2012 2012, Selim Aksoy 63

Challenges 4: scale Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2012 2012, Selim Aksoy 64

Challenges 5: deformation Xu, Beihong 1943 Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2012 2012, Selim Aksoy 65

Challenges 6: background clutter Adapted from Fei-Fei Li CS 484, Spring 2012 2012, Selim Aksoy 66

Challenges 7: intra-class variation Adapted from L. Fei-Fei, R. Fergus, A. Torralba CS 484, Spring 2012 2012, Selim Aksoy 67

Recognition How can different cues such as color, texture, shape, motion, etc., can be used for recognition? Which parts of image should be recognized together? How can objects be recognized without focusing on detail? How can objects with many free parameters be recognized? How do we structure very large model bases? CS 484, Spring 2012 2012, Selim Aksoy 68

Color Adapted from Martial Hebert, CMU CS 484, Spring 2012 2012, Selim Aksoy 69

Texture Adapted from David Forsyth, UC Berkeley CS 484, Spring 2012 2012, Selim Aksoy 70

Color, texture, and proximity Adapted from Fei-Fei Li CS 484, Spring 2012 2012, Selim Aksoy 71

Segmentation Original Images Color Regions Texture Regions Line Clusters Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 72

Segmentation Adapted from Jianbo Shi, U Penn CS 484, Spring 2012 2012, Selim Aksoy 73

Shape Recognized objects Model database Adapted from Enis Cetin, Bilkent University CS 484, Spring 2012 2012, Selim Aksoy 74

Motion Adapted from Michael Black, Brown University CS 484, Spring 2012 2012, Selim Aksoy 75

Detection Adapted from David Forsyth, UC Berkeley CS 484, Spring 2012 2012, Selim Aksoy 76

Detection Adapted from David Forsyth, UC Berkeley CS 484, Spring 2012 2012, Selim Aksoy 77

Detection Adapted from Michael Black, Brown University CS 484, Spring 2012 2012, Selim Aksoy 78

Recognition Adapted from Michael Black, Brown University CS 484, Spring 2012 2012, Selim Aksoy 79

Recognition Adapted from David Forsyth, UC Berkeley CS 484, Spring 2012 2012, Selim Aksoy 80

Parts and relations Adapted from Michael Black, Brown University CS 484, Spring 2012 2012, Selim Aksoy 81

Parts and relations Adapted from Michael Black, Brown University CS 484, Spring 2012 2012, Selim Aksoy 82

Context Adapted from Antonio Torralba, MIT CS 484, Spring 2012 2012, Selim Aksoy 83

Context Adapted from Antonio Torralba, MIT CS 484, Spring 2012 2012, Selim Aksoy 84

Context Adapted from Derek Hoiem, CMU CS 484, Spring 2012 2012, Selim Aksoy 85

Context Adapted from Derek Hoiem, CMU CS 484, Spring 2012 2012, Selim Aksoy 86

Stages of computer vision Low-level image image Mid-level image features / attributes Image analysis / image understanding High-level features making sense, recognition CS 484, Spring 2012 2012, Selim Aksoy 87

Low-level sharpening blurring Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 88

Low-level Canny original image Mid-level edge image ORT edge image data structure circular arcs and line segments Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 89

Mid-level K-means clustering (followed by connected component analysis) original color image regions of homogeneous color data structure Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 90

Low-level to high-level low-level edge image mid-level high-level consistent line clusters Adapted from Linda Shapiro, U of Washington CS 484, Spring 2012 2012, Selim Aksoy 91