Visual Relationship Detection with Language Priors

Size: px
Start display at page:

Download "Visual Relationship Detection with Language Priors"

Transcription

1 Visual Relationship Detection with Language Priors Cewu Lu*, Ranjay Krishna*, Michael Bernstein, Li Fei-Fei Stanford University * = equal contribution

2 image #1 image #2 llama person llama person 2

3 next to chasing 3

4 4

5

6 Problem formulation Input (image only) 6

7 Problem formulation Input (image only) person person riding riding horse in front of horse Output 7

8 Problem formulation person person horse horse Input (image only) Output 8

9 Problem formulation Input (image only) person person riding riding horse horse Output 9

10 Problem formulation Input (image only) person person riding riding horse in front of horse Output 10

11 Related work Spatial relationships: cup on top of table Action relationships: person kick ball Common relationships: person wear shirt Roger et al. ICCV 2008 Galleguillos, CVPR 2008 Yao et al. CVPR 2012 Maji et al. CVPR 2011 Rohrbach et al. ICCV 2013 Gupta et al. PAMI 2009 Gupta et al. ECCV 2008 Kumar et al. CVPR 2010 Wang et al. ECCV 2016 Sadeghi et al. CVPR

12

13 Visual Genome dataset 33K object categories 42K relationship categories dataset also contains descriptions, question answers and attributes Krishna et al. IJCV

14 Observation 1: ride Quadratic explosion of - N objects, - K relationships leading to N2K detectors next to lying Visual Genome dataset N = 33K K = 42K drag falling off carry resting on throw 14

15 Observation #2 # of occurrences Long tail distribution of relationships - makes supervised training difficult relationships 15

16 Observation #2 # of occurrences car on wstreet dog behind w w tree dog ride skateboard Long tail distribution of relationships - makes supervised training difficult relationships 1 6

17 Observation #2 # of occurrences car on wstreet dog ride skateboard w elephant wdrink milk dog ride wsurfboard Long tail distribution of relationships - makes supervised training difficult relationships 1 7

18 Visual module Language module Input Tackles: Quadratic explosion of N2K detectors Tackles: Long tail distribution of relationships Output

19 Visual module Input Definitions: Output

20 Visual module Proposals: Uijlings et al. IJCV 2013 Input Definitions: Output

21 Visual module Proposals: Uijlings et al. IJCV 2013 Input Sample: object detector Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] Output

22 Visual module Proposals: Uijlings et al. IJCV 2013 Input Sample: object detector relationship detector Output Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] r [on, in, ride, front of, ]

23 Visual module Proposals: Uijlings et al. IJCV 2013 Input Sample: object detector relationship detector Output Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] r [on, in, ride, front of, ] T is a <o1, r, o2> triple

24 Visual module Proposals: Uijlings et al. IJCV 2013 Input Sample: object detector relationship detector person in horse Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] r [on, in, ride, front of, ] T is a <o1, r, o2> triple

25 Visual module Language module Proposals: Uijlings et al. IJCV 2013 o1: man r: ride o2: horse Input Sample: object detector relationship detector person in horse Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] r [on, in, ride, front of, ] T is a <o1, r, o2> triple

26 Visual module Language module Proposals: Uijlings et al. IJCV 2013 o1: man r: ride o2: horse Input Sample: object detector relationship detector person in horse Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] r [on, in, ride, front of, ] T is a <o1, r, o2> triple

27 Visual module Language module Proposals: Uijlings et al. IJCV 2013 o1: man r: ride o2: horse Input Sample: object detector relationship detector person riding horse Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] r [on, in, ride, front of, ] T is a <o1, r, o2> triple

28 Visual module Proposals: Uijlings et al. IJCV 2013 Sample: Tackles: object detector relationship detector Quadratic explosion only requires N+K detectors

29 Language module Tackles: Long tail distribution can predict rare relationships o 1 : man r: ride o 2 : horse

30 Training the visual module 1. Pre-train using ImageNet object detector object detector relationship detector Definitions: Deng et al. 2009

31 Training the visual module 1. Pre-train using ImageNet 2. Train object detector object detector object detector relationship detector Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] Girshirk et al. CVPR 2014

32 Training the visual module 1. Pre-train using ImageNet 2. Train object detector 3. Train relationship detector object detector object detector relationship detector Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] r [on, in, ride, front of, ]

33 Training the visual module object detector relationship detector object detector Ranking loss Pre-train using ImageNet Train object detector Train relationship detector Fine-tune both jointly Definitions: b1, b2 are object proposals o1, o2 [person, horse, ] r [on, in, ride, front of, ] Deng et al. 2009

34 Training the language module dog ride skateboard w dog ride wsurfboard 34

35 Training the language module dog ride skateboard w dog ride wsurfboard where cos is the cosine distance 35

36 Training the language module dog ride skateboard w 0 0 dog ride wsurfboard where cos is the cosine distance 36

37 Training the language module dog ride skateboard w dog ride wsurfboard where cos is the cosine distance Minimize: 37

38 Training both modules iteratively Visual module Language module

39 Our results: 39

40 Our results: spatial, comparative, asymmetrical, verb, prepositional person wear taller than person left of on wear shirt snow ski 40

41 Our results: spatial, comparative, asymmetrical, verb, prepositional person wear taller than person left of on wear shirt snow ski 41

42 Our results: spatial, comparative, asymmetrical, verb, prepositional person wear taller than person left of on wear shirt snow ski 42

43 Our results: spatial, comparative, asymmetrical, verb, prepositional person wear taller than person left of on wear shirt snow ski 43

44 Relationship types: spatial, comparative, asymmetrical, verb, prepositional person wear taller than person left of on wear shirt snow ski 44

45 Our results: spatial, comparative, asymmetrical, verb, prepositional person wear taller than person left of on wear shirt snow ski 45

46 Ablation study map Sadeghi et al Visual only Visual + language

47 Ablation study person wear shirt person wear shirt map Sadeghi et al Visual only Visual + language

48 Ablation study person wear shirt person wear shirt person in horse person in shirt map Sadeghi et al Visual only Visual + language

49 Ablation study person wear shirt person wear shirt Sadeghi et al map person in horse person ride horse person in shirt person near horse Visual only Visual + language

50 person ride bicycle 50

51 person throw frisbee person throw frisbee 51

52 Zero shot detection person sit chair 948 training examples hydrant on ground 29 training examples 52

53 Zero shot detection person sit chair 948 training examples hydrant on ground 29 training examples person sit hydrant 0 training examples 53

54 Zero shot detection person ride horse 578 training examples person wear hat 1023 training examples 54

55 Zero shot detection person ride horse 578 training examples person wear hat 1023 training examples horse wear hat 0 training examples 55

56 Visual Relationship Detection with Language Priors Cewu Lu*, Ranjay Krishna*, Michael Bernstein, Li Fei-Fei Stanford University Poster #4 Questions? 56

Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection

Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection CSED703R: Deep Learning for Visual Recognition (206S) Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection Bohyung Han Computer Vision Lab. bhhan@postech.ac.kr 2 3 Object detection

More information

Semantic Recognition: Object Detection and Scene Segmentation

Semantic Recognition: Object Detection and Scene Segmentation Semantic Recognition: Object Detection and Scene Segmentation Xuming He xuming.he@nicta.com.au Computer Vision Research Group NICTA Robotic Vision Summer School 2015 Acknowledgement: Slides from Fei-Fei

More information

Bringing Semantics Into Focus Using Visual Abstraction

Bringing Semantics Into Focus Using Visual Abstraction Bringing Semantics Into Focus Using Visual Abstraction C. Lawrence Zitnick Microsoft Research, Redmond larryz@microsoft.com Devi Parikh Virginia Tech parikh@vt.edu Abstract Relating visual information

More information

Module 5. Deep Convnets for Local Recognition Joost van de Weijer 4 April 2016

Module 5. Deep Convnets for Local Recognition Joost van de Weijer 4 April 2016 Module 5 Deep Convnets for Local Recognition Joost van de Weijer 4 April 2016 Previously, end-to-end.. Dog Slide credit: Jose M 2 Previously, end-to-end.. Dog Learned Representation Slide credit: Jose

More information

The goal is multiply object tracking by detection with application on pedestrians.

The goal is multiply object tracking by detection with application on pedestrians. Coupled Detection and Trajectory Estimation for Multi-Object Tracking By B. Leibe, K. Schindler, L. Van Gool Presented By: Hanukaev Dmitri Lecturer: Prof. Daphna Wienshall The Goal The goal is multiply

More information

Fast R-CNN Object detection with Caffe

Fast R-CNN Object detection with Caffe Fast R-CNN Object detection with Caffe Ross Girshick Microsoft Research arxiv code Latest roasts Goals for this section Super quick intro to object detection Show one way to tackle obj. det. with ConvNets

More information

Convolutional Feature Maps

Convolutional Feature Maps Convolutional Feature Maps Elements of efficient (and accurate) CNN-based object detection Kaiming He Microsoft Research Asia (MSRA) ICCV 2015 Tutorial on Tools for Efficient Object Detection Overview

More information

High Level Describable Attributes for Predicting Aesthetics and Interestingness

High Level Describable Attributes for Predicting Aesthetics and Interestingness High Level Describable Attributes for Predicting Aesthetics and Interestingness Sagnik Dhar Vicente Ordonez Tamara L Berg Stony Brook University Stony Brook, NY 11794, USA tlberg@cs.stonybrook.edu Abstract

More information

Semantic Aware Video Transcription Using Random Forest Classifiers

Semantic Aware Video Transcription Using Random Forest Classifiers Semantic Aware Video Transcription Using Random Forest Classifiers Chen Sun and Ram Nevatia University of Southern California, Institute for Robotics and Intelligent Systems Los Angeles, CA 90089, USA

More information

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree. Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table

More information

Box 1 - Article Noun - Blue

Box 1 - Article Noun - Blue the floor a basket the people a broom the igloo the rocks a dog an eagle a bats the leaves a dolphin a triangle the tusks the shoes the book the sky an elephant the ocean a tiger a donkey the shark a cage

More information

Baby Talk: Understanding and Generating Image Descriptions

Baby Talk: Understanding and Generating Image Descriptions Baby Talk: Understanding and Generating Image Descriptions Girish Kulkarni Visruth Premraj Sagnik Dhar Siming Li Yejin Choi Alexander C Berg Tamara L Berg Stony Brook University Stony Brook University,

More information

Transform-based Domain Adaptation for Big Data

Transform-based Domain Adaptation for Big Data Transform-based Domain Adaptation for Big Data Erik Rodner University of Jena Judy Hoffman Jeff Donahue Trevor Darrell Kate Saenko UMass Lowell Abstract Images seen during test time are often not from

More information

Weakly Supervised Object Boundaries Supplementary material

Weakly Supervised Object Boundaries Supplementary material Weakly Supervised Object Boundaries Supplementary material Anna Khoreva Rodrigo Benenson Mohamed Omran Matthias Hein 2 Bernt Schiele Max Planck Institute for Informatics, Saarbrücken, Germany 2 Saarland

More information

Edge Boxes: Locating Object Proposals from Edges

Edge Boxes: Locating Object Proposals from Edges Edge Boxes: Locating Object Proposals from Edges C. Lawrence Zitnick and Piotr Dollár Microsoft Research Abstract. The use of object proposals is an effective recent approach for increasing the computational

More information

CS 1699: Intro to Computer Vision. Deep Learning. Prof. Adriana Kovashka University of Pittsburgh December 1, 2015

CS 1699: Intro to Computer Vision. Deep Learning. Prof. Adriana Kovashka University of Pittsburgh December 1, 2015 CS 1699: Intro to Computer Vision Deep Learning Prof. Adriana Kovashka University of Pittsburgh December 1, 2015 Today: Deep neural networks Background Architectures and basic operations Applications Visualizing

More information

Nombre: I am not playing. You are not (you aren t) eating. He is not (He isn t) reading. She is not (She isn t) sleeping.

Nombre: I am not playing. You are not (you aren t) eating. He is not (He isn t) reading. She is not (She isn t) sleeping. Nombre: PRESENT CONTINOUS Affirmative Negative Interrogative I am (I m) playing. You are (you re) eating He is (He s) reading. She is (She s) sleeping. It is (It s) running. We are (we re) speaking. You

More information

What more can we do with videos?

What more can we do with videos? What more can we do with videos? Occlusion and Mo6on Reasoning for Tracking & Human Pose Es6ma6on Karteek Alahari Inria Grenoble Rhone- Alpes Joint work with Anoop Cherian, Yang Hua, Julien Mairal, Cordelia

More information

Curriculum Text. Texto del curso Texte du cours Kursinhalt Testo del corso. Inglés english UK Level 1. Inglese Britannico.

Curriculum Text. Texto del curso Texte du cours Kursinhalt Testo del corso. Inglés english UK Level 1. Inglese Britannico. English Curriculum Text Level 1 English UK Inglés english UK Level 1 UK Anglais Britannique Englisch UK Inglese Britannico Curriculum Text Texto del curso Texte du cours Kursinhalt Testo del corso RosettaStone.com

More information

MODIFIERS. There are many different types of modifiers. Let's begin by taking a look at the most common ones.

MODIFIERS. There are many different types of modifiers. Let's begin by taking a look at the most common ones. MODIFIERS A modifier is a word, phrase, or clause that describes another word or word group. Many types of words and phrases can act as modifiers, such as adjectives, adverbs, and prepositional phrases.

More information

Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011

Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011 Behavior Analysis in Crowded Environments XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011 Behavior Analysis in Sparse Scenes Zelnik-Manor & Irani CVPR

More information

SSD: Single Shot MultiBox Detector

SSD: Single Shot MultiBox Detector SSD: Single Shot MultiBox Detector Wei Liu 1, Dragomir Anguelov 2, Dumitru Erhan 3, Christian Szegedy 3, Scott Reed 4, Cheng-Yang Fu 1, Alexander C. Berg 1 1 UNC Chapel Hill 2 Zoox Inc. 3 Google Inc. 4

More information

ESL 3 8 8 + 1 62 QUESTION 62 ANSWER 8 LUCKY CARDS

ESL 3 8 8 + 1 62 QUESTION 62 ANSWER 8 LUCKY CARDS Speaking Cards ESL Card Game For 3 8 players Age: 8 + Level of English: High Beginners or False Beginners (at least 1 year of learning English) 62 QUESTION cards, 62 ANSWER cards, 8 LUCKY CARDS (together:

More information

Semantic Image Segmentation and Web-Supervised Visual Learning

Semantic Image Segmentation and Web-Supervised Visual Learning Semantic Image Segmentation and Web-Supervised Visual Learning Florian Schroff Andrew Zisserman University of Oxford, UK Antonio Criminisi Microsoft Research Ltd, Cambridge, UK Outline Part I: Semantic

More information

Nombre: Today is Monday. Yesterday was. Tomorrow will be. Today is Friday. Yesterday was. Tomorrow will be. Today is Wednesday.

Nombre: Today is Monday. Yesterday was. Tomorrow will be. Today is Friday. Yesterday was. Tomorrow will be. Today is Wednesday. Nombre: Days of the week. Read and write. Today is Monday. Yesterday was Tomorrow will be. Today is Friday. Yesterday was Tomorrow will be. Today is Wednesday. Yesterday was Tomorrow will be. Today is

More information

Object Detection from Video Tubelets with Convolutional Neural Networks

Object Detection from Video Tubelets with Convolutional Neural Networks Object Detection from Video Tubelets with Convolutional Neural Networks Kai Kang Wanli Ouyang Hongsheng Li Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong {kkang,wlouyang,hsli,xgwang}@ee.cuhk.edu.hk

More information

Deformable Part Models with CNN Features

Deformable Part Models with CNN Features Deformable Part Models with CNN Features Pierre-André Savalle 1, Stavros Tsogkas 1,2, George Papandreou 3, Iasonas Kokkinos 1,2 1 Ecole Centrale Paris, 2 INRIA, 3 TTI-Chicago Abstract. In this work we

More information

Coaching TOPSoccer. Training Session Activities. 1 US Youth Soccer

Coaching TOPSoccer. Training Session Activities. 1 US Youth Soccer Coaching TOPSoccer Training Session Activities 1 US Youth Soccer Training Session Activities for Down Syndrome THE ACTIVITY I CAN DO THIS, CAN YOU? (WHAT CAN YOU DO?) The coach does something with or without

More information

rarecorvettes.com, joe@rarecorvettes.com, (831) 475-4442 Pacific Time Zone

rarecorvettes.com, joe@rarecorvettes.com, (831) 475-4442 Pacific Time Zone INTRODUCTION TO WHEEL ALIGNMENT A SHORT COURSE ON WHEEL ALIGNMENT, FRONT AND REAR PREPARED FOR THE N.C.R.S. NATIONAL CONVENTION JUNE 29 TO JULY 5, 2012 by: JOE CALCAGNO, RARE CORVETTES rarecorvettes.com,

More information

Pedestrian Detection with RCNN

Pedestrian Detection with RCNN Pedestrian Detection with RCNN Matthew Chen Department of Computer Science Stanford University mcc17@stanford.edu Abstract In this paper we evaluate the effectiveness of using a Region-based Convolutional

More information

Enclosed is a detailed list of parade rules, Insurance requirements and an application form. Please review the

Enclosed is a detailed list of parade rules, Insurance requirements and an application form. Please review the Great Events in Peoria are Brought to You by PACE! Peoria Area Community Events (PACE) P.O. Box 10228 - Peoria, IL 61612-0228 www.peoriaevents.com Dear Santa Parade Participant, October 1, 2015 Candy canes,

More information

Quality Assessment for Crowdsourced Object Annotations

Quality Assessment for Crowdsourced Object Annotations S. VITTAYAKORN, J. HAYS: CROWDSOURCED OBJECT ANNOTATIONS 1 Quality Assessment for Crowdsourced Object Annotations Sirion Vittayakorn svittayakorn@cs.brown.edu James Hays hays@cs.brown.edu Computer Science

More information

Finding people in repeated shots of the same scene

Finding people in repeated shots of the same scene Finding people in repeated shots of the same scene Josef Sivic 1 C. Lawrence Zitnick Richard Szeliski 1 University of Oxford Microsoft Research Abstract The goal of this work is to find all occurrences

More information

Localizing 3D cuboids in single-view images

Localizing 3D cuboids in single-view images Localizing 3D cuboids in single-view images Jianxiong Xiao Bryan C. Russell Antonio Torralba Massachusetts Institute of Technology University of Washington Abstract In this paper we seek to detect rectangular

More information

Scalable Object Detection by Filter Compression with Regularized Sparse Coding

Scalable Object Detection by Filter Compression with Regularized Sparse Coding Scalable Object Detection by Filter Compression with Regularized Sparse Coding Ting-Hsuan Chao, Yen-Liang Lin, Yin-Hsi Kuo, and Winston H Hsu National Taiwan University, Taipei, Taiwan Abstract For practical

More information

Prepositions. TLC/College of the Canyons. Prepared by Kim Haglund, M.Ed: TLC Coordinator

Prepositions. TLC/College of the Canyons. Prepared by Kim Haglund, M.Ed: TLC Coordinator Prepositions TLC/College of the Canyons Prepared by Kim Haglund, M.Ed: TLC Coordinator What is a Preposition? A preposition is a word that shows the relationship of a noun or pronoun to another word in

More information

InstaNet: Object Classification Applied to Instagram Image Streams

InstaNet: Object Classification Applied to Instagram Image Streams InstaNet: Object Classification Applied to Instagram Image Streams Clifford Huang Stanford University chuang8@stanford.edu Mikhail Sushkov Stanford University msushkov@stanford.edu Abstract The growing

More information

SEMANTIC CONTEXT AND DEPTH-AWARE OBJECT PROPOSAL GENERATION

SEMANTIC CONTEXT AND DEPTH-AWARE OBJECT PROPOSAL GENERATION SEMANTIC TEXT AND DEPTH-AWARE OBJECT PROPOSAL GENERATION Haoyang Zhang,, Xuming He,, Fatih Porikli,, Laurent Kneip NICTA, Canberra; Australian National University, Canberra ABSTRACT This paper presents

More information

car boat airplane train taxi bus motorcycle ambulance bicycle tricycle horse mule scooter

car boat airplane train taxi bus motorcycle ambulance bicycle tricycle horse mule scooter car boat airplane train taxi bus cycle ambulance bicycle tricycle horse mule scooter ice SUV truck spaceship camel cable car ferry boat car airplane horse taxi bus cycle ambulance bicycle tricycle cable

More information

Tracking in flussi video 3D. Ing. Samuele Salti

Tracking in flussi video 3D. Ing. Samuele Salti Seminari XXIII ciclo Tracking in flussi video 3D Ing. Tutors: Prof. Tullio Salmon Cinotti Prof. Luigi Di Stefano The Tracking problem Detection Object model, Track initiation, Track termination, Tracking

More information

Object Detection in Video using Faster R-CNN

Object Detection in Video using Faster R-CNN Object Detection in Video using Faster R-CNN Prajit Ramachandran University of Illinois at Urbana-Champaign prmchnd2@illinois.edu Abstract Convolutional neural networks (CNN) currently dominate the computer

More information

Large-scale Knowledge Transfer for Object Localization in ImageNet

Large-scale Knowledge Transfer for Object Localization in ImageNet Large-scale Knowledge Transfer for Object Localization in ImageNet Matthieu Guillaumin ETH Zürich, Switzerland Vittorio Ferrari University of Edinburgh, UK Abstract ImageNet is a large-scale database of

More information

SUBJECT-VERB AGREEMENT

SUBJECT-VERB AGREEMENT SUBJECT-VERB AGREEMENT RULES FOR SUBJECT-VERB AGREEMENT RULE 1 A verb agrees with its subject in number. Singular subjects take singular verbs: The car stays in the garage. The flower smells good. There

More information

Segmentation as Selective Search for Object Recognition

Segmentation as Selective Search for Object Recognition Segmentation as Selective Search for Object Recognition Koen E. A. van de Sande Jasper R. R. Uijlings Theo Gevers Arnold W. M. Smeulders University of Amsterdam University of Trento Amsterdam, The Netherlands

More information

Chapter 6 Quadratic Functions

Chapter 6 Quadratic Functions Chapter 6 Quadratic Functions Determine the characteristics of quadratic functions Sketch Quadratics Solve problems modelled b Quadratics 6.1Quadratic Functions A quadratic function is of the form where

More information

LIBSVX and Video Segmentation Evaluation

LIBSVX and Video Segmentation Evaluation CVPR 14 Tutorial! 1! LIBSVX and Video Segmentation Evaluation Chenliang Xu and Jason J. Corso!! Computer Science and Engineering! SUNY at Buffalo!! Electrical Engineering and Computer Science! University

More information

Automatic Attribute Discovery and Characterization from Noisy Web Data

Automatic Attribute Discovery and Characterization from Noisy Web Data Automatic Attribute Discovery and Characterization from Noisy Web Data Tamara L. Berg 1, Alexander C. Berg 2, and Jonathan Shih 3 1 Stony Brook University, Stony Brook NY 11794, USA, tlberg@cs.sunysb.edu,

More information

CAP 6412 Advanced Computer Vision

CAP 6412 Advanced Computer Vision CAP 6412 Advanced Computer Vision http://www.cs.ucf.edu/~bgong/cap6412.html Boqing Gong Jan 26, 2016 Today Administrivia A bigger picture and some common questions Object detection proposals, by Samer

More information

Nombre: Today is Monday. Yesterday was. Tomorrow will be. Today is Friday. Yesterday was. Tomorrow will be. Today is Wednesday.

Nombre: Today is Monday. Yesterday was. Tomorrow will be. Today is Friday. Yesterday was. Tomorrow will be. Today is Wednesday. Nombre: Days of the week. Read and complete. Today is Monday. Yesterday was Tomorrow will be. Today is Friday. Yesterday was Tomorrow will be. Today is Wednesday. Yesterday was Tomorrow will be. Today

More information

First-Person Activity Recognition: What Are They Doing to Me?

First-Person Activity Recognition: What Are They Doing to Me? IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, OR, June 23 First-Person Activity Recognition: What Are They Doing to Me? M. S. Ryoo and Larry Matthies Jet Propulsion Laboratory,

More information

Object Recognition. Selim Aksoy. Bilkent University saksoy@cs.bilkent.edu.tr

Object Recognition. Selim Aksoy. Bilkent University saksoy@cs.bilkent.edu.tr Image Classification and Object Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Image classification Image (scene) classification is a fundamental

More information

Bert Huang Department of Computer Science Virginia Tech

Bert Huang Department of Computer Science Virginia Tech This paper was submitted as a final project report for CS6424/ECE6424 Probabilistic Graphical Models and Structured Prediction in the spring semester of 2016. The work presented here is done by students

More information

Learning and transferring mid-level image representions using convolutional neural networks

Learning and transferring mid-level image representions using convolutional neural networks Willow project-team Learning and transferring mid-level image representions using convolutional neural networks Maxime Oquab, Léon Bottou, Ivan Laptev, Josef Sivic 1 Image classification (easy) Is there

More information

Università degli Studi di Bologna

Università degli Studi di Bologna Università degli Studi di Bologna DEIS Biometric System Laboratory Incremental Learning by Message Passing in Hierarchical Temporal Memory Davide Maltoni Biometric System Laboratory DEIS - University of

More information

Energy - Kinetic Energy and Potenial Energy

Energy - Kinetic Energy and Potenial Energy Energy - Kinetic Energy and Potenial Energy Developed by Nicole Gerardo (Gouverneur MS) through a grant provided by the Title ll Part B Math/Science Partnership and in collaboration with Sheri Amsel and

More information

1. The more idioms you know how to use correctly, the more natural your speech will sound

1. The more idioms you know how to use correctly, the more natural your speech will sound What are idioms? Idioms are words, phrases, or expressions which are often grammatically strange and are not meant to be understood literally. Idioms are a very important part of any language, so learning

More information

Explore 3: Crash Test Dummies

Explore 3: Crash Test Dummies Explore : Crash Test Dummies Type of Lesson: Learning Goal & Instructiona l Objectives Content with Process: Focus on constructing knowledge through active learning. Students investigate Newton s first

More information

Clustering Connectionist and Statistical Language Processing

Clustering Connectionist and Statistical Language Processing Clustering Connectionist and Statistical Language Processing Frank Keller keller@coli.uni-sb.de Computerlinguistik Universität des Saarlandes Clustering p.1/21 Overview clustering vs. classification supervised

More information

Transfer Learning by Borrowing Examples for Multiclass Object Detection

Transfer Learning by Borrowing Examples for Multiclass Object Detection Transfer Learning by Borrowing Examples for Multiclass Object Detection Joseph J. Lim CSAIL, MIT lim@csail.mit.edu Ruslan Salakhutdinov Department of Statistics, University of Toronto rsalakhu@utstat.toronto.edu

More information

Nombre: Today is Monday. Yesterday was. Tomorrow will be. Today is Friday. Yesterday was. Tomorrow will be. Today is Wednesday.

Nombre: Today is Monday. Yesterday was. Tomorrow will be. Today is Friday. Yesterday was. Tomorrow will be. Today is Wednesday. Days of the week. Read and complete. Today is Monday. Yesterday was Tomorrow will be. Today is Friday. Yesterday was Tomorrow will be. Today is Wednesday. Yesterday was Tomorrow will be. Today is Saturday.

More information

Character Image Patterns as Big Data

Character Image Patterns as Big Data 22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,

More information

Image Compression through DCT and Huffman Coding Technique

Image Compression through DCT and Huffman Coding Technique International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Rahul

More information

Young Learners English

Young Learners English University of Cambridge ESOL Examinations Young Learners English Starters Information for Candidates Information for candidates YLE Starters Dear Parent Thank you for encouraging your child to learn English

More information

Machine Learning Final Project Spam Email Filtering

Machine Learning Final Project Spam Email Filtering Machine Learning Final Project Spam Email Filtering March 2013 Shahar Yifrah Guy Lev Table of Content 1. OVERVIEW... 3 2. DATASET... 3 2.1 SOURCE... 3 2.2 CREATION OF TRAINING AND TEST SETS... 4 2.3 FEATURE

More information

Chapter 3 Growing with Verbs 77

Chapter 3 Growing with Verbs 77 Chapter 3 Growing with Verbs 77 3.2 Direct Objects A direct object is the noun or pronoun that receives the direct action of a verb. The verb used with a direct object is always an action verb and is called

More information

Administrivia. Traditional Recognition Approach. Overview. CMPSCI 370: Intro. to Computer Vision Deep learning

Administrivia. Traditional Recognition Approach. Overview. CMPSCI 370: Intro. to Computer Vision Deep learning : Intro. to Computer Vision Deep learning University of Massachusetts, Amherst April 19/21, 2016 Instructor: Subhransu Maji Finals (everyone) Thursday, May 5, 1-3pm, Hasbrouck 113 Final exam Tuesday, May

More information

A picture is worth five captions

A picture is worth five captions A picture is worth five captions Learning visually grounded word and sentence representations Grzegorz Chrupała (with Ákos Kádár and Afra Alishahi) Learning word (and phrase) meanings Cross-situational

More information

Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation

Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek & Cordelia Schmid LEAR Team, INRIA Rhône-Alpes Grenoble, France Discriminative

More information

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform

More information

A Bayesian Framework for Unsupervised One-Shot Learning of Object Categories

A Bayesian Framework for Unsupervised One-Shot Learning of Object Categories A Bayesian Framework for Unsupervised One-Shot Learning of Object Categories Li Fei-Fei 1 Rob Fergus 1,2 Pietro Perona 1 1. California Institute of Technology 2. Oxford University Single Object Object

More information

R-CNN minus R. 1 Introduction. Karel Lenc http://www.robots.ox.ac.uk/~karel. Department of Engineering Science, University of Oxford, Oxford, UK.

R-CNN minus R. 1 Introduction. Karel Lenc http://www.robots.ox.ac.uk/~karel. Department of Engineering Science, University of Oxford, Oxford, UK. LENC, VEDALDI: R-CNN MINUS R 1 R-CNN minus R Karel Lenc http://www.robots.ox.ac.uk/~karel Andrea Vedaldi http://www.robots.ox.ac.uk/~vedaldi Department of Engineering Science, University of Oxford, Oxford,

More information

Task-driven Progressive Part Localization for Fine-grained Recognition

Task-driven Progressive Part Localization for Fine-grained Recognition Task-driven Progressive Part Localization for Fine-grained Recognition Chen Huang Zhihai He chenhuang@mail.missouri.edu University of Missouri hezhi@missouri.edu Abstract In this paper we propose a task-driven

More information

Mean-Shift Tracking with Random Sampling

Mean-Shift Tracking with Random Sampling 1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of

More information

Warmups and Energizers

Warmups and Energizers Warmups and Energizers Warmups or energizers are activities the trainer uses throughout the course to encourage participant involvement and interaction. These activities may be used at the beginning of

More information

3D Model based Object Class Detection in An Arbitrary View

3D Model based Object Class Detection in An Arbitrary View 3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/

More information

Latest Advances in Deep Learning. Yao Chou

Latest Advances in Deep Learning. Yao Chou Latest Advances in Deep Learning Yao Chou Outline Introduction Images Classification Object Detection R-CNN Traditional Feature Descriptor Selective Search Implementation Latest Application Deep Learning

More information

Evaluating Sources and Strategies for Learning Video Concepts from Social Media

Evaluating Sources and Strategies for Learning Video Concepts from Social Media Evaluating Sources and Strategies for Learning Video Concepts from Social Media Svetlana Kordumova Intelligent Systems Lab Amsterdam University of Amsterdam The Netherlands Email: s.kordumova@uva.nl Xirong

More information

Learning Spatial Context: Using Stuff to Find Things

Learning Spatial Context: Using Stuff to Find Things Learning Spatial Context: Using Stuff to Find Things Geremy Heitz Daphne Koller Department of Computer Science, Stanford University {gaheitz,koller}@cs.stanford.edu Abstract. The sliding window approach

More information

ALBANY AREA YMCA HOMESCHOOL P.E. PROGRAM CURRICULUM GUIDE

ALBANY AREA YMCA HOMESCHOOL P.E. PROGRAM CURRICULUM GUIDE ALBANY AREA YMCA HOMESCHOOL P.E. PROGRAM CURRICULUM GUIDE The Albany Area YMCA Homeschool P.E. program provides a quality Christian-based course that is convenient and creative. The program offers age-appropriate

More information

Nombre: I am not playing. You are not (you aren t) eating. He is not (He isn t) reading. She is not (She isn t) sleeping.

Nombre: I am not playing. You are not (you aren t) eating. He is not (He isn t) reading. She is not (She isn t) sleeping. PRESENT CONTINUOUS Affirmative Negative Interrogative I am (I m) playing. You are (you re) eating He is (He s) reading. She is (She s) sleeping. It is (It s) running. We are (we re) speaking. You are (you

More information

A Note to Parents. 1. As you study the list, vary the order of the words.

A Note to Parents. 1. As you study the list, vary the order of the words. A Note to Parents This Wordbook contains all the sight words we will be studying throughout the year plus some additional enrichment words. Your child should spend some time each week studying this Wordbook

More information

Image Captioning A survey of recent deep-learning approaches

Image Captioning A survey of recent deep-learning approaches Image Captioning A survey of recent deep-learning approaches Kiran Vodrahalli February 23, 2015 The task We want to automatically describe images with words Why? 1) It's cool 2) Useful for tech companies

More information

Studying Relationships Between Human Gaze, Description, and Computer Vision

Studying Relationships Between Human Gaze, Description, and Computer Vision Studying Relationships Between Human Gaze, Description, and Computer Vision Kiwon Yun1, Yifan Peng1, Dimitris Samaras1, Gregory J. Zelinsky2, Tamara L. Berg1 1 Department of Computer Science, 2 Department

More information

Azure Machine Learning, SQL Data Mining and R

Azure Machine Learning, SQL Data Mining and R Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:

More information

Fast Matching of Binary Features

Fast Matching of Binary Features Fast Matching of Binary Features Marius Muja and David G. Lowe Laboratory for Computational Intelligence University of British Columbia, Vancouver, Canada {mariusm,lowe}@cs.ubc.ca Abstract There has been

More information

arxiv:1506.03365v2 [cs.cv] 19 Jun 2015

arxiv:1506.03365v2 [cs.cv] 19 Jun 2015 LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop Fisher Yu Yinda Zhang Shuran Song Ari Seff Jianxiong Xiao arxiv:1506.03365v2 [cs.cv] 19 Jun 2015 Princeton

More information

Object Categorization using Co-Occurrence, Location and Appearance

Object Categorization using Co-Occurrence, Location and Appearance Object Categorization using Co-Occurrence, Location and Appearance Carolina Galleguillos Andrew Rabinovich Serge Belongie Department of Computer Science and Engineering University of California, San Diego

More information

Scale-Invariant Object Categorization using a Scale-Adaptive Mean-Shift Search

Scale-Invariant Object Categorization using a Scale-Adaptive Mean-Shift Search in DAGM 04 Pattern Recognition Symposium, Tübingen, Germany, Aug 2004. Scale-Invariant Object Categorization using a Scale-Adaptive Mean-Shift Search Bastian Leibe ½ and Bernt Schiele ½¾ ½ Perceptual Computing

More information

FIRST TERM ENGLISH WORKSHEET 2016-2017

FIRST TERM ENGLISH WORKSHEET 2016-2017 FIRST TERM ENGLISH WORKSHEET 2016-2017 Class -1 1& 11 GIRLS SECTION 1.Make sentences. Flower - Mother - Water - Butterfly - Lovely - Big - 1 2. Circle the correct word. a. It is [Mother s / Father s] birthday.

More information

Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite

Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite Philip Lenz 1 Andreas Geiger 2 Christoph Stiller 1 Raquel Urtasun 3 1 KARLSRUHE INSTITUTE OF TECHNOLOGY 2 MAX-PLANCK-INSTITUTE IS 3

More information

How To Generate Object Proposals On A Computer With A Large Image Of A Large Picture

How To Generate Object Proposals On A Computer With A Large Image Of A Large Picture Geodesic Object Proposals Philipp Krähenbühl 1 and Vladlen Koltun 2 1 Stanford University 2 Adobe Research Abstract. We present an approach for identifying a set of candidate objects in a given image.

More information

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles mjhustc@ucla.edu and lunbo

More information

Difference between a vector and a scalar quantity. N or 90 o. S or 270 o

Difference between a vector and a scalar quantity. N or 90 o. S or 270 o Vectors Vectors and Scalars Distinguish between vector and scalar quantities, and give examples of each. method. A vector is represented in print by a bold italicized symbol, for example, F. A vector has

More information

Parts of Speech. Skills Team, University of Hull

Parts of Speech. Skills Team, University of Hull Parts of Speech Skills Team, University of Hull Language comes before grammar, which is only an attempt to describe a language. Knowing the grammar of a language does not mean you can speak or write it

More information

PHYS 117- Exam I. Multiple Choice Identify the letter of the choice that best completes the statement or answers the question.

PHYS 117- Exam I. Multiple Choice Identify the letter of the choice that best completes the statement or answers the question. PHYS 117- Exam I Multiple Choice Identify the letter of the choice that best completes the statement or answers the question. 1. Car A travels from milepost 343 to milepost 349 in 5 minutes. Car B travels

More information

English for Spanish Speakers. Second Edition. Caroline Nixon & Michael Tomlinson

English for Spanish Speakers. Second Edition. Caroline Nixon & Michael Tomlinson English for Spanish Speakers Second Edition s 1 Caroline Nixon & Michael Tomlinson 1 Red and yellow and pink and green, Orange and purple and blue. I can sing a rainbow, Sing a rainbow, Sing a rainbow

More information

Unsupervised Discovery of Mid-Level Discriminative Patches

Unsupervised Discovery of Mid-Level Discriminative Patches Unsupervised Discovery of Mid-Level Discriminative Patches Saurabh Singh, Abhinav Gupta, and Alexei A. Efros Carnegie Mellon University, Pittsburgh, PA 15213, USA http://graphics.cs.cmu.edu/projects/discriminativepatches/

More information

Jointly Optimizing 3D Model Fitting and Fine-Grained Classification

Jointly Optimizing 3D Model Fitting and Fine-Grained Classification Jointly Optimizing 3D Model Fitting and Fine-Grained Classification Yen-Liang Lin 1, Vlad I. Morariu 2, Winston Hsu 1, and Larry S. Davis 2 1 National Taiwan University, Taipei, Taiwan 2 University of

More information

Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification

Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati tnpatil2@gmail.com, ss_sherekar@rediffmail.com

More information

MODULE 4: Passenger Safety

MODULE 4: Passenger Safety Foundation, Levels 1 and 2 MODULE 4: Passenger Safety Activities Being a responsible car passenger Being a responsible bus passenger Links to AusVELS Foundation Level English Speaking and Listening: Literacy

More information