Total Cluster: A person agnostic clustering method for broadcast videos

1 Total Cluster: A person agnostic clustering method for broadcast videos
Makarand Tapaswi, Omkar M. Parkhi, Esa Rahtu, Eric Sommerlade, Rainer Stiefelhagen, Andrew Zisserman

2 Face track clustering
- Input: video with face tracks
- Output: face track clusters

3 Why cluster face tracks?
- Large number of videos
- Good basis for manual or automatic labelling
- Facilitates video retrieval or summarization

4 Overview
- Editing structure in videos: shots, threads and scenes
- Using editing structure for clustering: negative pairs, scene level clustering, episode level clustering
- Dataset and evaluation: Scrubs, Buffy; weighted clustering purity
- Clustering results

5 Related work
- Person identification in TV series: (Everingham et al., 2006), (Cour et al., 2009), (Bojanowski et al., 2013), (Ramanathan et al., 2014)
- Face clustering: (Ramanan et al., 2007), (Cinbis et al., 2011), (Khoury et al., 2013), (Wu et al., 2013, 2013b)
- Novelty of the current work: error free clustering; utilize video-editing structure; use state-of-the-art face track descriptor

7 Video editing overview
- Episode, scenes, threads, shots

8 Shots and Threads
- shot (n.): A sequence of uninterrupted frames for a short period of time
- thread (n.): A set of shots taken from the same camera with the same viewing angle and content
- Example (Scrubs S01 E01): shots labelled A B A C D C D
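The relation between shots and threads can be illustrated with a minimal sketch. It assumes each shot already carries a camera-setup label (the letters in the example above); how those labels are obtained is not described on the slide, and the function name `group_shots_into_threads` is hypothetical.

```python
from collections import defaultdict


def group_shots_into_threads(shot_labels):
    """Group shot indices by camera-setup label.

    shot_labels: one label per shot, in temporal order (assumed given).
    Returns a dict mapping each label to the indices of its shots.
    """
    threads = defaultdict(list)
    for shot_index, label in enumerate(shot_labels):
        threads[label].append(shot_index)
    return dict(threads)


# Illustrative label sequence taken from the slide's example figure.
threads = group_shots_into_threads(["A", "B", "A", "C", "D", "C", "D"])
```

Here labels A, C and D each collect two alternating shots, while B is a thread of a single shot.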

9 Scenes
- scene (n.): A sequence of shots in a video with the same characters, at the same location and time
- Example: Scene 4, Scene 5, Scene 6
- Scene boundary detection (Tapaswi et al. 2014): maximize within-scene visual similarity; don't break shot threads

10 Overview
- Editing structure in videos: shots, threads and scenes
- Using editing structure for clustering: face tracking, negative pairs, scene level clustering, episode level clustering
- Dataset and evaluation: Scrubs, Buffy; weighted clustering purity
- Clustering results

11 Face tracks
- Face tracking: frontal+profile VJ detector, multi-pose head detector, KLT tracking, false positive removal
- Face track descriptor (Parkhi et al. 2014): dense SIFT, Fisher encoding, discriminative dimensionality reduction

12 Negative constraints
- Do NOT merge these pairs!
- Tracks in the same frame (Cinbis et al. 2011, Wu et al. 2013)
- Tracks in a thread
- (Figure: shots labelled A B A C B)
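The same-frame constraint can be sketched as follows. This is a minimal illustration, not the authors' implementation: the `Track` structure, its frame fields, and the function name are assumptions, and the thread-based constraint is left out because the slide does not spell out its rule.

```python
from itertools import combinations
from typing import NamedTuple


class Track(NamedTuple):
    track_id: str
    first_frame: int
    last_frame: int  # inclusive


def same_frame_negative_pairs(tracks):
    """Return do-not-merge pairs: tracks that share at least one frame."""
    pairs = set()
    for a, b in combinations(tracks, 2):
        # Two inclusive frame intervals overlap if each starts before the other ends.
        if a.first_frame <= b.last_frame and b.first_frame <= a.last_frame:
            pairs.add(frozenset((a.track_id, b.track_id)))
    return pairs


# Hypothetical tracks: T1 and T2 co-occur in frames 30..50, T3 appears later.
tracks = [Track("T1", 0, 50), Track("T2", 30, 80), Track("T3", 90, 120)]
negative_pairs = same_frame_negative_pairs(tracks)
```

Two faces visible in the same frame cannot belong to the same person, so only the pair (T1, T2) is marked as do-not-merge here.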

14 Within scene clustering
- For tracks not in negative pairs:
  - Strong face similarity (128D)
  - In a thread: area overlap and relaxed face similarity
- Example: tracks T1, T2, T3, T4; characters Dr. Kelso and Turk
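One way to picture merging under negative constraints is the greedy sketch below. It is an assumption-laden illustration rather than the method on the slide: the pairwise distances, the threshold value, and the function name `constrained_merge` are all hypothetical, and the thread-specific area-overlap rule is not modelled.

```python
def constrained_merge(track_ids, distances, negative_pairs, threshold):
    """Greedily merge the closest track pairs, skipping forbidden merges.

    distances: dict mapping frozenset({a, b}) -> face distance (assumed given).
    negative_pairs: set of frozenset({a, b}) that must never share a cluster.
    """
    members = {t: {t} for t in track_ids}  # cluster representative -> tracks
    owner = {t: t for t in track_ids}      # track -> cluster representative
    for pair, dist in sorted(distances.items(), key=lambda item: item[1]):
        if dist > threshold:
            break  # remaining pairs are too dissimilar
        a, b = tuple(pair)
        ra, rb = owner[a], owner[b]
        if ra == rb:
            continue  # already in the same cluster
        # Refuse the merge if any cross-cluster pair is a negative pair.
        violates = any(
            frozenset((x, y)) in negative_pairs
            for x in members[ra]
            for y in members[rb]
        )
        if violates:
            continue
        members[ra] |= members[rb]
        for t in members[rb]:
            owner[t] = ra
        del members[rb]
    return sorted(sorted(group) for group in members.values())


# Hypothetical distances between four tracks in one scene.
distances = {
    frozenset(("T1", "T2")): 0.2,
    frozenset(("T3", "T4")): 0.3,
    frozenset(("T1", "T3")): 0.4,
    frozenset(("T2", "T4")): 0.9,
}
negative_pairs = {frozenset(("T2", "T3"))}
clusters = constrained_merge(
    ["T1", "T2", "T3", "T4"], distances, negative_pairs, threshold=0.5
)
```

Without the negative pair, the third merge (T1 with T3) would join all four tracks into one cluster; with it, the two clusters stay separate.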

16 Cluster descriptor
- Compare frontal vs. frontal, profile vs. profile
- Frontal: frontal descriptor
- Mixed: both descriptors
- Profile: profile descriptor

17 Full episode clustering
- Cluster across scenes
- Exemplar SVMs (e-SVM): one positive cluster against negative clusters + YouTube Faces
- Score clusters with e-SVM
- Merge high-scoring clusters

18 Recap
- Video editing structure
- Do not merge track pairs
- Scene level clustering
- Episode level clustering

20 Data set
- SCRUBS: episodes 1..5, 23; ~20 minutes each; sitcom, interns at a hospital
- BUFFY: episodes 1..6; ~40 minutes each; supernatural drama, action
- Table rows (average per episode, SCRUBS vs. BUFFY) [values not transcribed]: # named characters; # tracks; # tracks in thread; # do not merge pairs in shots; # do not merge pairs in threads

21 Evaluation metrics
- Goal: minimize #clusters, don't make errors!
- WCP: weighted clustering purity
  - Cluster 1: purity $p_1 = 3/3$
  - Cluster 2: purity $p_2 = 4/5$
  - $\mathrm{WCP} = \frac{1}{N} \sum_c n_c \, p_c$
- Trade-off: WCP vs. #clusters
- (Plot: purity vs. #clusters)
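The weighted clustering purity can be worked through on the slide's two-cluster example. The sketch below assumes the usual reading of the symbols, which the slide does not define: n_c is the number of tracks in cluster c, N is the total number of tracks, and p_c is the fraction of tracks in cluster c that carry its most frequent label. The character labels are placeholders.

```python
from collections import Counter
from fractions import Fraction


def weighted_clustering_purity(clusters):
    """Compute WCP = (1/N) * sum_c n_c * p_c.

    clusters: list of non-empty lists, each holding the ground-truth
    label of every track assigned to that cluster.
    """
    total_tracks = sum(len(labels) for labels in clusters)  # N
    weighted_sum = Fraction(0)
    for labels in clusters:
        n_c = len(labels)
        majority_count = Counter(labels).most_common(1)[0][1]
        p_c = Fraction(majority_count, n_c)  # purity of this cluster
        weighted_sum += n_c * p_c
    return weighted_sum / total_tracks


# Slide example: cluster 1 has purity 3/3, cluster 2 has purity 4/5.
clusters = [
    ["Turk", "Turk", "Turk"],
    ["Dr. Kelso", "Dr. Kelso", "Dr. Kelso", "Dr. Kelso", "Turk"],
]
wcp = weighted_clustering_purity(clusters)
```

Under these assumptions the example gives (3 * 1 + 5 * 4/5) / 8 = 7/8, so a single wrongly assigned track among eight lowers WCP from 1 to 0.875.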

22 Within scene clustering
- (Bar charts: #clusters per episode for SCRUBS (SCR-1, SCR-2, SCR-3, SCR-4, SCR) and BUFFY (BF-1, BF-2, BF-3, BF-5, BF-6), comparing Tracks, Best HAC and Ours)
- Lower number of clusters at purity = 1
- Prevent errors

23 Full episode clustering
- (Bar charts: #clusters per episode for SCRUBS (SCR-1, SCR-2, SCR-3, SCR-4, SCR) and BUFFY (BF-1, BF-2, BF-3, BF-5, BF-6), comparing Tracks, Best HAC and Ours)
- Hard to choose threshold for HAC
- Proposed method reduces #clusters at purity 1

24 Summary
- Minimize #clusters at very high purity
  - Affords manual and automatic annotation
- Leverage video editing structure
  - Face tracks are not independent data points
  - 4x number of do not merge track pairs
- Stage-wise clustering
  - Small number of characters in one scene
  - At purity 1, #clusters is about half #tracks

25 Thank you! Questions?
Sections: Intro, Related work, Video-editing, Negative pairs, Scene clustering, Episode clustering, Dataset, Evaluation


More information

Support Vector Machine (SVM)

Support Vector Machine (SVM) Support Vector Machine (SVM) CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Margin concept Hard-Margin SVM Soft-Margin SVM Dual Problems of Hard-Margin

More information

The Delicate Art of Flower Classification

The Delicate Art of Flower Classification The Delicate Art of Flower Classification Paul Vicol Simon Fraser University University Burnaby, BC pvicol@sfu.ca Note: The following is my contribution to a group project for a graduate machine learning

More information

Removing Moving Objects from Point Cloud Scenes

Removing Moving Objects from Point Cloud Scenes 1 Removing Moving Objects from Point Cloud Scenes Krystof Litomisky klitomis@cs.ucr.edu Abstract. Three-dimensional simultaneous localization and mapping is a topic of significant interest in the research

More information

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany Information Systems University of Koblenz Landau, Germany Semantic Multimedia Management - Multimedia Annotation Tools http://isweb.uni-koblenz.de Multimedia Annotation Different levels of annotations

More information

T O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN

T O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN T O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN Goal is to process 360 degree images and detect two object categories 1. Pedestrians,

More information
