Total Cluster: A person agnostic clustering method for broadcast videos
|
|
- Darcy Allen
- 7 years ago
- Views:
Transcription
1 Total Cluster: A person agnostic clustering method for broadcast videos Makarand Tapaswi, Omkar M. Parkhi, Esa Rahtu, Eric Sommerlade, Rainer Stiefelhagen, Andrew Zisserman
2 Face track clustering Input Video with face tracks Output Face track clusters 2
3 Why cluster face tracks? Large number of videos Good basis for manual or automatic labelling Facilitates video retrieval or summarization 3
4 Overview Editing structure in videos Shots, threads and scenes Using editing structure for clustering Negative pairs Scene level clustering Episode level clustering Dataset and evaluation Scrubs, Buffy Weighted clustering purity Clustering results T1 T4 T2 T3
5 Related work Person Identification in TV series (Everingham et al., 2006) (Cour et al., 2009) (Bojanowski et al., 2013) (Ramanathan et al., 2014) Face Clustering (Ramanan et al., 2007) (Cinbis et al., 2011) (Khoury et al., 2013) (Wu et al., 2013, 2013b) Novelty of the current work Error free clustering Utilize video-editing structure Use state-of-the-art face track descriptor 5
6 Overview Editing structure in videos Shots, threads and scenes Using editing structure for clustering Negative pairs Scene level clustering Episode level clustering Dataset and evaluation Scrubs, Buffy Weighted clustering purity Clustering results T1 T4 T2 T3
7 Video editing overview Episode Scenes Threads Shots 7
8 Shots and Threads shot thread (n.) (n.) A sequence set of shots of taken uninterrupted from the frames same camera for a short with the period same of viewing time angle and content A B A C D C Scrubs S01 E01 D 8
9 Scenes scene (n.) A sequence of shots in a video with the same characters, at the same location and time Scene 4 Scene 5 Scene 6 Scene boundary detection (Tapaswi et al. 2014) Maximize within-scene visual similarity Don t break shot threads 9
10 Overview Editing structure in videos Shots, threads and scenes Using editing structure for clustering Face tracking Negative pairs Scene level clustering Episode level clustering Dataset and evaluation Scrubs, Buffy Weighted clustering purity Clustering results T1 T4 T2 T3
11 Face tracks Face tracking Face track descriptor (Parkhi et al. 2014) frontal+profile VJ detector multi-pose head detector KLT tracking false positive removal dense SIFT Fisher encoding discriminative dimensionality reduction 11
12 Negative constraints do NOT merge these pairs! Tracks in the same frame (Cinbis et al. 2011, Wu et al. 2013) Tracks in a thread A B A C B 12
13 Overview Editing structure in videos Shots, threads and scenes Using editing structure for clustering Negative pairs Scene level clustering Episode level clustering Dataset and evaluation Scrubs, Buffy Weighted clustering purity Clustering results T1 T4 T2 T3
14 Within scene clustering for tracks not in negative pairs Strong face similarity T1 T2 In a thread, area overlap relaxed face similarity 128D T4 T3 T1 T2 Dr. Kelso Turk 14
15 Overview Editing structure in videos Shots, threads and scenes Using editing structure for clustering Negative pairs Scene level clustering Episode level clustering Dataset and evaluation Scrubs, Buffy Weighted clustering purity Clustering results T1 T4 T2 T3
16 Cluster descriptor compare frontal vs. frontal, profile vs. profile Frontal Frontal descriptor Mixed Both descriptors Profile Profile descriptor 16
17 Full episode clustering Exemplar SVMs Cluster across scenes one positive cluster against negative clusters + YouTube Faces score clusters with e-svm merge high-scoring 17
18 Recap Video editing structure A B A B Do not merge track pairs T1 T2 Scene level clustering Turk Episode level clustering 18
19 Overview Editing structure in videos Shots, threads and scenes Using editing structure for clustering Negative pairs Scene level clustering Episode level clustering Dataset and evaluation Scrubs, Buffy Weighted clustering purity Clustering results T1 T4 T2 T3
20 Data set episodes 1..5, 23 ~20 minutes each sitcom, interns at a hospital episodes 1..6 ~40 minutes each supernatural drama, action average episode SCRUBS BUFFY # named characters # tracks # tracks in thread # do not merge pairs in shots in threads
21 Evaluation metrics Goal: minimize #clusters, don t make errors! WCP: weighted clustering purity Cluster 1: Purity p 1 = 3/3 Cluster 2: Purity p 2 = 4/5 WCP: 1 N c n c p c = Trade-off WCP vs. #clusters Cluster 1 Purity: 1 Cluster 2 Purity: Purity # clusters 21
22 #Clusters Within scene clustering #Clusters SCRUBS Tracks Best HAC Ours BUFFY SCR-1 SCR-2 SCR-3 SCR-4 SCR BF-1 BF-2 BF-3 BF-5 BF-6 Lower number of purity = 1 Prevent errors 22
23 #Clusters Full episode clustering #Clusters SCRUBS Tracks Best HAC Ours BUFFY 100 SCR-1 SCR-2 SCR-3 SCR-4 SCR BF-1 BF-2 BF-3 BF-5 BF-6 Full episode clustering hard to choose threshold for HAC proposed method reduces purity 1 23
24 Summary 1 Purity Minimize #clusters at very high purity Affords manual and automatic annotation Leverage video editing structure Face tracks are not independent data points 4x number of do not merge track pairs A B # clusters A Stage-wise clustering Small number of characters in one scene At purity 1, #clusters is about half #tracks 24
25 Thank you! Questions? Intro, Related work, Video-editing, Negative pairs Scene clustering, Episode clustering, Dataset, Evaluation
Interactive person re-identification in TV series
Interactive person re-identification in TV series Mika Fischer Hazım Kemal Ekenel Rainer Stiefelhagen CV:HCI lab, Karlsruhe Institute of Technology Adenauerring 2, 76131 Karlsruhe, Germany E-mail: {mika.fischer,ekenel,rainer.stiefelhagen}@kit.edu
More informationWho are you? Learning person specific classifiers from video
Who are you? Learning person specific classifiers from video Josef Sivic, Mark Everingham 2 and Andrew Zisserman 3 INRIA, WILLOW Project, Laboratoire d Informatique de l Ecole Normale Superieure, Paris,
More informationImage and Vision Computing
Image and Vision Computing 27 (2009) 545 559 Contents lists available at ScienceDirect Image and Vision Computing journal homepage: www.elsevier.com/locate/imavis Taking the bite out of automated naming
More informationRecognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang
Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform
More informationRecognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28
Recognition Topics that we will try to cover: Indexing for fast retrieval (we still owe this one) History of recognition techniques Object classification Bag-of-words Spatial pyramids Neural Networks Object
More informationHow To Cluster Of Complex Systems
Entropy based Graph Clustering: Application to Biological and Social Networks Edward C Kenley Young-Rae Cho Department of Computer Science Baylor University Complex Systems Definition Dynamically evolving
More informationMovie/Script: Alignment and Parsing of Video and Text Transcription
Movie/Script: Alignment and Parsing of Video and Text Transcription Timothee Cour, Chris Jordan, Eleni Miltsakaki, Ben Taskar University of Pennsylvania, Philadelphia, PA 19104, USA, {timothee,wjc,elenimi,taskar}@seas.upenn.edu
More informationLocal features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
More informationUNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003. Yun Zhai, Zeeshan Rasheed, Mubarak Shah
UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003 Yun Zhai, Zeeshan Rasheed, Mubarak Shah Computer Vision Laboratory School of Computer Science University of Central Florida, Orlando, Florida ABSTRACT In this
More informationTattoo Detection for Soft Biometric De-Identification Based on Convolutional NeuralNetworks
1 Tattoo Detection for Soft Biometric De-Identification Based on Convolutional NeuralNetworks Tomislav Hrkać, Karla Brkić, Zoran Kalafatić Faculty of Electrical Engineering and Computing University of
More informationNeovision2 Performance Evaluation Protocol
Neovision2 Performance Evaluation Protocol Version 3.0 4/16/2012 Public Release Prepared by Rajmadhan Ekambaram rajmadhan@mail.usf.edu Dmitry Goldgof, Ph.D. goldgof@cse.usf.edu Rangachar Kasturi, Ph.D.
More information2010 School-assessed Task Report. Media
2010 School-assessed Task Report Media GENERAL COMMENTS Task summary This task involves three outcomes, two in Unit 3 and the one in Unit 4. In Unit 3, students undertake Outcome 2 Media Production Skills,
More informationCees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.
Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table
More informationTracking performance evaluation on PETS 2015 Challenge datasets
Tracking performance evaluation on PETS 2015 Challenge datasets Tahir Nawaz, Jonathan Boyle, Longzhen Li and James Ferryman Computational Vision Group, School of Systems Engineering University of Reading,
More informationEverett Public Schools Framework: Digital Video Production II
Course: CIP Code: 100202 Career Cluster: Video ProductionTechnology/Technician Everett Public Schools Framework: Digital Video Production II Arts, Audio/Video Technology & Communications Total Framework
More informationClassification of Bad Accounts in Credit Card Industry
Classification of Bad Accounts in Credit Card Industry Chengwei Yuan December 12, 2014 Introduction Risk management is critical for a credit card company to survive in such competing industry. In addition
More informationAutomatic Maritime Surveillance with Visual Target Detection
Automatic Maritime Surveillance with Visual Target Detection Domenico Bloisi, PhD bloisi@dis.uniroma1.it Maritime Scenario Maritime environment represents a challenging scenario for automatic video surveillance
More informationSpeed Performance Improvement of Vehicle Blob Tracking System
Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA sungchun@usc.edu, nevatia@usc.edu Abstract. A speed
More informationHow does Person Identity Recognition Help Multi-Person Tracking?
How does Person Identity Recognition Help Multi-Person Tracking? Cheng-Hao Kuo and Ram Nevatia University of Southern California, Institute for Robotics and Intelligent Systems Los Angeles, CA 90089, USA
More informationProbabilistic Latent Semantic Analysis (plsa)
Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg Rainer.Lienhart@informatik.uni-augsburg.de www.multimedia-computing.{de,org} References
More informationOnline, Real-Time Tracking Using a Category-to-Individual Detector
Online, Real-Time Tracking Using a Category-to-Individual Detector David Hall and Pietro Perona California Institute of Technology dhall,perona@vision.caltech.edu Abstract. A method for online, real-time
More informationAre we ready for Autonomous Driving? The KITTI Vision Benchmark Suite
Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite Philip Lenz 1 Andreas Geiger 2 Christoph Stiller 1 Raquel Urtasun 3 1 KARLSRUHE INSTITUTE OF TECHNOLOGY 2 MAX-PLANCK-INSTITUTE IS 3
More informationLesson 3: Behind the Scenes with Production
Lesson 3: Behind the Scenes with Production Overview: Being in production is the second phase of the production process and involves everything that happens from the first shot to the final wrap. In this
More informationCOVERAGE You re going to have to edit your film later. How do you make sure you get enough footage for that to work?
MAKE IT YOUR STORY CINEMATOGRAPHY Film is a visual medium - which just means it s a story you consume through your eyeballs. So, when you ve got your story figured out, you need to think about the best
More informationPedestrian Detection with RCNN
Pedestrian Detection with RCNN Matthew Chen Department of Computer Science Stanford University mcc17@stanford.edu Abstract In this paper we evaluate the effectiveness of using a Region-based Convolutional
More informationMulti-view Intelligent Vehicle Surveillance System
Multi-view Intelligent Vehicle Surveillance System S. Denman, C. Fookes, J. Cook, C. Davoren, A. Mamic, G. Farquharson, D. Chen, B. Chen and S. Sridharan Image and Video Research Laboratory Queensland
More informationMonitoring Creatures Great and Small: Computer Vision Systems for Looking at Grizzly Bears, Fish, and Grasshoppers
Monitoring Creatures Great and Small: Computer Vision Systems for Looking at Grizzly Bears, Fish, and Grasshoppers Greg Mori, Maryam Moslemi, Andy Rova, Payam Sabzmeydani, Jens Wawerla Simon Fraser University
More informationCS231M Project Report - Automated Real-Time Face Tracking and Blending
CS231M Project Report - Automated Real-Time Face Tracking and Blending Steven Lee, slee2010@stanford.edu June 6, 2015 1 Introduction Summary statement: The goal of this project is to create an Android
More informationCOC131 Data Mining - Clustering
COC131 Data Mining - Clustering Martin D. Sykora m.d.sykora@lboro.ac.uk Tutorial 05, Friday 20th March 2009 1. Fire up Weka (Waikako Environment for Knowledge Analysis) software, launch the explorer window
More informationOpen-Set Face Recognition-based Visitor Interface System
Open-Set Face Recognition-based Visitor Interface System Hazım K. Ekenel, Lorant Szasz-Toth, and Rainer Stiefelhagen Computer Science Department, Universität Karlsruhe (TH) Am Fasanengarten 5, Karlsruhe
More information<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany
Information Systems University of Koblenz Landau, Germany Exploiting Spatial Context in Images Using Fuzzy Constraint Reasoning Carsten Saathoff & Agenda Semantic Web: Our Context Knowledge Annotation
More informationYouTube Scale, Large Vocabulary Video Annotation
YouTube Scale, Large Vocabulary Video Annotation Nicholas Morsillo, Gideon Mann and Christopher Pal Abstract As video content on the web continues to expand, it is increasingly important to properly annotate
More informationClassifying Manipulation Primitives from Visual Data
Classifying Manipulation Primitives from Visual Data Sandy Huang and Dylan Hadfield-Menell Abstract One approach to learning from demonstrations in robotics is to make use of a classifier to predict if
More informationCharacter Image Patterns as Big Data
22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,
More informationBlocks that Shout: Distinctive Parts for Scene Classification
Blocks that Shout: Distinctive Parts for Scene Classification Mayank Juneja 1 Andrea Vedaldi 2 C. V. Jawahar 1 Andrew Zisserman 2 1 Center for Visual Information Technology, International Institute of
More informationTeacher Guide. English Examining Film. Teacher Guide. Series overview. Curriculum links. Educational approach
3. Series overview Learners enjoy watching films, but are often intimidated by having to analyse them as a genre of literature. This series aims to introduce learners to films as texts. We cover basic
More informationWhat more can we do with videos?
What more can we do with videos? Occlusion and Mo6on Reasoning for Tracking & Human Pose Es6ma6on Karteek Alahari Inria Grenoble Rhone- Alpes Joint work with Anoop Cherian, Yang Hua, Julien Mairal, Cordelia
More informationSkills Inventory: Art/Media Communications. 1. Pre-employment Training/Career Development. A. Formal; e.g., certificates. Date Description Location
Skills Inventory: Art/Media Communications 1. Pre-employment Training/Career Development A. Formal; e.g., certificates Date Description Location Art/Design and Communication Skills Inventory: Art/Media
More informationFinding people in repeated shots of the same scene
Finding people in repeated shots of the same scene Josef Sivic 1 C. Lawrence Zitnick Richard Szeliski 1 University of Oxford Microsoft Research Abstract The goal of this work is to find all occurrences
More informationACE: After Effects CS6
Adobe Training Services Exam Guide ACE: After Effects CS6 Adobe Training Services provides this exam guide to help prepare partners, customers, and consultants who are actively seeking accreditation as
More informationCirculant temporal encoding for video retrieval and temporal alignment
Circulant temporal encoding for video retrieval and temporal alignment Matthijs Douze, Jérôme Revaud, Jakob Verbeek, Hervé Jégou, Cordelia Schmid To cite this version: Matthijs Douze, Jérôme Revaud, Jakob
More informationGCSE Media Studies. Course Outlines. version 1.2
GCSE Media Studies Course Outlines version 1.2 GCSE Media Studies and GCSE Media Studies Double Award Models of Delivery Introduction All GCSEs taken after summer 2013 will be linear in structure. Candidates
More informationComputer Forensics Application. ebay-uab Collaborative Research: Product Image Analysis for Authorship Identification
Computer Forensics Application ebay-uab Collaborative Research: Product Image Analysis for Authorship Identification Project Overview A new framework that provides additional clues extracted from images
More informationHuman behavior analysis from videos using optical flow
L a b o r a t o i r e I n f o r m a t i q u e F o n d a m e n t a l e d e L i l l e Human behavior analysis from videos using optical flow Yassine Benabbas Directeur de thèse : Chabane Djeraba Multitel
More informationHow To Use A Near Neighbor To A Detector
Ensemble of -SVMs for Object Detection and Beyond Tomasz Malisiewicz Carnegie Mellon University Abhinav Gupta Carnegie Mellon University Alexei A. Efros Carnegie Mellon University Abstract This paper proposes
More informationEdge Boxes: Locating Object Proposals from Edges
Edge Boxes: Locating Object Proposals from Edges C. Lawrence Zitnick and Piotr Dollár Microsoft Research Abstract. The use of object proposals is an effective recent approach for increasing the computational
More informationwww.myndplay.com info@myndplay.com Creating Interactive Video Content for the MyndPlay Platform
Creating Interactive Video Content for the MyndPlay Platform www.myndplay.com info@myndplay.com There are many things to consider when developing mind controlled content for the MyndPlayer. Though the
More informationEmploying signed TV broadcasts for automated learning of British Sign Language
Employing signed TV broadcasts for automated learning of British Sign Language Patrick Buehler 1, Mark Everingham 2, Andrew Zisserman 1 1 Department of Engineering Science, University of Oxford, UK 2 School
More informationPROGRAM OUTLINE 2015 - PAGE 1
PROGRAM OUTLINE 2015 - PAGE 1 MODULE 1 (8 WEEKS) INTRO TO 3D MODELING WITH MAYA An introduction on how to begin building and manipulating objects in 3D Space. DRAWING BASICS Here we will learn drawing
More informationVehicle Tracking by Simultaneous Detection and Viewpoint Estimation
Vehicle Tracking by Simultaneous Detection and Viewpoint Estimation Ricardo Guerrero-Gómez-Olmedo, Roberto López-Sastre, Saturnino Maldonado-Bascón, and Antonio Fernández-Caballero 2 GRAM, Department of
More informationIntroduction to Learning & Decision Trees
Artificial Intelligence: Representation and Problem Solving 5-38 April 0, 2007 Introduction to Learning & Decision Trees Learning and Decision Trees to learning What is learning? - more than just memorizing
More informationEvaluating Sources and Strategies for Learning Video Concepts from Social Media
Evaluating Sources and Strategies for Learning Video Concepts from Social Media Svetlana Kordumova Intelligent Systems Lab Amsterdam University of Amsterdam The Netherlands Email: s.kordumova@uva.nl Xirong
More informationBEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 123 CHAPTER 7 BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 7.1 Introduction Even though using SVM presents
More informationSearch and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
More informationA Robust Multiple Object Tracking for Sport Applications 1) Thomas Mauthner, Horst Bischof
A Robust Multiple Object Tracking for Sport Applications 1) Thomas Mauthner, Horst Bischof Institute for Computer Graphics and Vision Graz University of Technology, Austria {mauthner,bischof}@icg.tu-graz.ac.at
More informationIs that you? Metric Learning Approaches for Face Identification
Is that you? Metric Learning Approaches for Face Identification Matthieu Guillaumin, Jakob Verbeek and Cordelia Schmid LEAR, INRIA Grenoble Laboratoire Jean Kuntzmann firstname.lastname@inria.fr Abstract
More informationHuman Focused Action Localization in Video
Human Focused Action Localization in Video Alexander Kläser 1, Marcin Marsza lek 2, Cordelia Schmid 1, and Andrew Zisserman 2 1 INRIA Grenoble, LEAR, LJK {klaser,schmid}@inrialpes.fr 2 Engineering Science,
More informationTowards Weakly-Supervised Action Localization
Towards Weakly-Supervised Action Localization Philippe Weinzaepfel, Martin Xavier, Cordelia Schmid To cite this version: Philippe Weinzaepfel, Martin Xavier, Cordelia Schmid. Towards Weakly-Supervised
More informationLearning Detectors from Large Datasets for Object Retrieval in Video Surveillance
2012 IEEE International Conference on Multimedia and Expo Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance Rogerio Feris, Sharath Pankanti IBM T. J. Watson Research Center
More informationJefferson Township Public Schools. Technology Curriculum. Video Production II: Television Studio. Grades 10, 11 & 12. August 2011
Jefferson Township Public Schools Technology Curriculum Video Production II: Television Studio Grades 10, 11 & 12 August 2011 Video Production II Curriculum 2011 Page 2 TABLE OF CONTENTS JEFFERSON TOWNSHIP
More informationCriteria A. Appropriateness to Mission.
TV and Video Production Chaffey College DEVELOPMENT CRITERIA NARRATIVE Criteria A. Appropriateness to Mission. Chaffey College improves lives within the diverse communities it serves through equal access
More informationTeacher Resource Bank Unit 2 Exemplar Assignments
Teacher Resource Bank Unit 2 Exemplar Assignments GCSE Media Studies Version 1.2 Contents Assignment 1 - Introduction to the Media Page 2-5 Assignment 2 - Cross-Media Study Page 6-11 Assignment 3 Practical
More informationVideoStory A New Mul1media Embedding for Few- Example Recogni1on and Transla1on of Events
VideoStory A New Mul1media Embedding for Few- Example Recogni1on and Transla1on of Events Amirhossein Habibian, Thomas Mensink, Cees Snoek ISLA, University of Amsterdam Problem statement Recognize and
More informationSemantic Recognition: Object Detection and Scene Segmentation
Semantic Recognition: Object Detection and Scene Segmentation Xuming He xuming.he@nicta.com.au Computer Vision Research Group NICTA Robotic Vision Summer School 2015 Acknowledgement: Slides from Fei-Fei
More informationDublin City University at the TRECVid 2008 BBC Rushes Summarisation Task
Dublin City University at the TRECVid 2008 BBC Rushes Summarisation Tas Hervé Bredin, Daragh Byrne, Hyowon Lee, Noel E. O Connor and Gareth J.F. Jones Centre for Digital Video Processing and Adaptive Information
More informationHow To Test Video Quality With Real Time Monitor
White Paper Real Time Monitoring Explained Video Clarity, Inc. 1566 La Pradera Dr Campbell, CA 95008 www.videoclarity.com 408-379-6952 Version 1.0 A Video Clarity White Paper page 1 of 7 Real Time Monitor
More informationBig Data: Image & Video Analytics
Big Data: Image & Video Analytics How it could support Archiving & Indexing & Searching Dieter Haas, IBM Deutschland GmbH The Big Data Wave 60% of internet traffic is multimedia content (images and videos)
More informationConsumer video dataset with marked head trajectories
Consumer video dataset with marked head trajectories Jouni Sarvanko jouni.sarvanko@ee.oulu.fi Mika Rautiainen mika.rautiainen@ee.oulu.fi Mika Ylianttila mika.ylianttila@oulu.fi Arto Heikkinen arto.heikkinen@ee.oulu.fi
More informationLecture 6: Classification & Localization. boris. ginzburg@intel.com
Lecture 6: Classification & Localization boris. ginzburg@intel.com 1 Agenda ILSVRC 2014 Overfeat: integrated classification, localization, and detection Classification with Localization Detection. 2 ILSVRC-2014
More informationMachine Learning What, how, why?
Machine Learning What, how, why? Rémi Emonet (@remiemonet) 2015-09-30 Web En Vert $ whoami $ whoami Software Engineer Researcher: machine learning, computer vision Teacher: web technologies, computing
More informationCS 4300 Computer Graphics. Prof. Harriet Fell Fall 2012 Lecture 33 November 26, 2012
CS 4300 Computer Graphics Prof. Harriet Fell Fall 2012 Lecture 33 November 26, 2012 1 Today s Topics Animation 2 Static to Animated we have mostly created static scenes except when we applied affine transformations
More informationResearch Article Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics
Hindawi Publishing Corporation EURASIP Journal on Image and Video Processing Volume 2008, Article ID 246309, 10 pages doi:10.1155/2008/246309 Research Article Evaluating Multiple Object Tracking Performance:
More informationMACHINE LEARNING IN HIGH ENERGY PHYSICS
MACHINE LEARNING IN HIGH ENERGY PHYSICS LECTURE #1 Alex Rogozhnikov, 2015 INTRO NOTES 4 days two lectures, two practice seminars every day this is introductory track to machine learning kaggle competition!
More informationModelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
More informationAutomatic parameter regulation for a tracking system with an auto-critical function
Automatic parameter regulation for a tracking system with an auto-critical function Daniela Hall INRIA Rhône-Alpes, St. Ismier, France Email: Daniela.Hall@inrialpes.fr Abstract In this article we propose
More informationHas my Algorithm Succeeded? An Evaluator for Human Pose Estimators
Has my Algorithm Succeeded? An Evaluator for Human Pose Estimators Nataraj Jammalamadaka, Andrew Zisserman, Marcin Eichner, Vittorio Ferrari, and C. V. Jawahar IIIT-Hyderabad, University of Oxford, ETH
More informationScalable Mining of Large Video Databases Using Copy Detection
Scalable Mining of Large Video Databases Using Copy Detection Sébastien Poullot Institut National de l Audiovisuel and CEDRIC - CNAM, France spoullot@ina.fr Michel Crucianu CEDRIC - CNAM 292 rue St Martin
More informationUniversity of Central Florida at TRECVID 2004
University of Central Florida at TRECVID 2004 Yun Zhai, Xiaochun Cao, Yunjun Zhang, Omar Javed, Alper Yilmaz Fahd Rafi, Saad Ali, Orkun Alatas, Saad Khan, and Mubarak Shah Computer Vision Laboratory University
More informationThe REPERE Corpus : a multimodal corpus for person recognition
The REPERE Corpus : a multimodal corpus for person recognition Aude Giraudel 1, Matthieu Carré 2, Valérie Mapelli 2 Juliette Kahn 3, Olivier Galibert 3, Ludovic Quintard 3 1 Direction générale de l armement,
More informationForschungskolleg Data Analytics Methods and Techniques
Forschungskolleg Data Analytics Methods and Techniques Martin Hahmann, Gunnar Schröder, Phillip Grosse Prof. Dr.-Ing. Wolfgang Lehner Why do we need it? We are drowning in data, but starving for knowledge!
More informationSupport Vector Machine (SVM)
Support Vector Machine (SVM) CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Margin concept Hard-Margin SVM Soft-Margin SVM Dual Problems of Hard-Margin
More informationThe Delicate Art of Flower Classification
The Delicate Art of Flower Classification Paul Vicol Simon Fraser University University Burnaby, BC pvicol@sfu.ca Note: The following is my contribution to a group project for a graduate machine learning
More informationRemoving Moving Objects from Point Cloud Scenes
1 Removing Moving Objects from Point Cloud Scenes Krystof Litomisky klitomis@cs.ucr.edu Abstract. Three-dimensional simultaneous localization and mapping is a topic of significant interest in the research
More information<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany
Information Systems University of Koblenz Landau, Germany Semantic Multimedia Management - Multimedia Annotation Tools http://isweb.uni-koblenz.de Multimedia Annotation Different levels of annotations
More informationT O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN
T O B C A T C A S E G E O V I S A T DETECTIE E N B L U R R I N G V A N P E R S O N E N IN P A N O R A MISCHE BEELDEN Goal is to process 360 degree images and detect two object categories 1. Pedestrians,
More informationConclusions and Future Directions
Chapter 9 This chapter summarizes the thesis with discussion of (a) the findings and the contributions to the state-of-the-art in the disciplines covered by this work, and (b) future work, those directions
More informationIntroduzione alle Biblioteche Digitali Audio/Video
Introduzione alle Biblioteche Digitali Audio/Video Biblioteche Digitali 1 Gestione del video Perchè è importante poter gestire biblioteche digitali di audiovisivi Caratteristiche specifiche dell audio/video
More informationFeature Tracking and Optical Flow
02/09/12 Feature Tracking and Optical Flow Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Many slides adapted from Lana Lazebnik, Silvio Saverse, who in turn adapted slides from Steve
More informationVOL. 1, NO.8, November 2011 ISSN 2222-9833 ARPN Journal of Systems and Software 2009-2011 AJSS Journal. All rights reserved
Image Processing - A Gizmo for Video Content Extraction 1 G.S.Sarma, 2 V.Shiva Kumar, 3 SK.Nizmi 1 Asst Professor, Electronics and computers engg, KLCE,vijayawada,india 2 B.tech 4 th year, Electronics
More informationMIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS
More informationUsing Email templates and Quick Parts to deal with common queries
Using Email templates and Quick Parts to deal with common queries In this document you will find instructions to help you 1) create and use email templates; and 2) embed previously stored answers which
More informationClustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012
Clustering Big Data Anil K. Jain (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012 Outline Big Data How to extract information? Data clustering
More informationThe Visual Internet of Things System Based on Depth Camera
The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed
More informationPrivacy Preserving Automatic Fall Detection for Elderly Using RGBD Cameras
Privacy Preserving Automatic Fall Detection for Elderly Using RGBD Cameras Chenyang Zhang 1, Yingli Tian 1, and Elizabeth Capezuti 2 1 Media Lab, The City University of New York (CUNY), City College New
More informationIBM SPSS Direct Marketing 23
IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release
More informationOnline Play Segmentation for Broadcasted American Football TV Programs
Online Play Segmentation for Broadcasted American Football TV Programs Liexian Gu 1, Xiaoqing Ding 1, and Xian-Sheng Hua 2 1 Department of Electronic Engineering, Tsinghua University, Beijing, China {lxgu,
More informationDistributed Smart Cameras for Aging in Place*
Distributed Smart Cameras for Aging in Place* Adam Williams, Dan Xie, Shichao Ou, Roderic Grupen, Allen Hanson, and Edward Riseman Department of Computer Science University of Massachusetts Amherst, MA
More informationOnline Learning for Fast Segmentation of Moving Objects
Online Learning for Fast Segmentation of Moving Objects Liam Ellis, Vasileios Zografos {liam.ellis,vasileios.zografos}@liu.se CVL, Linköping University, Linköping, Sweden Abstract. This work addresses
More informationSemantic Video Annotation by Mining Association Patterns from Visual and Speech Features
Semantic Video Annotation by Mining Association Patterns from and Speech Features Vincent. S. Tseng, Ja-Hwung Su, Jhih-Hong Huang and Chih-Jen Chen Department of Computer Science and Information Engineering
More information