Youtube. Mining Specific Actions from Youtube Video with Spatio-Temporal Features

THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS
TECHNICAL REPORT OF IEICE

Web Web Youtube Web2.0 bag-of-features VisualRank VisualRank Web %, % Web VisualRank Web2.0

Mining Specific Actions from Youtube Video with Spatio-Temporal Features

DO HANG NGA and Keiji YANAI
The University of Electro-Communications, Choufugaoka 1-5-1, Choufu, Tokyo, Japan
{dohang,yanai}@mm.cs.uec.ac.jp

Abstract In this paper, we present a new method for automatically extracting, from tagged Web videos, the video shots that correspond to specific actions, given only action keywords such as "walking" or "eating" as input.

Key words Spatio-Temporal Feature, Web video, unsupervised learning

Web Web YouTube Web Web Web eating eating eating eating eating Web Web 15 [1] Web

1.2 6 Web2.0 VisualRank 1 Web running marathon running marathon running marathon MIL Satkin [8] multi-class SVM Cinbis [9] Web YouTube Niebles [2] Dollar [3] pLSA KTH [4] Web Web [5] 91% Liu [6] YouTube 11 Wild YouTube [3] SIFT AdaBoost PageRank Liu PageRank Liu Cinbis [7] Multiple Instance Learning (MIL) 2 Web Web2.0 [10] Web2.0 VisualRank Web Web2.0 Web2.0 VisualRank

Web2.0 Web2.0 [10] 0() 1( ) Web API 1000 ID Web2.0 Web2.0 (eating running ) 4.2 Youtube API 1000 ID Web RGB Noguchi [5] ( [11] )

step1 :
step2 :
step2-1 : SURF
step2-2 :
step2-3 : Delaunay
step3 :
step3-1 : Lucas-Kanade
step3-2 : SURF dominant rotation
step4 :

Noguchi SURF SURF Lucas-Kanade Delaunay Lucas-Kanade Lucas-Kanade bag-of-features (BoF) BoF 1 BoF

4.1 BoF

step1 :
step2 : codebook
step3 : codebook

a ) (step 1) =
b ) codebook (step 2) codebook k-means visual words codebook
c ) (step 3) codebook visual words visual words codebook

4.6 MKL cross-validation
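The BoF steps named above (a codebook built with k-means, then each shot described by its visual words) can be sketched as follows. This is only an illustration under stated assumptions: the function names, the seeding of k-means with the first k descriptors, the Euclidean distance, and the L1 normalization of the histogram are all choices made for this example and are not confirmed by the text.

```python
import math


def nearest(point, centers):
    """Index of the center closest to `point` (Euclidean distance, an assumption)."""
    return min(range(len(centers)), key=lambda k: math.dist(point, centers[k]))


def kmeans(points, k, iters=20):
    """Plain Lloyd's k-means; the first k points seed the centers (an assumption)."""
    centers = [list(p) for p in points[:k]]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        for j, group in enumerate(groups):
            if group:  # keep the previous center when a cluster is empty
                centers[j] = [sum(c) / len(group) for c in zip(*group)]
    return centers


def bof_histogram(descriptors, codebook):
    """Histogram of visual-word assignments for one shot, L1-normalized."""
    hist = [0.0] * len(codebook)
    for d in descriptors:
        hist[nearest(d, codebook)] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


if __name__ == "__main__":
    # Toy 2-D "descriptors"; real spatio-temporal features are much higher-dimensional.
    training = [(0, 0), (0, 1), (10, 10), (10, 11)]
    codebook = kmeans(training, k=2)
    print(bof_histogram([(0, 0.2), (9, 10), (10, 12), (11, 11)], codebook))
```

Each codebook entry plays the role of one visual word, so a shot with any number of local features is reduced to a fixed-length vector that can be compared with other shots.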

[5] 2:1:1

4.7 VisualRank

VisualRank [12] VisualRank 2 (1) 3 Youtube

$$s(H_1, H_2) = \sum_{i=1}^{|H|} \min(H_{1i}, H_{2i}) \tag{1}$$

H H (2)

$$S_{combined} = w_{st} S_{st} + w_m S_m + w_v S_v \tag{2}$$

where $w_{st} = \frac{1}{2}$, $w_m = \frac{1}{4}$, $w_v = \frac{1}{4}$

st m v S w VisualRank Web2.0 VisualRank (3)

$$VR = d\,S \times VR + (1 - d)\,q \tag{3}$$

where $q_j = \begin{cases} \frac{1}{m}, & j < m \\ 0, & j \geq m \end{cases}$

m m = 1000 VisualRank Web Web Web2.0 VisualRank d d >= 0.8 d =

Youtube batting, running marathon, walking street, shoot football, jumping trampoline, eating ramen 6 ( 3) 1 VisualRank VisualRank 2000 (4) Web2.0

MS(V_i) = S(V_i AS 3 + P (4)

where $P = \begin{cases} NS/2, & 20 < NS < 50 \\ NS/3, & 50 \leq NS < 90 \\ 40, & NS \geq 90 \end{cases}$

MS S Web2.0 AS NS 20 P Web2.0 1 [5] 100 [5] 1 [5] batting % eating % jumping % running % shoot % walking % 1, : 80% YouTube Web2.0 Web VisualRank 2 3 Web2.0
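Equations (1) to (3) can be illustrated with a short sketch: histogram intersection between two BoF vectors, the weighted combination of the three per-feature similarities, and a power iteration for the biased VisualRank. Several points are assumptions made for this example rather than details confirmed by the text: the function names, the column normalization of the similarity matrix (as in the original VisualRank formulation [12]), the fixed iteration count, and the damping value d = 0.85 (the text only states d >= 0.8).

```python
def histogram_intersection(h1, h2):
    """Eq. (1): sum of the bin-wise minima of two histograms."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def combined_similarity(s_st, s_m, s_v, w_st=0.5, w_m=0.25, w_v=0.25):
    """Eq. (2): weighted sum of the three per-feature similarities."""
    return w_st * s_st + w_m * s_m + w_v * s_v


def bias_vector(n, m):
    """q_j = 1/m for the first m of n shots and 0 for the rest."""
    return [1.0 / m if j < m else 0.0 for j in range(n)]


def visual_rank(sim, q, d=0.85, iters=100):
    """Power iteration for Eq. (3); d = 0.85 is a placeholder, not the paper's value."""
    n = len(sim)
    col_sums = [sum(sim[i][j] for i in range(n)) for j in range(n)]
    # Column-normalize the similarity matrix (assumed, following VisualRank [12]).
    s = [[sim[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
         for i in range(n)]
    vr = [1.0 / n] * n
    for _ in range(iters):
        vr = [d * sum(s[i][j] * vr[j] for j in range(n)) + (1 - d) * q[i]
              for i in range(n)]
    return vr


if __name__ == "__main__":
    # Shots 0 and 1 resemble each other; shot 2 is an outlier.
    sim = [[1.0, 0.8, 0.1],
           [0.8, 1.0, 0.1],
           [0.1, 0.1, 1.0]]
    ranks = visual_rank(sim, bias_vector(n=3, m=2))
    print(sorted(range(3), key=lambda i: -ranks[i]))
```

With a non-uniform q, shots among the first m receive a constant boost at every iteration, while the remaining shots can gain rank only through similarity to other highly ranked shots.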

2 3 Web

batting    68%      8%
eating     42%      19%
jumping    76%      19%
running    18%      17%
shoot      6%       5%
walking    7%       8%
Average    36.17%   12.67%

Web

batting    100%   86.7%   72%   68%     2%
eating     50%    33.3%   40%   47%     0%
jumping    80%    83.3%   82%   82%     5%
running    30%    33.3%   30%   34%     5%
shoot      20%    26.7%   32%   33%     1%
walking    20%    23.3%   20%   21%     1%
Average    50%    47.8%   46%   47.5%   2.2%

4 Web

batting    70%     83.3%   76%   66%     3%
eating     50%     60%     48%   41%     1%
jumping    100%    90%     86%   85%     9%
running    30%     40%     42%   38%     8%
shoot      50%     53.3%   38%   29%     1%
walking    20%     50%     40%   38%     2%
Average    53.3%   62.8%   55%   48.7%   4%

Web2.0 running, walking, shoot 3 Web2.0 batting jumping ( 68% 79%) shoot walking ( 6% 7%) VisualRank Web2.0 Web2.0 Web % % 62.8%

5. Web % % % 100 4% VisualRank Web2.0 Web % VisualRank Web Web

[1] J. Sun, X. Wu, S. Yan, L.F. Cheong, T.S. Chua, and J. Li. Hierarchical spatio-temporal context modeling for action recognition. In CVPR 2009.
[2] J. Niebles, H. Wang, and L. Fei-Fei. Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision, Vol. 79.
[3] P. Dollar, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2nd Joint IEEE International Workshop on.
[4] I. Laptev. On space-time interest points. International Journal of Computer Vision, Vol. 64.
[5] A. Noguchi and K. Yanai. A SURF-based spatio-temporal feature for feature-fusion-based action recognition. In Proc. of ECCV WS on Human Motion: Understanding, Modeling, Capture and Animation.
[6] J. Liu, J. Luo, and M. Shah. Recognizing realistic actions from videos in the wild. In CVPR 2009.
[7] N. Ikizler-Cinbis and S. Sclaroff. Object, scene and actions: Combining multiple features for human action recognition. In ECCV 2010, Vol. 6311.
[8] S. Satkin and M. Hebert. Modeling the temporal extent of actions. In ECCV 2010, Vol. 6311.
[9] N. Ikizler-Cinbis, R.G. Cinbis, and S. Sclaroff. Learning actions from the web. In ICCV 2009.
[10] Q. Yang, X. Chen, and G. Wang. Web 2.0 dictionary. In Proc. of ACM International Conference on Image and Video Retrieval.
[11] (MIRU2010).
[12] Y. Jing and S. Baluja. VisualRank: Applying PageRank to large-scale image search. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 30.

Figures 4 to 9 (images not preserved): batting 10 (Figs. 4 and 7), jumping 10 (Figs. 5 and 8), eating 10 (Figs. 6 and 9).


More information

Actions in Context. Ivan Laptev INRIA Rennes. Marcin Marszałek INRIA Grenoble. Cordelia Schmid INRIA Grenoble. Abstract. 1.

Actions in Context. Ivan Laptev INRIA Rennes. Marcin Marszałek INRIA Grenoble. Cordelia Schmid INRIA Grenoble. Abstract. 1. Actions in Context Marcin Marszałek INRIA Grenoble marcin.marszalek@inria.fr Ivan Laptev INRIA Rennes ivan.laptev@inria.fr Cordelia Schmid INRIA Grenoble cordelia.schmid@inria.fr Abstract This paper exploits

More information

Mining Signatures in Healthcare Data Based on Event Sequences and its Applications

Mining Signatures in Healthcare Data Based on Event Sequences and its Applications Mining Signatures in Healthcare Data Based on Event Sequences and its Applications Siddhanth Gokarapu 1, J. Laxmi Narayana 2 1 Student, Computer Science & Engineering-Department, JNTU Hyderabad India 1

More information

PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO

PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO V. Conotter, E. Bodnari, G. Boato H. Farid Department of Information Engineering and Computer Science University of Trento, Trento (ITALY)

More information

Exploiting Simple Hierarchies for Unsupervised Human Behavior Analysis

Exploiting Simple Hierarchies for Unsupervised Human Behavior Analysis Exploiting Simple Hierarchies for Unsupervised Human Behavior Analysis Fabian Nater 1 Helmut Grabner 1 Luc Van Gool 1,2 1 Computer Vision Laboratory 2 ESAT - PSI / IBBT ETH Zurich K.U. Leuven {fnater,grabner,vangool}@vision.ee.ethz.ch

More information

Intinno: A Web Integrated Digital Library and Learning Content Management System

Intinno: A Web Integrated Digital Library and Learning Content Management System Intinno: A Web Integrated Digital Library and Learning Content Management System Synopsis of the Thesis to be submitted in Partial Fulfillment of the Requirements for the Award of the Degree of Master

More information

ConTag: Conceptual Tag Clouds Video Browsing in e-learning

ConTag: Conceptual Tag Clouds Video Browsing in e-learning ConTag: Conceptual Tag Clouds Video Browsing in e-learning 1 Ahmad Nurzid Rosli, 2 Kee-Sung Lee, 3 Ivan A. Supandi, 4 Geun-Sik Jo 1, First Author Department of Information Technology, Inha University,

More information

IT services for analyses of various data samples

IT services for analyses of various data samples IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical

More information

LIBSVX and Video Segmentation Evaluation

LIBSVX and Video Segmentation Evaluation CVPR 14 Tutorial! 1! LIBSVX and Video Segmentation Evaluation Chenliang Xu and Jason J. Corso!! Computer Science and Engineering! SUNY at Buffalo!! Electrical Engineering and Computer Science! University

More information

Local features and matching. Image classification & object localization

Local features and matching. Image classification & object localization Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to

More information

Interactive person re-identification in TV series

Interactive person re-identification in TV series Interactive person re-identification in TV series Mika Fischer Hazım Kemal Ekenel Rainer Stiefelhagen CV:HCI lab, Karlsruhe Institute of Technology Adenauerring 2, 76131 Karlsruhe, Germany E-mail: {mika.fischer,ekenel,rainer.stiefelhagen}@kit.edu

More information

Effects of Pronunciation Practice System Based on Personalized CG Animations of Mouth Movement Model

Effects of Pronunciation Practice System Based on Personalized CG Animations of Mouth Movement Model Effects of Pronunciation Practice System Based on Personalized CG Animations of Mouth Movement Model Kohei Arai 1 Graduate School of Science and Engineering Saga University Saga City, Japan Mariko Oda

More information

Proposed Advance Taxi Recommender System Based On a Spatiotemporal Factor Analysis Model

Proposed Advance Taxi Recommender System Based On a Spatiotemporal Factor Analysis Model Proposed Advance Taxi Recommender System Based On a Spatiotemporal Factor Analysis Model Santosh Thakkar, Supriya Bhosale, Namrata Gawade, Prof. Sonia Mehta Department of Computer Engineering, Alard College

More information

User Modeling in Big Data. Qiang Yang, Huawei Noah s Ark Lab and Hong Kong University of Science and Technology 杨 强, 华 为 诺 亚 方 舟 实 验 室, 香 港 科 大

User Modeling in Big Data. Qiang Yang, Huawei Noah s Ark Lab and Hong Kong University of Science and Technology 杨 强, 华 为 诺 亚 方 舟 实 验 室, 香 港 科 大 User Modeling in Big Data Qiang Yang, Huawei Noah s Ark Lab and Hong Kong University of Science and Technology 杨 强, 华 为 诺 亚 方 舟 实 验 室, 香 港 科 大 Who we are: Noah s Ark LAB Have you watched the movie 2012?

More information

Clustering Technique in Data Mining for Text Documents

Clustering Technique in Data Mining for Text Documents Clustering Technique in Data Mining for Text Documents Ms.J.Sathya Priya Assistant Professor Dept Of Information Technology. Velammal Engineering College. Chennai. Ms.S.Priyadharshini Assistant Professor

More information

Social media has recently played a critical

Social media has recently played a critical C Y B E R - P H Y S I C A L - S O C I A L S Y S T E M S Editor: Daniel Zeng, University of Arizona, zeng@email.arizona.edu Harnessing the Crowdsourcing Power of Social Media for Disaster Relief Huiji Gao

More information

Research on Trust Management Strategies in Cloud Computing Environment

Research on Trust Management Strategies in Cloud Computing Environment Journal of Computational Information Systems 8: 4 (2012) 1757 1763 Available at http://www.jofcis.com Research on Trust Management Strategies in Cloud Computing Environment Wenjuan LI 1,2,, Lingdi PING

More information

Metaheuristics in Big Data: An Approach to Railway Engineering

Metaheuristics in Big Data: An Approach to Railway Engineering Metaheuristics in Big Data: An Approach to Railway Engineering Silvia Galván Núñez 1,2, and Prof. Nii Attoh-Okine 1,3 1 Department of Civil and Environmental Engineering University of Delaware, Newark,

More information

PULLING OUT OPINION TARGETS AND OPINION WORDS FROM REVIEWS BASED ON THE WORD ALIGNMENT MODEL AND USING TOPICAL WORD TRIGGER MODEL

PULLING OUT OPINION TARGETS AND OPINION WORDS FROM REVIEWS BASED ON THE WORD ALIGNMENT MODEL AND USING TOPICAL WORD TRIGGER MODEL Journal homepage: www.mjret.in ISSN:2348-6953 PULLING OUT OPINION TARGETS AND OPINION WORDS FROM REVIEWS BASED ON THE WORD ALIGNMENT MODEL AND USING TOPICAL WORD TRIGGER MODEL Utkarsha Vibhute, Prof. Soumitra

More information

Quasi Real-Time Summarization for Consumer Videos

Quasi Real-Time Summarization for Consumer Videos Quasi Real-Time Summarization for Consumer Videos Bin Zhao Eric P. Xing School of Computer Science, Carnegie Mellon University {binzhao,epxing}@cs.cmu.edu Abstract With the widespread availability of video

More information

Florida International University - University of Miami TRECVID 2014

Florida International University - University of Miami TRECVID 2014 Florida International University - University of Miami TRECVID 2014 Miguel Gavidia 3, Tarek Sayed 1, Yilin Yan 1, Quisha Zhu 1, Mei-Ling Shyu 1, Shu-Ching Chen 2, Hsin-Yu Ha 2, Ming Ma 1, Winnie Chen 4,

More information