Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011



Similar documents
Object Recognition. Selim Aksoy. Bilkent University

Tracking in flussi video 3D. Ing. Samuele Salti

The goal is multiply object tracking by detection with application on pedestrians.

VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS

Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance

RAMIN MEHRAN B.S. K.N. Toosi University of Technology M.S. K.N. Toosi University of Technology

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.

3D Model based Object Class Detection in An Arbitrary View

Learning Motion Categories using both Semantic and Structural Information

Human behavior analysis from videos using optical flow

Tracking Many Vehicles in Wide Area Aerial Surveillance

Mean-Shift Tracking with Random Sampling

Observing Human Behavior in Image Sequences: the Video Hermeneutics Challenge

False alarm in outdoor environments

Real-Time Tracking of Pedestrians and Vehicles

The Visual Internet of Things System Based on Depth Camera

Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006

Vision based Vehicle Tracking using a high angle camera

Colorado School of Mines Computer Vision Professor William Hoff

Probabilistic Latent Semantic Analysis (plsa)

Tracking and Recognition in Sports Videos

Towards License Plate Recognition: Comparying Moving Objects Segmentation Approaches

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012

TRACKING DRIVER EYE MOVEMENTS AT PERMISSIVE LEFT-TURNS

Cloud-Empowered Multimedia Service: An Automatic Video Storytelling Tool

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

Interactive Offline Tracking for Color Objects

An Automatic and Accurate Segmentation for High Resolution Satellite Image S.Saumya 1, D.V.Jiji Thanka Ligoshia 2

Fast Matching of Binary Features

Real Time Target Tracking with Pan Tilt Zoom Camera

Tracking Groups of Pedestrians in Video Sequences

Disputed Red Light Accidents: A Primer on Signal Control. Jon B. Landerville, MSME, PE Edward C. Fatzinger, MSME, PE Philip S.

Introduction to Computer Graphics

VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS

Neural Network based Vehicle Classification for Intelligent Traffic Control

Bi-directional Tracking using Trajectory Segment Analysis

EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM

Exploiting Simple Hierarchies for Unsupervised Human Behavior Analysis

Data Analytics at NICTA. Stephen Hardy National ICT Australia (NICTA)

Automatic Labeling of Lane Markings for Autonomous Vehicles

UNIVERSITY OF CENTRAL FLORIDA AT TRECVID Yun Zhai, Zeeshan Rasheed, Mubarak Shah

A self-growing Bayesian network classifier for online learning of human motion patterns. Title

How does Person Identity Recognition Help Multi-Person Tracking?

Applications of Deep Learning to the GEOINT mission. June 2015

FLEXSYS Motion-based Traffic Analysis and Incident Detection

LIBSVX and Video Segmentation Evaluation

Traffic Monitoring Systems. Technology and sensors

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Parallel Data Selection Based on Neurodynamic Optimization in the Era of Big Data

TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service

A Movement Tracking Management Model with Kalman Filtering Global Optimization Techniques and Mahalanobis Distance

Online Learning for Fast Segmentation of Moving Objects

Tracking Pedestrians using Local Spatio-temporal Motion Patterns in Extremely Crowded Scenes

Tracking And Object Classification For Automated Surveillance

Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection

Data Mining Yelp Data - Predicting rating stars from review text

. Learn the number of classes and the structure of each class using similarity between unlabeled training patterns

Urban Vehicle Tracking using a Combined 3D Model Detector and Classifier

Learning the Behavior of Users in a Public Space through Video Tracking

Introduction to Pattern Recognition

WP1: Video Data Analysis

Video-based Animal Behavior Analysis From Multiple Cameras

Segmentation & Clustering

Naive-Deep Face Recognition: Touching the Limit of LFW Benchmark or Not?

Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite

Object Categorization using Co-Occurrence, Location and Appearance

Mobile Tracking and Resource Reservation Scheme for Cellular Networks

Applied mathematics and mathematical statistics

DATA MINING CLUSTER ANALYSIS: BASIC CONCEPTS

ENHANCED WEB IMAGE RE-RANKING USING SEMANTIC SIGNATURES

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network

A method of generating free-route walk-through animation using vehicle-borne video image

Character Image Patterns as Big Data

Big Data: Image & Video Analytics

Data Mining Clustering (2) Sheets are based on the those provided by Tan, Steinbach, and Kumar. Introduction to Data Mining

Particles, Flocks, Herds, Schools

Advanced Methods for Pedestrian and Bicyclist Sensing

A feature-based tracking algorithm for vehicles in intersections

Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences

Dissertation TOPIC MODELS FOR IMAGE RETRIEVAL ON LARGE-SCALE DATABASES. Eva Hörster

A Robust Multiple Object Tracking for Sport Applications 1) Thomas Mauthner, Horst Bischof

Machine Learning for Data Science (CS4786) Lecture 1

A Learning Based Method for Super-Resolution of Low Resolution Images

Detecting and Tracking Moving Objects for Video Surveillance

A Genetic Algorithm-Evolved 3D Point Cloud Descriptor

Big Data Text Mining and Visualization. Anton Heijs

Search Result Optimization using Annotators

A Computer Vision System for Monitoring Production of Fast Food

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang

3D Template-Based Single Camera Multiple Object Tracking

Semantic Recognition: Object Detection and Scene Segmentation

CCTV - Video Analytics for Traffic Management

Support Vector Machine-Based Human Behavior Classification in Crowd through Projection and Star Skeletonization

Speed Performance Improvement of Vehicle Blob Tracking System

The Advantages of Using a Fixed Stereo Vision sensor

9. Text & Documents. Visualizing and Searching Documents. Dr. Thorsten Büring, 20. Dezember 2007, Vorlesung Wintersemester 2007/08

Tracking People and Their Objects

Human Behavior Interpretation System and Modeling

Density-aware person detection and tracking in crowds

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Transcription:

Behavior Analysis in Crowded Environments XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011

Behavior Analysis in Sparse Scenes Zelnik-Manor & Irani CVPR 04

Crowded Environments

Crowded Environments

Outline Introduction Why is behavior analysis in crowded environment interesting? Major challenges Behavior analysis under hierarchical Bayesian models Based local motions Based noisy tracklets Other works Conclusions and future work

Crowd The crowd, an agglomeration of people, presents new characteristics very different from those of the individuals composing it, the sentiments and ideas of all the persons in the gathering take one and the same direction, and their conscious personality vanishes. -- by Le Bon (1841~1931) in The Crowd:A Study of the Popular Mind

Why is behavior analysis in crowded environment interesting? Many places of security interest are crowded Train station Shopping mall Airport Street intersection Crowd control Providing guidelines for planning and designing crowded areas

Why is behavior analysis in crowded environment interesting? The study of pedestrian crowds is an interesting subject of social research Pedestrian behaviors in crowd present new characteristics than individual personalities Self-organization of collective behavior patterns due to nonlinear interactions among pedestrians Variables of pedestrian motions are measurable: leading to a deep insight of other social processes, such opinion formation

Why is behavior analysis in crowded environment interesting? The self-organization phenomena are observed in other fields At medium and high pedestrian densities, the motion of pedestrians shows striking analogies with the motion of gases and fluids [Helbing Statistical Mechanics 97] The self-organization of collective behaviors are also observed in animal groups [Moussaid et al. Topics in Cognitive Science 09]

Interdisciplinary subject Statistical physics: understanding the fundamental mechanism of forming collective behaviors from interactions of individuals Computer graphics: simulating crowd behaviors Computer vision: Learning and detecting collective motion patterns Temporally segmenting the video sequences into different global crowd behaviors Detecting abnormal behaviors

Formation of collective motion Externally planned or organized Self-organization from nonlinear interactions of pedestrians

Major challenges Crowded: it is difficult to do detection and tracking due to frequent occlusions and scene clutters Complex: many different types of behaviors happen together Crowded and complex Crowded but relatively simple

Outline Introduction Why is behavior analysis in crowded environment interesting? Major challenges Behavior analysis under hierarchical Bayesian models Based local motions Based noisy tracklets Other works Conclusions X. Wang, X. Ma, and E. Grimson, Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models, IEEE Trans. on PAMI, Vol. 31, 539-555, 2009. X. Wang, X. Ma, and E. Grimson, Unsupervised Activity Perception by Hierarchical Bayesian Models, CVPR 2007.

Learning motion patterns without tracking Motion patterns are the pathways of moving objects Learning motions patterns from the temporal co-occurrence of moving pixels

0.3 0.25 0.2 0.15 0.1 0.05 0 Our tasks Cluster moving pixels into atomic activities and segment a video sequence into global behaviors in crowded scenes Example of data Model of an atomic activity Model of an global behavior

Our tasks Cluster moving pixels into atomic activities and segment a video sequence into global behaviors in crowded scenes Motion segmentation Video segmentation Abnormality detection Interaction query

0.5 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 0.3 0.25 0.2 0.15 0.1 0.05 0 High level picture of our approach Motion Features (a) Atomic activities modeled as distributions over the feature codebook (b) Global behaviors modeled as distributions over atomic activities (c)

0.5 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 0.3 0.25 0.2 0.15 0.1 0.05 0 Parametric hierarchical Bayesian model 1 2 c j 1 2 Global behavior models (L = 2) 0 c j L Global behavior models z ji 1 2 3 4 1 2 3 4 Atomic activity models (K=4) H k K Atomic activity models x ji N j M j Video clip j (j = 1 M)

Advantages of this hierarchical Bayesian model More compact representation of video clips on the top of atomic activities Number of atomic activity categories (29) versus size of feature codebook (13,824) c j 0 H c L Global behavior models k K Atomic activity models Our model j z ji x ji N j M 0 c L Global behavior models c j x ji N j M Cluster video clips directly using motion feature vectors without atomic activities

Advantages of this hierarchical Bayesian model (cont) Priors of global behaviors help to cluster moving pixels 0 j c j z ji 0 c L Global behavior models j z ji H k K Atomic activity models x ji N j M Cluster moving pixels without modeling interactions Latent Dirichlet Allocation (LDA) [Blei et al. JASA 03] H k K Atomic activity models Our model x ji N j M Two atomic models have ambiguity in yellow circle area

Dual Hierarchical Dirichlet Processes (Dual-HDP) c j c j 0 c L Global behavior models j z ji 0 c Global behavior models j z ji H k K Atomic activity models x ji N j M H k Atomic activity models x ji N j M Parametric hierarchical Bayesian model Dual-HDP Related to the nested HDP [Rodriguez et al. 2006], Transformed HDP [Sudderth et. al IJCV 07], HDP [Teh JASA 04]

Learned atomic activities from a traffic scene (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) (17) (18) (19) (20) (21) (22) (23) (24) (25) (26) (27) (28) (29)

Global behavior I: green light for south/north traffic prior index of atomic activities Top six atomic activities vehicles northbound vehicles northbound vehicles southbound vehicles incoming northbound vehicles incoming southbound vehicles outgoing eastbound

Global behavior II: green light for east/west traffic prior index of atomic activities Top six atomic activities vehicles incoming westbound vehicles outgoing westbound vehicles outgoing southbound vehicles incoming eastbound vehicles outgoing eastbound pedestrians westbound

Global behavior III: left turn signal for east/west traffic prior index of atomic activities Top six atomic activities vehicles turning left eastbound vehicles outgoing northbound vehicles outgoing northbound vehicles incoming eastbound vehicles outgoing eastbound vehicles stopping southbound

Global behavior IV: walk sign prior index of atomic activities Top six atomic activities pedestrians incoming eastbound pedestrians outgoing eastbound pedestrians westbound pedestrians westbound vehicles stopping vehicles stopping

Global behavior V: northbound right turns prior index of atomic activities Top two atomic activities vehicles incoming northbound vehicles outgoing eastbound

Temporal video segmentation green light for east/west traffic green light for south/north traffic left turn signal for east/west traffic walk sign northbound right turns

Confusion matrix of video segmentation Clustering result Manual label The average accuracy is 85.74% using our approach. The average accuracy is 65.6% when modeling atomic activities and global behaviors in two separate steps. The approaches of using a motion feature vector to represent a video clip perform poorly on this data.

Abnormality detection results Top four abnormal video clips

Interaction query vehicles approaching 0.5 0.45 0.4 0.35 0.3 0.25 0.2 pedestrians crossing the street 0.15 0.1 0.05 0 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 Query distribution

Top four retrieved jay-walking examples

Precision and recall of jay-walking retrieval 1 0.9 Precision 0.8 0.7 0.6 0.5 0.4 Recall 0.3 0.2 0.1 0 5 10 15 20 25 30 35 37 # video clips retrieved There are totally 18 instances, all found among the top 37 video clips out of 540 video clips.

Train Station Scene Atomic activities

A scene where it fails

Learned models of anatomic activities

Outline Introduction Why is behavior analysis in crowded environment interesting? Major challenges Behavior analysis under hierarchical Bayesian models Based local motions Based noisy tracklets Other works Conclusions B. Zhou, X. Wang, and X. Tang, Random Field Topic for Semantic Region Analysis, CVPR 2011.

Tracklets Fragments of trajectories obtained by weak trackers. They are short and very noisy.

Statistics of 47,866 Tracklets The size of the scene is 1080 x 1920 Sources: regions where objects appear Sinks: regions where objects disappear Histogram of the lengths of tracklets Sources and sinks of the scene Numbers of tracklets connecting sources and sinks

Directly Clustering Tracklets Spectral clustering + Hausdorff distance: X. Wang et al ECCV 06

Random Field Topic Models MRF models the dependency between tracklets based on their spatial and temporal consistency and velocity similarity Model the sources and sinks B. Zhou, X. Wang, and X. Tang, Random Field Topic for Semantic Region Analysis, CVPR 2011.

Learned Models of Paths

Tracklet Clustering Results

Potential Applications Estimating the transition probabilities and traffic flows between sources and sinks Predicting the behaviors of pedestrians

Outline Introduction Why is behavior analysis in crowded environment interesting? Major challenges Behavior analysis under hierarchical Bayesian models Based local motions Based noisy tracklets Other works Conclusions

Other Works Crowd flow segmentation from Lagrangian coherent structures S. Ali and M. Shah, A Lagrangian particle dynamic approach for crowd flow segmentation and stability analysis, CVPR 2007 Flow field segmentation using a Lie algebraic approach D. Lin, J. Fisher, E. Grimson, Learning visual flows: A Lie algebraic approach, CVPR 2009.

Other Works Abnormal crowd behavior detection using social force model Mehran et al. CVPR 09 Modeling social behavior for multitarget tracking Pellegrini et al. ICCV 09 Crowd behavior analysis across multiple camera views Loy et al. CVPR 2009

Conclusions Behavior analysis in crowded environments receives a lot attentions from different fields It is challenging problem from the computer vision point of view Propose an nonparametric hierarchical Bayesian model to learn behavior models in crowded environments from local motions It models single-agent activities, multi-agent interactions and global behaviors at different hierarchical levels Propose a random field topic model to learn behavior models from tracklets

Future Work Integrating the macroscopic models and the microscopic models Modeling the dynamic variations of the crowd behaviors Predicting the behaviors of individuals and the crowds Using the behavior models to improve detection and tracking M. Wang and X. Wang, Automatic Adaptation of a Generic Pedestrian Detector to a Specific Traffic Scene, CVPR 2011 Atomic activities related to vehicles Atomic activities related to pedestrians

Thank you!