Spatio-Temporal Nonparametric Background Modeling and Subtraction



Similar documents
Vision based Vehicle Tracking using a high angle camera

Real-Time Tracking of Pedestrians and Vehicles

Fall detection in the elderly by head tracking

AUTOMATIC CROWD ANALYSIS FROM VERY HIGH RESOLUTION SATELLITE IMAGES

Autonomous Monitoring of Cliff Nesting Seabirds using Computer Vision

Tracking and Recognition in Sports Videos

Virtual Mouse Using a Webcam

Mean-Shift Tracking with Random Sampling

Mouse Control using a Web Camera based on Colour Detection


Real-Time Event Detection System for Intelligent Video Surveillance

VSSN 06 Algorithm Competition

HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT

Canny Edge Detection

Tracking And Object Classification For Automated Surveillance

Video Surveillance System for Security Applications

A Real Time Hand Tracking System for Interactive Applications

Object tracking & Motion detection in video sequences

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Machine Learning in Statistical Arbitrage

Robert Collins CSE598G. More on Mean-shift. R.Collins, CSE, PSU CSE598G Spring 2006

Computational Foundations of Cognitive Science

HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER

A New Image Edge Detection Method using Quality-based Clustering. Bijay Neupane Zeyar Aung Wei Lee Woon. Technical Report DNA #

Edge detection. (Trucco, Chapt 4 AND Jain et al., Chapt 5) -Edges are significant local changes of intensity in an image.

A Learning Based Method for Super-Resolution of Low Resolution Images

Object tracking in video scenes

Efficient Background Subtraction and Shadow Removal Technique for Multiple Human object Tracking

degrees of freedom and are able to adapt to the task they are supposed to do [Gupta].

Automatic Traffic Estimation Using Image Processing

Parallel Computing of Kernel Density Estimates with MPI

jorge s. marques image processing

Indoor Surveillance System Using Android Platform

Morphological segmentation of histology cell images

Tracking performance evaluation on PETS 2015 Challenge datasets

NPTEL STRUCTURAL RELIABILITY

CCTV - Video Analytics for Traffic Management

A ROBUST BACKGROUND REMOVAL ALGORTIHMS

Coding and decoding with convolutional codes. The Viterbi Algor

A Method of Caption Detection in News Video

A Method for Controlling Mouse Movement using a Real- Time Camera

False alarm in outdoor environments

Real Time Target Tracking with Pan Tilt Zoom Camera

High Quality Image Magnification using Cross-Scale Self-Similarity

An Active Head Tracking System for Distance Education and Videoconferencing Applications

Real time vehicle detection and tracking on multiple lanes

OBJECT TRACKING USING LOG-POLAR TRANSFORMATION

Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006

Assessment of Camera Phone Distortion and Implications for Watermarking

VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS

A General Framework for Tracking Objects in a Multi-Camera Environment

Human behavior analysis from videos using optical flow

The Dynamic Background Generation Scheme Using an Image Frame

International Journal of Innovative Research in Computer and Communication Engineering. (A High Impact Factor, Monthly, Peer Reviewed Journal)

Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition

Support Vector Machine-Based Human Behavior Classification in Crowd through Projection and Star Skeletonization

Automatic Extraction of Signatures from Bank Cheques and other Documents

Acknowledgments. Data Mining with Regression. Data Mining Context. Overview. Colleagues

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions

3D Vehicle Extraction and Tracking from Multiple Viewpoints for Traffic Monitoring by using Probability Fusion Map

Linear Threshold Units

DYNAMIC RANGE IMPROVEMENT THROUGH MULTIPLE EXPOSURES. Mark A. Robertson, Sean Borman, and Robert L. Stevenson

Implementation of Canny Edge Detector of color images on CELL/B.E. Architecture.

An Improved Adaptive Background Mixture Model for Realtime Tracking with Shadow Detection

Low-resolution Character Recognition by Video-based Super-resolution

Linear Discrimination. Linear Discrimination. Linear Discrimination. Linearly Separable Systems Pairwise Separation. Steven J Zeil.

Density Map Visualization for Overlapping Bicycle Trajectories

One-Class Classifiers: A Review and Analysis of Suitability in the Context of Mobile-Masquerader Detection

Chromatic Improvement of Backgrounds Images Captured with Environmental Pollution Using Retinex Model

Color Segmentation Based Depth Image Filtering

Journal of Industrial Engineering Research. Adaptive sequence of Key Pose Detection for Human Action Recognition

Vision-Based Human Tracking and Activity Recognition

521466S Machine Vision Assignment #7 Hough transform

Real-time Traffic Congestion Detection Based on Video Analysis

The Scientific Data Mining Process

Tracking in flussi video 3D. Ing. Samuele Salti

The Visual Internet of Things System Based on Depth Camera

Two Topics in Parametric Integration Applied to Stochastic Simulation in Industrial Engineering

Linköping University Electronic Press

Bayesian Image Super-Resolution

Component Ordering in Independent Component Analysis Based on Data Power

Maximum Likelihood Estimation of ADC Parameters from Sine Wave Test Data. László Balogh, Balázs Fodor, Attila Sárhegyi, and István Kollár

A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA

Real-time pedestrian detection in FIR and grayscale images

Automatic parameter regulation for a tracking system with an auto-critical function

A Fast Algorithm for Multilevel Thresholding

Learning with Dynamic Group Sparsity

Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf. Flow Visualization. Image-Based Methods (integration-based)

Real-time Visual Tracker by Stream Processing

Tracking Groups of Pedestrians in Video Sequences

EXPLORING IMAGE-BASED CLASSIFICATION TO DETECT VEHICLE MAKE AND MODEL FINAL REPORT

Transcription:

Spatio-Temporal Nonparametric Background Modeling and Subtraction Raviteja Vemulapalli and R. Aravind Department of Electrical engineering Indian Institute of Technology, Madras

Background subtraction Technique used for separating moving objects (foreground) from rest of the scene (background) in a video. Figure: Block diagram of background subtraction process

Nonparametric background model Current intensity of pixel (x, y): I (x, y) Set of past intensity samples: {I 1 (x, y), I 2 (x, y),..., I N (x, y)} Intensity PDF of the pixel (x, y) using a kernel K. ( ) p = p I (x, y)/{i i (x, y)} N i=1 = 1 N N K(I (x, y) I i (x, y)). i=1 Gaussian kernel p = 1 N N 1 i=1 (2π) d 2 Σ 1 2 { exp 1 } 2 [I (x, y) I i(x, y)] T Σ 1 [I (x, y) I i (x, y)]

Nonparametric background model For independent colour channels σ 2 1 0 0 Σ = 0 σ2 2 0 0 0 σ3 2 p = 1 N N 3 i=1 j=1 1 2πσj 2 exp { [I j (x, y) I j i 2σ 2 j } (x, y)]2 where I j i (x, y) denotes the j th colour component of I i (x, y). p > thr = pixel (x, y) is background. p < thr = pixel (x, y) is foreground.

Short term and Long term models Short term model: Uses the most recent N background samples. New sample is added to the sample set only if it belongs to background. Long term model: Uses N samples taken from W (> N) past samples. Every new sample is added to the sample set irrespective of whether it belongs to background or foreground.

Foreground detection Combination of short term and long term models Combination results (0 for background and 1 for foreground) Short Term Model Long Term Model Final Result O ST (x, y) = 0 O LT (x, y) = 0 O(x, y) = 0 O ST (x, y) = 0 O LT (x, y) = 1 O(x, y) = 0 O ST (x, y) = 1 O LT (x, y) = 0 O(x, y) = O (x, y) O ST (x, y) = 1 O LT (x, y) = 1 O(x, y) = 1 O (x, y) = { 1 if 0 else 1 1 i= 1 i= 1 O ST (x i, y j)o LT (x i, y j) 0

Spatio-temporal nonparametric model for gray-scale videos 3 3 blocks instead of individual pixels. Block centered on (x, y) in the current frame: F (x, y) Set of past samples: {F 1 (x, y), F 2 (x, y),..., F N (x, y)} Intensity PDF of this block using a kernel K ( ) f = p F (x, y)/ {F i (x, y)} N i=1 = 1 N Gaussian kernel N K(F (x, y) F i (x, y)) i=1 f = 1 N N 1 i=1 (2π) d 2 Σ 1 2 e 1 2 [F (x,y) F i (x,y)] T Σ 1 [F (x,y) F i (x,y)]

Hyperspherical kernel Hyperspherical kernel instead of Gaussian to reduce computational complexity. Hyperspherical kernel of radius r 1 if u r K(u) = V 0 otherwise Probability density becomes M = f = M N V N ( ) F (x, y) Fi (x, y) φ r { 1 if u 1 φ(u) = 0 otherwise i=1

Foreground detection f > thr = pixel (x, y) is background. f < thr = pixel (x, y) is foreground. Both short term and long term models used. Combination rules similar to nonparametric model. Tradeoff between computational efficiency and foreground detection. Hyperspherical kernel computationally simple compared to Gaussian kernel. Gaussian kernel gives better estimate of the PDF and hence better detection results compared to hyperspherical kernel.

Model for colour videos Color channels assumed to be independent. Each channel processed separately like a gray-scale video. R(x, y), G(x, y), B(x, y) Processing results of the three colour channels at pixel (x, y) (0 for background and 1 for foreground). Final result O(x, y) = R(x, y) G(x, y) B(x, y)

Results: Lab video Detection results for 346 th frame of lab video under different noise levels. Figure: (a)original frames corrupted by noise; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Lab video Detection results for 375 th frame of lab video under different noise levels. Figure: (a)original frames corrupted by noise; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Lab video Detection results for 408 th frame of lab video under different noise levels. Figure: (a)original frames corrupted by noise; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Lab video Detection results for 434 th frame of lab video under different noise levels. Figure: (a)original frames corrupted by noise; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Crowd video Detection results for 253 rd, 332 nd and 340 th frames of crowd video. Figure: (a)original frames corrupted by noise; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Crowd video Detection results for 350 th, 378 th and 381 st frames of crowd video. Figure: (a)original frames corrupted by noise; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Bottle video Detection results for 251 st, 291 st and 300 th frames of crowd video. Figure: (a)original frames; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Bottle video Detection results for 302 nd, 306 th and 328 th frames of bottle video. Figure: (a)original frames; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Ducks video Detection results for 292 nd, 302 nd and 307 th frames of ducks video. Figure: (a)original frames; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Results: Ducks video Detection results for 340 th, 363 rd and 372 nd frames of ducks video. Figure: (a)original frames; (b)detection results of the proposed model; (c)detection results of nonparametric model for low threshold; (d)detection results of nonparametric model for high threshold.

Conclusion and future work Advantages of the proposed approach: Computationally simple. Performs very well even in the cases of noisy videos and dynamic backgrounds. Future work Considering all colour channels and using incremental PCA to avoid dealing with 27-dimensional data. Making the parameters (kernel radius and threshold) adaptive.

Applications Traffic monitoring Human motion estimation and tracking Human activity recognition Abandoned object detection Crowd density and behaviour estimation Silhouette extraction for object recognition

References A. Elgammal, D. Harwood and L. Davis, Nonparametric model for background subtraction, in European conference on Computer Vision, pages 751-767, Dublin, Ireland, May 2000. T.Tanka, A.Shimada, D.Arita, R.Taniguchi, A fast algorithm for adaptive background model construction using parzen density estimation, in IEEE International Conference on Advanced Video and Signal based Surveillance, pages 528-533, London, Sep 2007. A. Monnet, A. Mittal, N. Paragios and v. Ramesh, Background modeling and subtraction of dynamic scenes, in International Conference on Computer Vision, pages 1305-1312, Nice, France, Oct 2003.