PEDESTRIAN DETECTION BASED ON FOREGROUND SEGMENTATION IN VIDEO SURVEILLANCE

Similar documents
The Visual Internet of Things System Based on Depth Camera

Face Recognition in Low-resolution Images by Using Local Zernike Moments

VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS

Local features and matching. Image classification & object localization

REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network

Vision based Vehicle Tracking using a high angle camera

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE.

EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM

Robust Real-Time Face Detection

A Reliability Point and Kalman Filter-based Vehicle Tracking Technique

AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION

Application of Face Recognition to Person Matching in Trains

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report

A secure face tracking system

An Active Head Tracking System for Distance Education and Videoconferencing Applications

QUALITY TESTING OF WATER PUMP PULLEY USING IMAGE PROCESSING

Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) ( ) Roman Kern. KTI, TU Graz

Video Surveillance System for Security Applications

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow

Urban Vehicle Tracking using a Combined 3D Model Detector and Classifier

A Model-based Vehicle Segmentation Method for Tracking

IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS

Part-Based Pedestrian Detection and Tracking for Driver Assistance using two stage Classifier

REAL-TIME FACE AND HAND DETECTION FOR VIDEOCONFERENCING ON A MOBILE DEVICE. Frank M. Ciaramello and Sheila S. Hemami

Canny Edge Detection

A Study of Automatic License Plate Recognition Algorithms and Techniques

EXPLORING IMAGE-BASED CLASSIFICATION TO DETECT VEHICLE MAKE AND MODEL FINAL REPORT

Analecta Vol. 8, No. 2 ISSN

Neural Network based Vehicle Classification for Intelligent Traffic Control

Building an Advanced Invariant Real-Time Human Tracking System

A ROBUST BACKGROUND REMOVAL ALGORTIHMS

UNIVERSITY OF CENTRAL FLORIDA AT TRECVID Yun Zhai, Zeeshan Rasheed, Mubarak Shah

Support Vector Machine-Based Human Behavior Classification in Crowd through Projection and Star Skeletonization

3D Model based Object Class Detection in An Arbitrary View

A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA

Probabilistic Latent Semantic Analysis (plsa)

A Learning Based Method for Super-Resolution of Low Resolution Images

Automated Attendance Management System using Face Recognition

Human behavior analysis from videos using optical flow

Low-resolution Character Recognition by Video-based Super-resolution

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

CS 534: Computer Vision 3D Model-based recognition

Finding people in repeated shots of the same scene

Circle Object Recognition Based on Monocular Vision for Home Security Robot

International Journal of Advanced Information in Arts, Science & Management Vol.2, No.2, December 2014

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition

Tracking Groups of Pedestrians in Video Sequences

The use of computer vision technologies to augment human monitoring of secure computing facilities

CLOUD CHARACTERIZATION USING LOCAL TEXTURE INFORMATION. Antti Isosalo, Markus Turtinen and Matti Pietikäinen

Automatic Traffic Estimation Using Image Processing


Tracking performance evaluation on PETS 2015 Challenge datasets

PHOTOGRAMMETRIC TECHNIQUES FOR MEASUREMENTS IN WOODWORKING INDUSTRY

The goal is multiply object tracking by detection with application on pedestrians.

A self-growing Bayesian network classifier for online learning of human motion patterns. Title

Big Data: Image & Video Analytics

A Dynamic Approach to Extract Texts and Captions from Videos

Tracking And Object Classification For Automated Surveillance

An Efficient Geometric feature based License Plate Localization and Stop Line Violation Detection System

How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm

Mean-Shift Tracking with Random Sampling

The Scientific Data Mining Process

Morphological segmentation of histology cell images

Tracking and Recognition in Sports Videos

ECE 533 Project Report Ashish Dhawan Aditi R. Ganesan

An Experimental Comparison of Online Object Tracking Algorithms

CS231M Project Report - Automated Real-Time Face Tracking and Blending

MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos Iasonas.kokkinos@ecp.fr

Open-Set Face Recognition-based Visitor Interface System

A Real Time Hand Tracking System for Interactive Applications

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang

Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006

Analysis of Preview Behavior in E-Book System

On-line Boosting and Vision

Feature Tracking and Optical Flow

A Genetic Algorithm-Evolved 3D Point Cloud Descriptor

The Big Data methodology in computer vision systems

Vision based approach to human fall detection

Detection and Recognition of Mixed Traffic for Driver Assistance System

Automatic Calibration of an In-vehicle Gaze Tracking System Using Driver s Typical Gaze Behavior

PageX: An Integrated Document Processing and Management Software for Digital Libraries

Algorithm for License Plate Localization and Recognition for Tanzania Car Plate Numbers

Jiří Matas. Hough Transform

Poker Vision: Playing Cards and Chips Identification based on Image Processing

Determining optimal window size for texture feature extraction methods

Automatic Maritime Surveillance with Visual Target Detection

Speed Performance Improvement of Vehicle Blob Tracking System

A Study on M2M-based AR Multiple Objects Loading Technology using PPHT

Navigation Aid And Label Reading With Voice Communication For Visually Impaired People

Human Pose Estimation from RGB Input Using Synthetic Training Data

ISSN: A Review: Image Retrieval Using Web Multimedia Mining

Real-time Traffic Congestion Detection Based on Video Analysis

Classifying Manipulation Primitives from Visual Data

siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service

Machine Learning for Medical Image Analysis. A. Criminisi & the InnerEye MSRC

Robust Pedestrian Detection and Tracking From A Moving Vehicle

MAVIparticle Modular Algorithms for 3D Particle Characterization

Fall detection in the elderly by head tracking

Real-time pedestrian detection in FIR and grayscale images

Transcription:

0 th April 03. Vol. 50 No. 005-03 JATIT & LLS. All rights reserved. ISSN: 99-8645 www.jatit.org E-ISSN: 87-395 PEDESTRIAN DETECTION BASED ON FOREGROUND SEGMENTATION IN VIDEO SURVEILLANCE MING XIN, CAILI FANG Institute of Image Processing and Pattern Recognition, Henan University, Kaifeng 475004, China School of Computer and Information Engineering, Henan University, Kaifeng 475004, China E-mail: xinming@henu.edu.cn, fcl@henu.edu.cn ABSTRACT Real-time detection and tracing of moving pedestrians in image sequences is a fundamental tas in many computation vision applications such as automated visual surveillance system. In this paper we propose a human detection method based on foreground segmentation, and the detection speed is satisfying for the application of video surveillance. During detection, unlie the exhaustive scan typically used in general human detection systems, in order to avoid scanning regions lie the sy, the foreground segmentation stage is firstly implemented in the video surveillance sequences by utilizing Gaussian mixture model algorithm, and then, human detection stage is executed on the regions of interest (ROI) extracted from the video surveillance sequence frame. In contrast with the exhaustive scan without explicit segmentation, our proposed approach can meet the real-time requirement. Carefully designed experiments demonstrate the superiority of our proposed approach. Keywords: Detection, Object Tracing, Foreground Segmentation. INTRODUCTION Effective techniques for pedestrian detection are of special interest in computer vision since many applications involve people s locating and tracing. Thus, significant research has been devoted to detecting, locating and tracing people in images and videos. Both pedestrian tracing and pedestrian locating are the problem of finding the positions of all pedestrian objects in a video surveillance sequence. More specifically, the goal is to find the bounding box for each pedestrian object in video surveillance application. Considerable approaches to pedestrian detection have been explored over the last few years. One common approach is to use a sliding window to scan the image exhaustively in scale-space, and classify each candidate window individually [,], for which only a binary classifier is needed to indicate whether a target is pedestrian or not from a given window. However, this procedure has two main shortcomings: ) many irrelevant regions are passed to the next module (e.g., sy regions or ROIs inconsistent with perspective), which increases the potential number of false positives. ) The number of candidates is large, which maes it difficult to meet real-time requirements, although some proposals have recently studied this problem [3,4]. To meet the real-time requirement in video surveillance application, variable improved approaches are explored in recent years. Zhang et al. [5] propose a multi-resolution framewor to reduce the computational cost. Leibe et al. [6] present a technique termed as the implicit shape model. The idea is to use a eypoints detector, Hessian- Laplace in this case, then compute a shape context descriptor for each eypoint, and finally, cluster them to construct a codeboo. During recognition, each detected eypoint is matched to a cluster, which then votes for an object hypothesis using Hough voting, thus avoiding a candidate generation step. Zhu et al. [7] propose a rejection cascade using HOG features. Also based on HOG features, Begard et al. [8] address the problem of real-time pedestrian detection by considering different implementations of the AdaBoost algorithm. In general, the pose of surveillance camera does not change. According to the properties, the foreground segmentation stage is firstly implemented in the video surveillance sequences by utilizing Gaussian mixture model algorithm. Therefore, the candidate pedestrian regions are 450

0 th April 03. Vol. 50 No. 005-03 JATIT & LLS. All rights reserved. ISSN: 99-8645 www.jatit.org E-ISSN: 87-395 firstly obtained. There are twofold benefits with respect to the following pedestrian detection stage: ) The reduction of the image area to be processed means that the overall system time can also be reduced; ) by submitting fewer bacground regions for classification, the rate of false alarms can also be reduced while the same detection rate is maintained. Foreground segmentation and pedestrian detection do not exist in isolation. Both of them can provide useful prior or posterior information to each other. The foreground segmentation stage can effectively reduce the detection area such as sy regions or ROIs inconsistent with perspective. Simultaneously, the pedestrian detection stage can predict future pedestrians positions, and feed the foreground segmentation algorithm with precandidates. The algorithm flowchart is illustrated as Fig.. Video surveillance sequences input Foreground segmentation by Gaussian mixture model detection tracing position output The remainder of this wor is organized as follows. In Sec., we review Gaussian mixture model algorithm briefly. Sec. 3. and Sec. 3. introduce the Histogram of Oriented Gradients (HOG) and Local Binary Pattern (LBP) features, respectively. Experimental results are given in Sec. 4, followed by conclusions in Sec. 5.. GAUSSIAN MIXTURE MODEL Gaussian mixture model plays an important role in foreground segmentation fields due to its analytical tractability, asymptotic properties, and ease of implementation. For foreground segmentation, the basic Gaussian mixture model algorithm models each bacground pixel by a mixture of K Gaussian distributions. Different Gaussians are assumed to represent different colors. The weight parameters of the mixture represent the time proportions that those colors stay in the scene. The brief introduction is as follows: Each pixel in the detection frame is modeled by a mixture of K Gaussian distributions. The probability that a certain pixel has a value of x t at time t can be written as and ( t) wp j ( xθ t; j) p x where j= = () j= w j is the weight subject to Posterior information feedbac Fig. The Proposed Algorithm Flowchart w j > 0 wj =. Each component s density ( ; j ) p x θ is a normal probability distribution represented by ( ; θ ) = p( x; µ, Σ ) p x = ( π ) D Σ where θ { µ, } e Σ T ( x µ ) ( x µ ) () = Σ represents the th component center and covariance, respectively. Given a group of N independent and identically N distribution samples X = { x, x x }, the loglielihood corresponding to a -component mixture is i= i= j= i ( θ) log px ( ; θ) = log Π px; N N = i ( θ j) log wp x; j (3) Because the maximum lielihood estimate of θ cannot be solved analytically, Expectation Maximization algorithm has been widely used to estimate the parameters of Gaussian mixture model. It is an iterative algorithm, and the details were introduced in literatures [9]; 3. HOG AND LBP FEATURES 45

0 th April 03. Vol. 50 No. 005-03 JATIT & LLS. All rights reserved. ISSN: 99-8645 www.jatit.org E-ISSN: 87-395 Considerable literatures have shown that significant improvement in human detection can be achieved by combining different low-level features. A strong set of features provides high discriminatory power. In this section, we will introduce Histogram of Oriented Gradients (HOG) features and Local Binary Pattern (LBP) features, respectively. 3. Histogram Of Oriented Gradients In literature [,0], each detection window is represented as a set of overlapping blocs. Each bloc contains four non-overlapping regions called cells, figure displays the detailed position relationship between cells and blocs. Cell One Bloc consists of Cells Blocs with overlap in the Scan Window Fig. The Position Relationship Between Cells And Blocs The calculation process of HOG descriptor is as follows:. First, for each pixel of the cell, horizontal gradient G and vertical gradient G is calculation x through gradient operator [, 0,]. (, ) = ( +, ) (, ) (, ) = (, + ) (, ) Gx x y I x y I x y Gy xy I xy I xy (4). According the horizontal gradient G x and vertical gradient G, for each pixel of the cell, the gradient orientation Og (, ) ( xy, ) are computed as Vg y y xy and gradient values ( ) ( ) ( ) O xy, = arctan( G xy, / G xy, ) (5) g y x ( ) ( ) ( ) V xy, = ( G xy, ) + ( G xy, ) (6) g x y 3. According to figure, obtaining the interesting region of cells and blocs. 4. Finally, each cell is represented as a 9-bin histogram with each bin corresponding to a particular gradient orientation, here, gradient orientation is divided into 9 bins, each bin includes 0 0 interval. Then, each bloc is represented by a 36-D unit-length vector. To reduce the influence of large gradient magnitudes, the L-Hys normalization was applied. In this way, a 3780-D feature vector is used to represent a detection window. 3. Local Binary Pattern The original LBP operator introduced in literature [] labels the pixels of an image by thresholding the 3x3 neighborhood of each pixel with the center pixel value and considering the result as a binary number. Then the 56-bin histogram of the labels can be used as a texture descriptor. Fig.3 shows an example of LBP calculati O xy, (0,80 ]. where, ( ) 0 0 g 45

0 th April 03. Vol. 50 No. 005-03 JATIT & LLS. All rights reserved. ISSN: 99-8645 www.jatit.org E-ISSN: 87-395 Fig. 3 The Basic LBP Operator In order to deal with textures at different scales, the LBP operator is later extended to use neighborhoods of different sizes. For example, the operator LBP4, uses 4 neighbors while LBP6, considers the 6 neighbors on a circle of radius. In general, the operator LBP PR, refers to a neighborhood size of P equally spaced pixels on a circle of radius R that form a circularly symmetric neighbor set. Another extension to the original operator is the definition of so called uniform patterns. Ojala et al. [] defined these fundamental patterns as those with a small number of bitwise transitions from 0 to and vice versa. For example, 00000000 and contain 0 transition while 000000 and 00 contain transitions and so on. Accumulating the patterns which have more than transitions into a single bin yields an LBP descriptor. To consider the shape information of humans, we divide human images into M small non-overlapping cells CC,, CM. The LBP histograms extracted from each cell are then concatenated into a single, spatially enhanced feature histogram. Since feature combination can significantly improve detector performance, we combine gradient based HOG feature and texture based LBP feature, and the proposed human detection algorithm achieves desiring performance. More detailed experimental results are shown in the next section. 4. EXPERIMENTAL RESULTS To evaluate our proposed method, we have tested it on real video surveillance sequences. These sequences include objects such as pedestrian, cars etc. In general, pedestrians are interesting objects in video surveillance systems, and the main goal of surveillance system is to automatically detect and locate pedestrians in video sequences once they are initialized. (a) (b) (c) Fig.4 Detection Based On Foreground Segmentation Fig.4a displays the input frame; Fig.4b presents the foreground segmentation results derived by Gaussian mixture model; Fig.4c demonstrates the pedestrian detection results by utilizing our proposed approach. Fig.4 illustrates that pedestrian detection algorithm this paper proposed can avoid scanning the total image by utilizing the foreground segmentation step, which submits fewer foreground regions for classification. The reduction of the image area to be processed means that the overall system time can also be reduced. According the detection results, our proposed algorithm can effectively detect the location and size of pedestrian by excluding the interference foreground region yielded by moving car. 5. CONCLUSIONS This paper presents an effective pedestrian detection algorithm, and which can meet the realtime requirement in a certain extent. The general human detection systems adopt the exhaustive scan strategy, which maes it difficult to fulfill realtime requirements. In fact, just lie our proposed approach, we can tae advantage of some application prior nowledge so that the number of ROIs to process can be greatly reduced. However, there are still some drawbacs in this approach, such as the camera pose must be fixed. In the future wor, we will try to explore more effective and real-time pedestrian detection algorithm. 453

0 th April 03. Vol. 50 No. 005-03 JATIT & LLS. All rights reserved. ISSN: 99-8645 www.jatit.org E-ISSN: 87-395 ACKNOWLEDGMENTS The wor is supported by National Science Foundation of China (Grant No. 60979); REFERENCES: [] N. Dalal and B. Triggs, Histogram of oriented gradient for human detection, In IEEE Conference on Computer Vision and Pattern Recognition, 005, pp.886-893. [] P. Viola, M. J. Jones, and D. Snow. Detecting pedestrians using patterns of motion and appearance. International Journal of Computer Vision, 63(), 005, pp.53-6. [3] C. Woje, G. Doro, A. Schulz, and B. Schiele, Sliding-Windows for Rapid Object Class Localization: A Parallel Technique, Proc. Symp. German Assoc. for Pattern Recognition, 008, pp.7-8. [4] W. Zhang, G. Zelinsy, and D. Samaras, Real- Time Accurate Object Detection Using Multiple Resolutions, Proc. Int l Conf. Computer Vision, 007, pp.-8. [5] W. Zhang, G. Zelinsy, and D. Samaras. Realtime accurate object detection using multiple resolutions. In International Conference on Computer Vision, 007. [6] B. Leibe, A. Leonardis, and B. Schiele, Robust Object Detection with Interleaved Categorization and Segmentation, International journal of computer vision, 77(-3), 008, pp.59-89. [7] Q. Zhu, M.-C. Yeh, K.-T. Cheng, and S. Avidan. Fast human detection using a cascade of histograms of oriented gradients. In IEEE Conference on Computer Vision and Pattern Recognition, 006, pp.49-498. [8] J. Begard, N. Allezard, and P. Sayd. Real-time human detection in urban scenes: Local descriptors and classifiers selection with adaboost-lie algorithms. In IEEE Conference on Computer Vision and Pattern Recognition Worshops, 008. [9] P. KadewTraKuPong and R. Bowden, An improved adaptive bacground mixture model for real-time tracing with shadow detection. in Proc. nd European Worshp on Advanced Video-Based Surveillance Systems, 00. [0] Y.-T. Chen and C.-S. Chen. Fast human detection using a novel boosted cascading structure with meta stages. IEEE Trans. on Image Processing, 7(8), 008, pp.45 464. [] Timo Ojala, Matti Pietiäinen, Topi Mäenpää, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Transactions on Pattern Analysis and Machine Intelligence, 4(7), 00, pp.97 987. 454