Visual Tracking of Athletes in Volleyball Sport Videos

Similar documents
Tracking and Recognition in Sports Videos

Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006

Vision based Vehicle Tracking using a high angle camera

A Robust Multiple Object Tracking for Sport Applications 1) Thomas Mauthner, Horst Bischof

Tracking Groups of Pedestrians in Video Sequences

Mean-Shift Tracking with Random Sampling

VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS

Automatic Traffic Estimation Using Image Processing

Tracking in flussi video 3D. Ing. Samuele Salti

A Reliability Point and Kalman Filter-based Vehicle Tracking Technique

Deterministic Sampling-based Switching Kalman Filtering for Vehicle Tracking

Rafael Martín & José M. Martínez

Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011

Video Surveillance System for Security Applications

International Journal of Innovative Research in Computer and Communication Engineering. (A High Impact Factor, Monthly, Peer Reviewed Journal)

Real Time Target Tracking with Pan Tilt Zoom Camera

The Visual Internet of Things System Based on Depth Camera

An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network

Vehicle Tracking in Occlusion and Clutter

Journal of Industrial Engineering Research. Adaptive sequence of Key Pose Detection for Human Action Recognition

Efficient Background Subtraction and Shadow Removal Technique for Multiple Human object Tracking

False alarm in outdoor environments

Human behavior analysis from videos using optical flow

High Quality Image Magnification using Cross-Scale Self-Similarity

VSSN 06 Algorithm Competition

Speed Performance Improvement of Vehicle Blob Tracking System

Low-resolution Image Processing based on FPGA

Object Tracking System Using Approximate Median Filter, Kalman Filter and Dynamic Template Matching

Real-time Person Detection and Tracking in Panoramic Video

On-line Tracking Groups of Pedestrians with Bayesian Networks

Component Ordering in Independent Component Analysis Based on Data Power

Real-Time Tracking of Pedestrians and Vehicles

Spatio-Temporal Nonparametric Background Modeling and Subtraction

Design of Multi-camera Based Acts Monitoring System for Effective Remote Monitoring Control

REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING

VEHICLE TRACKING USING ACOUSTIC AND VIDEO SENSORS

3D Vehicle Extraction and Tracking from Multiple Viewpoints for Traffic Monitoring by using Probability Fusion Map

Low-resolution Character Recognition by Video-based Super-resolution


Building an Advanced Invariant Real-Time Human Tracking System

Real-time Visual Tracker by Stream Processing

Neural Network based Vehicle Classification for Intelligent Traffic Control

OBJECT TRACKING USING LOG-POLAR TRANSFORMATION

A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms

Real time vehicle detection and tracking on multiple lanes

An automatic system for sports analytics in multi-camera tennis videos

Finding people in repeated shots of the same scene

Comparing Improved Versions of K-Means and Subtractive Clustering in a Tracking Application

Urban Vehicle Tracking using a Combined 3D Model Detector and Classifier

Support Vector Machine-Based Human Behavior Classification in Crowd through Projection and Star Skeletonization

Automatic parameter regulation for a tracking system with an auto-critical function

A Real Time Hand Tracking System for Interactive Applications

A Learning Based Method for Super-Resolution of Low Resolution Images

ROBUST VEHICLE TRACKING IN VIDEO IMAGES BEING TAKEN FROM A HELICOPTER

Kristine L. Bell and Harry L. Van Trees. Center of Excellence in C 3 I George Mason University Fairfax, VA , USA kbell@gmu.edu, hlv@gmu.

Probabilistic Latent Semantic Analysis (plsa)

Object tracking in video scenes

Detecting and Tracking Moving Objects for Video Surveillance

Object tracking & Motion detection in video sequences

Tracking And Object Classification For Automated Surveillance

A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA

A General Framework for Tracking Objects in a Multi-Camera Environment

Edge tracking for motion segmentation and depth ordering

How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm

How does the Kinect work? John MacCormick

Traffic Flow Monitoring in Crowded Cities

A Study of Automatic License Plate Recognition Algorithms and Techniques

Novelty Detection in image recognition using IRF Neural Networks properties

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

Tracking performance evaluation on PETS 2015 Challenge datasets

SPECIAL PERTURBATIONS UNCORRELATED TRACK PROCESSING

Autonomous Monitoring of Cliff Nesting Seabirds using Computer Vision

How does Person Identity Recognition Help Multi-Person Tracking?

Object Tracking for Laparoscopic Surgery Using the Adaptive Mean-Shift Kalman Algorithm

Evaluating the Performance of Systems for Tracking Football Players and Ball

Comparative Analysis of EM Clustering Algorithm and Density Based Clustering Algorithm Using WEKA tool.

Super-resolution method based on edge feature for high resolution imaging

Analecta Vol. 8, No. 2 ISSN

Environmental Remote Sensing GEOG 2021

Bayesian Statistics: Indian Buffet Process

Face Recognition in Low-resolution Images by Using Local Zernike Moments

Object Tracking System Using Motion Detection

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Bayesian Image Super-Resolution

Transcription:

Visual Tracking of Athletes in Volleyball Sport Videos H.Salehifar 1 and A.Bastanfard 2 1 Faculty of Electrical, Computer and IT, Islamic Azad University, Qazvin Branch, Qazvin, Iran 2 Computer Group, Faculty of Engineering, Islamic Azad University, Karaj Branch, Karaj, Iran Abstract - Locating, labeling, and tracking players have broad application, especially for a team analyzer, and in the broadcast sports videos industry. In this paper we have presented a framework based on Gaussian Mixture Model and Kalman Filter for detecting and segmenting players in volleyball games. In this method, the players is detected by Gaussian Mixture Model and using Kalman filter the next location of every player in next frames is predicted. The proposed approach has the robust ability to track moving objects in consecutive frames under some kinds of difficulties such as rapid appearance changes caused by image noise and occlusion. Keywords: Human Motion; Tracking; Volleyball Athletes 1 Introduction Object tracking has received tremendous attention in the video processing community due to its numerous potential applications in video surveillance, human activity analysis, traffic monitoring, and so on. Recently the focus of the community is on Multi-Target Tracking (MTT) that requires determining the number as well as the dynamics of targets. However, due to a combination of several factors, reliable target tracking remains a challenging domain of research. The underlying difficulties behind multi-target tracking are founded mostly upon the apparent similarity of targets and then multi-target occlusion. MTT for targets whose appearance is distinctive is comparatively easier since it can be solved reasonably well by using multiple Independent single-target trackers. However, MTT for targets whose appearance is similar such as pedestrians in crowded scenes is a much more difficult task. In addition, with this MTT must deal with multi-target occlusion, namely, the tracker must separate the targets and assign them correct labels. Computational complexity also plays an important role, as the tracking should be real time. All these issues make target tracking or multi-object tracking a sturdy task even now. One the remarkable application is analyzing the players activity in different sports. Locating, labeling, and tracking players have broad application, especially for a team analyzer, and in the broadcast sports videos industry. There are several issues which make this kind of video analyzing a challenging task. Players occlusion, similar player s appearance, various numbers of players, unexpected camera motion, video blur and noise are the samples of this difficulty. Many algorithms have been proposed to deal with the multiple target-tracking (MTT) problem in which Particle Filter [1], [2], Joint Probabilistic Data Association Filter (JPDAF) [3], Multiple Hypothesis Tracking (MHT) [4], MCMC data association [5], [6] and track linking [7], [8], [9] are used. Several researchers also study the specific problem of labeling and tracking of players in sports video [2][10][11]. In [10], a clustering based trajectory matching method is proposed to track players in a soccer video. In this work, labeling of individuals is achieved through a supervised classification method. Comaniciu et al [11] build a track graph, and take the tracking problem as an inference problem in a Bayesian network. In both works, a multi-camera system is used to get a fixed, high-resolution and wide-field view of a soccer game. These pre-defined settings ensured a reliable background subtraction system, which are not very practical approach. This paper presents our approach in volleyball for a vision-based tracking system which can be used in practice by trainers and athletes. In this paper we have presented a framework based on GMM 1 and Kalman Filter for detecting and segmenting players in volleyball games. This paper is organized as follows: proposed framework and tracking algorithm are described in section 2. Evaluation is shown in section 3, and conclusions are made in section 4. 2 Proposed framework Object tracking is the problem of estimating the positions and other relevant information of moving objects in image sequences. This section describes the tracking method used in this paper. The proposed approach is confection of GMM and Kalman Filter algorithms. At first, we explained the GMM and Kalman Filter. 2.1 Gaussian mixture model Before we start with tracking of moving objects, we need to extract moving objects from the background. Background subtraction is one of the most common approaches for detecting foreground objects from video sequences. Recently, some statistical methods are used to extract change regions from the background. The Gaussian mixture model is the most representative background model [12]. The value of a pixel at time t in RGB or some other color space is denoted by. Pixel-based background subtraction involves decision if the pixel belongs to background (BG) or some foreground object (FG). Bayesian decision R is made by: 1 Gaussian Mixture Model

In a general case we don't know anything about the foreground objects that can be seen nor when and how often they will be present. Therefore we set P(FG) = P(BG) and assume uniform distribution for the foreground object appearance. The pixel belongs to the background if: The squared distance from the m-th component is calculated as: D m 2 (X (t) )= δ m T δ m /σ m 2. If there are no 'close' components a new component is generated with π M+1 =α, µ M+1 = X (t) and σ M+1 =σ 0. Where σ 0 is some appropriate initial variance. If the maximum number of components is reached, the component is discard by the smallest π M. The presented algorithm presents an on-line clustering algorithm. Usually, the intruding foreground objects will be represented by some additional clusters with small weights π M. Therefore, the background model is approximated by the first B largest clusters: (3) Where is a threshold value. We will refer to as the background model. The background model is estimated from a training set denoted as. The estimated model is denoted by and depends on the training set as denoted explicitly. In practice, the illumination in the scene could change gradually or suddenly. New object could be brought into the scene or a present object removed from it. In order to adapt to changes we can update the training set by adding new samples and discarding the old ones. So is chosen reasonable time period T and at time t, T is: T = {X (t),, X (t-t) }. For each new sample, the training data set T is updated and is re-estimated. However, among the samples from the recent history there could be some values that belong to the foreground objects and this estimate is denoted as P (X, BG+FG). GMM is used with M components: If the components are sorted to have descending weights π M we have: Where is a measure of the maximum portion of the data that can belong to foreground objects without influencing the background model. Flowchart of GMM algorithm is illustrated in figure1. The output of this process is shown in figure2. Where µ 1,, µ 2 are the estimates of the means and σ 1,,σ 2 are the estimates of the variances that describe the Gaussian components. The covariance matrices are assumed to be diagonal and the identity matrix I has proper dimensions. The mixing weights denoted by π m are non-negative and add up to one. Given a new data sample X (t) at time the recursive update equations are [13]: Where δ m = X (t) - µ m. Instead of the time interval, T that was mentioned above, here constant α=1/t describes an exponentially decaying envelope that is used to limit the influence of the old data. For a new sample the ownership ο m (t) is set to 1 for the 'close' component with largest π m and the others are set to zero. The sample is defined 'close' to a component if the Mahalanobis distance from the component is for example less than three standard deviations. Figure1. Flowchart of GMM algorithm.

Besides, we represented the rectangular window using center coordinate. Because the moving state changed little in the neighboring consecutive frames, we thought of the system as linear Gaussian one and the state parameters of Kalman filter were object location, its velocity, and the width of the rectangle which represent the width of a Human. The statespace representation of the tracker used in the Kalman filter is given in Eq. (10) (a) Original frame where, and are the predicted coordinates of the object and and are the velocities in the respective direction, represents the width of the Human rectangle, Δt represents the time interval of state correction and is the white Gaussian noise with diagonal variance Q. The predicted coordinates and dimensions of the rectangle are used to locate the Human in the present frame. When the Humans are distinguished, the Kalman vector is updated using the measurement equation as shown in Eq. (11). (b) Foreground detection Figure2. The execution of the GMM algorithm. (a)original image (b) the foreground detection. 2.2 Kalman filter A Kalman filter is applied to estimate the state of a linear system where the state is assumed to be distributed by a Gaussian. Kalman filtering is composed of two steps, prediction and correction [14, 15]. The Kalman filter is a recursive estimator. This means that only the estimated state form the previous time step and the current measurement are needed to compute the estimate for the current state. In contrast to batch estimation techniques [16], no history of observations and/or estimates is required. Kalman filter consists of five equations and it can be divide them into two groups: the update equations and the correct equations. The update equations are responsible for projecting forward the current state and error covariance estimates to obtain the priori estimates for the next time step. The correct equations are responsible for the feedback, in other words, for incorporating a new measurement into the priori estimate to obtain an improved posteriori estimate. The state variable and observation of Kalman filter in this paper are object locations. Where and are the measured coordinates, the value is the measure width of Human at time t+1 and is white Gaussian noise with diagonal variance R. The position, velocity and acceleration are updated based on the values obtained in the present frame and the data from the previous frame. 2.3 Schematic description of the tracking algorithm After extract moving objects from background in each video frame using the GMM algorithm, the kalman-prediction function is used to predict the next position players in the next frame. Since each frame may identify new objects, we need to specify the previous objects and objects that are formed newly. This detection is done using the data-association function. New objects in the current frame using the Add New-Hypotheses function, are added And after removal of noise using the Kalman-Update function is transferred to the next frame. This loop repeated for each frame and thus tracking algorithm is complete. Overall procedure of the proposed method is illustrated in figure3.

Figure 4. The sample of images of PVD dataset. 2.4 Dataset Figure3.The proposed tracking algorithm scheme. The recent approach of researchers in the field of image processing is about video image and one of the most important usages of moving images processing is analyzing sport plays.to support all the motions and rules of the play, we need a dataset that is not specified to a special some action.. Cause of the lack a dataset in volleyball, we presented in previous paper [17] a complete view depended volleyball video dataset under the uncontrolled conditions. The proposed dataset includes the complete volleyball play video sequences. We tried to prepare data in uncontrolled conditions and different viewpoints when planning dataset. Since the ball is in various heights, we need an angle in appropriate height, which keep the ball and players image when playing. Therefore, the recording angles are selected witch make possible the best condition to cover players and ball. The data is prepared in various sport gym to guarantee the data comprehensively. The images of the dataset are prepared from official Iran volleyball league matches. The sample of images of dataset sequences are shown in figure4. 3 Evaluation This section show the results obtained by the proposed tracking algorithm. Our approach is implemented using Window Vista operation system, and Matlab R2007b are used. In addition, the image sequences consists of JPG images with 320x240 resolutions per frame. The system evaluation is measured by calculating the detection rate using temporal differencing [18] and the proposed tracking system. For tracking based on the temporal differencing, the difference between two corresponding pixels in successive frames is performed. If the difference is zero or vary small, the corresponding pixel does not belong to any moving object. Otherwise, the pixel is belonging to a moving object. The detection rate is calculated by dividing the true Humans detected by the proposed system by the valid moving Humans appears partially or completely over the video frames. The experiments show that results obtained by the proposed tracking system are better than that obtained using other methods for tracking group of players partially occluded with each others as described in table1. Table1. The performance evaluation for the proposed system with other method. Evaluation Valid moving players (frame 30-80) tracking based on the temporal differencing The proposed system 300 300 Correct detection 148 221 Missed detection 152 79 Detection rate 49.33% 73.66%

4 Conclusion Multiple-target tracking is a very active field nowadays due to its wide practical applicability in video processing. While talking about Multiple-target tracking, multi-target occlusion is a common problem that needs to be addressed. In this paper an innovative methodology for multi-players tracking in volleyball play has been proposed based on GMM and Kalman filter. Main contribution to this work is to overcome the problem of partial occlusion. The missed detection by the proposed system referred some complex situations due to full-occluded objects with each other since the proposed system deals only on the partial occlusion. Another reason for the missed detection by the proposed system is the image resolution. In conclusion, this fast and efficient algorithm can detect players in analyzing a Volleyball game. Figure5 demonstrates the ability of the proposed method in tracking and labeling the athletes. In conclusion, this fast and efficient algorithm can detect players in analyzing a Volleyball game. Figure5. The players are successfully tracked by using the proposed system, red color rectangle show the location where the object detected.

5 References [1] J. Sullivan and S. Carlsson, Tracking and Labeling of Interacting MultipleTargets, In Proc. European Conf. on Computer Vision (ECCV), 2006. [2] T. Mauthner1 and C. Koch, Visual Tracking of Athletes in Beach Volleyball Using a Single Camera, International Journal of Computer Science in Sport,2008. [3] M. Isard and J. MacCormick, BraMBLe: A Bayesian Multiple-Blob Tracker, In Proc. IEEE Int l Conf. on Computer Vision (ICCV), 2001. [15] M. Grewal and A. Andrews, Kalman Filtering Theory and Practice using MATLAB, second edition, John Wiley & Sons, Inc., 2001. [16] J. Świątek, Parameter Estimation of Systems Described by the Relation with Noisy Observations, Journal of Universal Computer Science, vol. 13, no. 2, 199-208, 2007. [17] H.salehifar and A.bastanfard, A Complete View Depended Volleyball Video Dataset Under The Uncontrolled Conditions, accepted in 2011. [18] H.salehifar and A.bastanfard, A Fast Algorithm for Detecting, Labeling and Tracking Volleyball Players in Sport Videos, 2011 3rd International Conference on Signal Acquisition and Processing (ICSAP 2011) [4] K. Okuma, A. Taleghani, N. de Freitas, J. J. Little, and D. G. Lowe, A Boosted Particle Filter: Multitarget Detection and Tracking, In Proc. European Conf. on Computer Vision (ECCV), 2004. [5] D. Reid, An Algorithm For Tracking Multiple Targets, IEEE Transaction on Automatic Control, vol. 24, no. 6, pp. 843 854, December 1979. [6] S. Oh, S. Russell, and S. Sastry, Markov Chain Monte Carlo Data Association for Multiple-Target Tracking, University of California, Berkeley, Technical Report UCB//ERL M05/19, June 2005. [7] Q. Yu, G. Medioni, Map-Enhanced Detection and Tracking from a Moving Platform with Local and Global Data Association, IEEE Workshop on Motion and Video Computing (WMVC'07), 2007. [8] A.G.A.PereraC.Srinivas,A.Hoogs,G.Brooksby,Wensheng Hu, Multi-Object Tracking Through Simultaneous Long Occlusions and Split-Merge Conditions, IEEE Int l Conf. on Computer Vision and Pattern Recognition (CVPR), 2006. [9] C. Stauffer, Estimating Track Sources and Sinks, In Proc. IEEE Workshop on Event Mining in Video, 2003. [10] P. Nillius, J. Sullivan and S. Carlsson, Multi-Target Tracking Linking Identities using Bayesian Network Inference, In Proc. IEEE Int l Conf. on Computer Vision and Pattern Recognition (CVPR), 2006. [11] D. Comaniciu, P. Meer, Mean shift: A Robust Approach Toward Feature Space Analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence 24 (5) (2002) 603 619. [12] C. Stauffer and W. Grimson, Adaptive Background Mixture Models For Real-Time Tracking, In Proceedings CVPR,pp. 246.252, 1999. [13] Z.Zivkovic and F.van der Heijden, Recursive Unsupervised Learning of Finite Mixture Models. IEEE Trans. on PAMI,vol.26., no.5, 2004. [14] B. Shalom and Y. Foreman, Tracking and Data Association. Academic Press Inc. 1988.