Color Segmentation Based Depth Image Filtering
|
|
|
- Joel Toby Richardson
- 9 years ago
- Views:
Transcription
1 Color Segmentation Based Depth Image Filtering Michael Schmeing and Xiaoyi Jiang Department of Computer Science, University of Münster Einsteinstraße 62, Münster, Germany, {m.schmeing Abstract. We present a novel enhancement method that addresses the problem of corrupted edge information in depth maps. Corrupted depth information manifests itself in zigzag edges instead of straight ones. We extract the depth information from an associated color stream and use this information to enhance the original depth map. Besides the visual results, a quantitative analysis is conducted to prove the capabilities of our approach. 1 Introduction Video-plus-depth is an important 3D scene representation format [1]. It consists of a color stream describing the texture of the scene and an associated depth stream describing for each pixel its distance to the camera. From this representation, arbitrary new views can be generated to enable stereo [1, 2], multi view [3] or free viewpoint video [4]. An important presumption for high quality rendering is a high quality depth map. However, there exists at the moment no depth map generation technique that is able to produce a perfect depth map, i.e., a depth map that is free of artifacts, holes, which is temporally stable and has video resolution all together. Different depth map enhancement methods have evolved to address different aspects of depth map corruption [5 9]. In this paper, we propose a novel enhancement algorithm that takes associated color information into account to enhance the quality of edges in a depth map. We use depth maps generated by the Microsoft Kinect depth camera for our approach, though our algorithm is not restricted to depth maps generated with this camera. The Kinect camera is a structured light depth sensor which suffers from quite poor edge reproduction. Figure 1 shows an example. We use edge information found in the corresponding color stream via a superpixel segmentation and compute a new representative depth map D r which stores robust edge information corresponding to the color
2 2 (a) (b) (c) (d) (e) (f) Fig. 1. (a) Example color image. (b) Associated depth map. (c) Magnification of (a). (d) Magnification of (b). (e) The green line marks the edge from the color stream. (f) Result of our approach. stream. D r is then used to enhance the source depth map D. A quantitative analysis shows that our method outperforms common depth enhancement algorithms in terms of edge restoration. The rest of our paper is organized as follows. Section 2 discusses previous work in this field. We describe our algorithm in Section 3 and present some results in Section 4. Section 5 concludes this paper. 2 Related Work There exist several methods that enhance depth maps. In [5], depth maps are filtered using a temporal median filter. Holes are filled using a spatial median. The work reported in [6] addresses special issues of the Microsoft Kinect depth camera. Holes that occur due to the offset between color and depth camera are closed by background extraction and other holes by row-wise linear interpolation. No temporal processing is applied. In [7], the authors port simple filters like Gaussian-weighted hole filling and temporal smoothing as well as edge-preserving denoising on the GPU and achieve very high framerates of 100fps. The framework is modal and not dependent on a certain depth camera technology. The method looks promising and is said to be applicable to dynamic scenes although no special evaluation was given in this case.
3 3 3 Our Method In our algorithm we address the common problem of corrupted edge information in depth images. Figure 1 shows an example. In the color image, the edge of the foreground object (the shelf) is a straight line, whereas it is corrupted in the depth stream. The corrupted edges in the depth image do not correspond with the edges of actual objects. When using this depth map for view synthesis (e.g. Depth Image Based Rendering [1]), artifacts will occur. Therefore, it is important to have edges in the depth map that correspond closely to the edges of the objects in the scene. Our method works for scenes in the video-plus-depth format. We assume the depth stream to have the same resolution as the video stream. Depth upsampling [10], which is another field of depth video enhancement, can be applied as a preprocessing step if necessary. There are two kinds of possible edge defects in a depth map. First, the edge is not straight but rather forms a zigzag line or consists of other visible steps. This can happen through the nature of the sensor (like the Kinect sensor) or through inadequate depth upsampling. The second defect is global misalignment, i.e. the complete edge of the depth map is shifted with respect to the edge in the color image. This defect can arise from insufficient registration between video camera and depth camera. Let I be a frame of the video sequence and D the corresponding depth map. Our goal is to process D in a way that the edges in D align with the edges (of objects) in I. As a first step, we perform normalized convolution [9] to fill holes in the depth map. A hole pixel x is filled with a weighted sum of the depth values of its non-hole neighboring pixels: D nc x (x) = N D(x)g(x, x ) x x N g(x, (1) x x ) where Nx is the set of neighboring pixels of x that have a valid depth value and g(x, x ) a Gaussian function with parameter σ: g(x, x ) = exp ( x x 2 ). (2) In the next step, we identify edges in the color image. Instead of finding edges directly with common methods like the Canny operator, we use the implicit edge information given by a segmentation, more precisely, an over-segmentation of the color image. While a normal segmentation divides the image into meaningful areas (usually guided by edges), an over-segmentation further divides these areas. Those areas can nevertheless be recovered by combining areas of the oversegmentation. Particularly, the over-segmentation respects the edges of objects in the color image. With an over-segmentation of the depth map, we can compute representative depth values for each segment, for example by taking the median or the average σ 2
4 4 (a) (b) (c) (d) Fig. 2. (a) An example over-segmentation. (b) A representative depth map D r with marked magnification region. (c) Cutout of original depth with projected segmentation. (d) Cutout of our method with projected segmentation: Depth and color edges align. of all the depth values of pixels in this segment. These representative values are more robust to noise than one single pixel. The representative depth map D r will be generated by filling each segment with its representative depth value. Since the segmentation respects the color image edges, the edges in the representative depth map will respect these edges, too. Using D r, we can later discard pixels as corrupted that are too dissimilar to the representative depth value of their segment. See Figure 2(c) for an example: In the upper part of the main light region, the dark depth region overlaps into the light region which means that the depth edge and the color edge (i.e., the segment border) do not correspond. Figure 2(d) shows the corrected edge produced by our algorithm. We tried different superpixel-segmentation methods including Mean-Shift and the method of [11] but in the end we used a simple watershed segmentation [12] because it delivers sufficient results for our purpose at a very high speed (more than 30fps at resolution. We also tried different marker distributions for the watershed segmentation: randomly, on a regular grid, and skewed on a regular grid (which means that the markers of two consecutive rows do not lie in the same column but are slightly shifted). We tested these distributions with the additional constraint that markers are not placed on edges in the color image. The best results where obtained with the regular, skewed grid and no additional constraint.
5 5 (a) (b) (c) (d) Fig. 3. (a) Example frame of the unfiltered depth generated by the Kinect depth sensor. (b) Frame filtered with method of Brednikov et al. [6]. (c) Frame filtered with method of Wasza et al. [7]. (d) Our method. Figure 2(a) shows an example segmentation. The quality of the segmentation can be degraded by a high amount of image noise, so we first apply a bilateral filter to reduce the noise while simultaneously protect edges in the color image. The filtered color value I(p) of a pixel p is given by: q N I(p) = K s( p q )K c ( p q )I(q) q N K (3) s( p q )K c ( p q ) with K s and K c being Kernel functions, typically Gaussian distributions. The obtained over-segmentation is then projected into the depth stream. In an ideal depth map, the edges of the over-segmentation would coincide with the edges in the depth map. Figure 2(c) shows what happens in real world depth maps (taken from a Kinect): Some areas overlap into neighboring segments. Using a sufficient segment size, though, we can ensure that at least half of the depth pixels in a segment have correct depth (this is clearly the case in Figure 2(c)). We build the representative depth map D r from this segmentation by computing for each segment the median depth value: D r (x, y) = {d k : (x, y) S k, d k = median (x,y ) S k d(x, y )}
6 6 Fig. 4. Sample color and depth frame from the edge test sequence. where S k is a segment in the color image. Figure 2(b) shows an example representative depth map. This depth map corresponds very well with the edges in the color image but suffers of course from the fact that it cannot represent smooth depth transitions but rather consists of discrete patches. The final filtered depth map D f uses the depth values of D r only, if D exhibits corrupted depth values. D f is obtained in the following way: { D f D r (x, y) if D(x, y) D r (x, y) > θ (x, y) = (4) D(x, y) otherwise with θ being a threshold. 4 Experimental Results 4.1 Qualitative Results Figure 3 shows some results of our method compared with other methods. Berdnikov et al.[6] address special issues of the Microsoft Kinect depth camera. Holes that occur due to the offset between color and depth camera are closed by background extraction and other holes by row-wise linear interpolation. The focus of Wasza et al.[7] lies on porting simple filters like Gaussian-weighted hole filling [9] and temporal smoothing as well as edge-preserving denoising on the GPU to achieve very high frame rates. Our method closes all holes and in contrast to other method, it can restore straight edges. This behavior can also be seen in Figure 2(c) and (d). 4.2 Quantitative Results It is difficult to obtain quantitative quality results for depth filtering algorithms. This is due to the fact that usually no ground truth depth map is available to compare the filtered depth map with. However, we designed a test method to assess the ability of our algorithm to restore edges.
7 7 1,00 0,98 0,95 Rand index 0,93 0,90 0,88 0,85 Our method Brednikov et al.[7] Knutsson et al.[10] Wasza et al.[8] Frames Fig. 5. Rand index values for different depth filtering algorithms. A value near to 1 means a very good correlation with the color stream. Recall Figure 1(e) for an example of a corrupted edge in the original depth map. We see that the edge does not correspond very well with the edge in the color stream (green line). Our algorithm, see Figure 1(f), performs way better and we want to quantify this result. To do this, we recorded a test sequence edge test of 71 frames (color and depth, see Figure 4) with very simple geometry: It depicts a homogeneous (in terms of depth values) foreground object with a straight edge in front of a homogeneous background. The foreground object also has different color texture than the background. In this situation, we can define two clusterings that divide the scene into foreground and background : C D is a 2-means clustering of the depth map and C C is a 2-means clustering of the color stream. If the depth map is aligned with the color stream and does not exhibit cracks or other corruption, then clustering C D and C C should be the same. To determine how similar the clustering C D and C C are, we compute the Rand index[13]. The Rand index R(, ) [0,... 1] is a very popular measure to describes how similar two clusterings are. A Rand index of 1 means they are the same whereas 0 means they are completely different. In our situation a high Rand index indicates a very good correlation between the color stream and the (filtered) depth stream. Figure 5 shows the Rand indices for all frames of the test sequence. We can see that our algorithm clearly outperforms all other algorithms.
8 8 5 Conclusion We have presented a method to increase the spatial accuracy of depth maps using edge information of the associated color stream. Our method can reliably enhance corrupted edges in the depth stream and outperforms common algorithms. Future work aims at the inclusion of inter-frame information to enforce time-consistency and further reduce edge artifacts. References 1. Fehn, C., de la Barre, R., Pastoor, R.S.: Interactive 3-DTV-Concepts and Key Technologies. Proceedings of the IEEE 94 (2006) Schmeing, M., Jiang, X.: Depth Image Based Rendering: A Faithful Approach for the Disocclusion Problem. In: Proc. 3DTV-Conf.: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON). (2010) Zitnick, C.L., Kang, S.B., Uyttendaele, M., Winder, S., Szeliski, R.: High-quality Video View Interpolation using a Layered Representation. ACM Transactions on Graphics 23(3) (2004) Smolic, A., Mueller, K., Merkle, P., Fehn, C., Kauff, P., Eisert, P., Wiegand, T.: 3D Video and Free Viewpoint Video - Technologies, Applications and MPEG Standards. In: 2006 IEEE International Conference on Multimedia and Expo. (2006) Matyunin, S., Vatolin, D., Berdnikov, Y., Smirnov, M.: Temporal Filtering for Depth Maps generated by Kinect Depth Camera. In: Proc. 3DTV Conf.: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON). (2011) Berdnikov, Y., Vatolin, D.: Real-time Depth Map Occlusion Filling and Scene Background Restoration for Projected-Pattern-based Depth Camera. In: 21th International Conference on Computer Graphics and Vision (GraphiCon2011). (2011) Wasza, J., Bauer, S., Hornegger, J.: Real-time Preprocessing for Dense 3-D Range Imaging on the GPU: Defect Interpolation, Bilateral Temporal Averaging and Guided Filtering. In: IEEE International Conference on Computer Vision Workshops (ICCV Workshops). (2011) Min, D., Lu, J., Do, M.: Depth Video Enhancement Based on Weighted Mode Filtering. IEEE Transactions on Image Processing 21(3) (2012) Knutsson, H., Westin, C.F.: Normalized and Differential Convolution. In: 1993 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (1993) Diebel, J., Thrun, S.: An Application of Markov Random Fields to Range Sensing. In: Advances in Neural Information Processing Systems 18. MIT Press, Cambridge, MA (2006) Liu, M.Y., Tuzel, O., Ramalingam, S., Chellappa, R.: Entropy Rate Superpixel Segmentation. In: Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on. (2011) Beucher, S., Lantuejoul, C.: Use of Watersheds in Contour Detection. In: International Workshop on Image Processing: Real-time Edge and Motion Detection/Estimation, Rennes, France. (1979) 13. Rand, W.M.: Objective Criteria for the Evaluation of Clustering Methods. Journal of the American Statistical Association 66(336) (1971) pp
Vision based Vehicle Tracking using a high angle camera
Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu [email protected] [email protected] Abstract A vehicle tracking and grouping algorithm is presented in this work
High Performance GPU-based Preprocessing for Time-of-Flight Imaging in Medical Applications
High Performance GPU-based Preprocessing for Time-of-Flight Imaging in Medical Applications Jakob Wasza 1, Sebastian Bauer 1, Joachim Hornegger 1,2 1 Pattern Recognition Lab, Friedrich-Alexander University
High Quality Image Magnification using Cross-Scale Self-Similarity
High Quality Image Magnification using Cross-Scale Self-Similarity André Gooßen 1, Arne Ehlers 1, Thomas Pralow 2, Rolf-Rainer Grigat 1 1 Vision Systems, Hamburg University of Technology, D-21079 Hamburg
A Prototype For Eye-Gaze Corrected
A Prototype For Eye-Gaze Corrected Video Chat on Graphics Hardware Maarten Dumont, Steven Maesen, Sammy Rogmans and Philippe Bekaert Introduction Traditional webcam video chat: No eye contact. No extensive
Automatic Labeling of Lane Markings for Autonomous Vehicles
Automatic Labeling of Lane Markings for Autonomous Vehicles Jeffrey Kiske Stanford University 450 Serra Mall, Stanford, CA 94305 [email protected] 1. Introduction As autonomous vehicles become more popular,
A NEW SUPER RESOLUTION TECHNIQUE FOR RANGE DATA. Valeria Garro, Pietro Zanuttigh, Guido M. Cortelazzo. University of Padova, Italy
A NEW SUPER RESOLUTION TECHNIQUE FOR RANGE DATA Valeria Garro, Pietro Zanuttigh, Guido M. Cortelazzo University of Padova, Italy ABSTRACT Current Time-of-Flight matrix sensors allow for the acquisition
Mean-Shift Tracking with Random Sampling
1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of
Template-based Eye and Mouth Detection for 3D Video Conferencing
Template-based Eye and Mouth Detection for 3D Video Conferencing Jürgen Rurainsky and Peter Eisert Fraunhofer Institute for Telecommunications - Heinrich-Hertz-Institute, Image Processing Department, Einsteinufer
Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences
Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences Byoung-moon You 1, Kyung-tack Jung 2, Sang-kook Kim 2, and Doo-sung Hwang 3 1 L&Y Vision Technologies, Inc., Daejeon,
Optical Flow. Shenlong Wang CSC2541 Course Presentation Feb 2, 2016
Optical Flow Shenlong Wang CSC2541 Course Presentation Feb 2, 2016 Outline Introduction Variation Models Feature Matching Methods End-to-end Learning based Methods Discussion Optical Flow Goal: Pixel motion
Very Low Frame-Rate Video Streaming For Face-to-Face Teleconference
Very Low Frame-Rate Video Streaming For Face-to-Face Teleconference Jue Wang, Michael F. Cohen Department of Electrical Engineering, University of Washington Microsoft Research Abstract Providing the best
Adaptive Block Truncation Filter for MVC Depth Image Enhancement
2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) Adaptive Block Truncation Filter for MVC Depth Image Enhancement Xuyuan Xu 1, Lai-Man Po 1, Chun-Ho Cheung 2, Litong
Quality improving techniques for free-viewpoint DIBR
Quality improving techniques for free-viewpoint DIBR Luat Do a, Sveta Zinger a and Peter H.N. de With a,b a Eindhoven University of Technology, P.O. Box 513, 5600 MB Eindhoven, Netherlands b Cyclomedia
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
A Noise-Aware Filter for Real-Time Depth Upsampling
A Noise-Aware Filter for Real-Time Depth Upsampling Derek Chan Hylke Buisman Christian Theobalt Sebastian Thrun Stanford University, USA Abstract. A new generation of active 3D range sensors, such as time-of-flight
REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4
REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4 S. Heymann, A. Smolic, K. Mueller, Y. Guo, J. Rurainsky, P. Eisert, T. Wiegand Fraunhofer Institute
Feature Tracking and Optical Flow
02/09/12 Feature Tracking and Optical Flow Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Many slides adapted from Lana Lazebnik, Silvio Saverse, who in turn adapted slides from Steve
jorge s. marques image processing
image processing images images: what are they? what is shown in this image? What is this? what is an image images describe the evolution of physical variables (intensity, color, reflectance, condutivity)
Automatic Traffic Estimation Using Image Processing
Automatic Traffic Estimation Using Image Processing Pejman Niksaz Science &Research Branch, Azad University of Yazd, Iran [email protected] Abstract As we know the population of city and number of
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
Speed Performance Improvement of Vehicle Blob Tracking System
Speed Performance Improvement of Vehicle Blob Tracking System Sung Chun Lee and Ram Nevatia University of Southern California, Los Angeles, CA 90089, USA [email protected], [email protected] Abstract. A speed
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS Norbert Buch 1, Mark Cracknell 2, James Orwell 1 and Sergio A. Velastin 1 1. Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE,
Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition
Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition 1. Image Pre-Processing - Pixel Brightness Transformation - Geometric Transformation - Image Denoising 1 1. Image Pre-Processing
Real time vehicle detection and tracking on multiple lanes
Real time vehicle detection and tracking on multiple lanes Kristian Kovačić Edouard Ivanjko Hrvoje Gold Department of Intelligent Transportation Systems Faculty of Transport and Traffic Sciences University
How does the Kinect work? John MacCormick
How does the Kinect work? John MacCormick Xbox demo Laptop demo The Kinect uses structured light and machine learning Inferring body position is a two-stage process: first compute a depth map (using structured
The Visual Internet of Things System Based on Depth Camera
The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed
Two-Frame Motion Estimation Based on Polynomial Expansion
Two-Frame Motion Estimation Based on Polynomial Expansion Gunnar Farnebäck Computer Vision Laboratory, Linköping University, SE-581 83 Linköping, Sweden [email protected] http://www.isy.liu.se/cvl/ Abstract.
Edge tracking for motion segmentation and depth ordering
Edge tracking for motion segmentation and depth ordering P. Smith, T. Drummond and R. Cipolla Department of Engineering University of Cambridge Cambridge CB2 1PZ,UK {pas1001 twd20 cipolla}@eng.cam.ac.uk
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
A ROBUST BACKGROUND REMOVAL ALGORTIHMS
A ROBUST BACKGROUND REMOVAL ALGORTIHMS USING FUZZY C-MEANS CLUSTERING ABSTRACT S.Lakshmi 1 and Dr.V.Sankaranarayanan 2 1 Jeppiaar Engineering College, Chennai [email protected] 2 Director, Crescent
Accurate and robust image superresolution by neural processing of local image representations
Accurate and robust image superresolution by neural processing of local image representations Carlos Miravet 1,2 and Francisco B. Rodríguez 1 1 Grupo de Neurocomputación Biológica (GNB), Escuela Politécnica
REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING
REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING Ms.PALLAVI CHOUDEKAR Ajay Kumar Garg Engineering College, Department of electrical and electronics Ms.SAYANTI BANERJEE Ajay Kumar Garg Engineering
Low-resolution Character Recognition by Video-based Super-resolution
2009 10th International Conference on Document Analysis and Recognition Low-resolution Character Recognition by Video-based Super-resolution Ataru Ohkura 1, Daisuke Deguchi 1, Tomokazu Takahashi 2, Ichiro
CS 534: Computer Vision 3D Model-based recognition
CS 534: Computer Vision 3D Model-based recognition Ahmed Elgammal Dept of Computer Science CS 534 3D Model-based Vision - 1 High Level Vision Object Recognition: What it means? Two main recognition tasks:!
Tracking Moving Objects In Video Sequences Yiwei Wang, Robert E. Van Dyck, and John F. Doherty Department of Electrical Engineering The Pennsylvania State University University Park, PA16802 Abstract{Object
A Novel Hole filling method based on Projection onto Convex Set in DIBR
3rd International Conference on Multimedia Technology ICMT 2013) A Novel Hole filling method based on Projection onto Convex Set in DIBR Weiquan Wang1 and Yingyun Yang2 and Qian Liang3 Abstract. Depth
Tracking and Recognition in Sports Videos
Tracking and Recognition in Sports Videos Mustafa Teke a, Masoud Sattari b a Graduate School of Informatics, Middle East Technical University, Ankara, Turkey [email protected] b Department of Computer
Segmentation & Clustering
EECS 442 Computer vision Segmentation & Clustering Segmentation in human vision K-mean clustering Mean-shift Graph-cut Reading: Chapters 14 [FP] Some slides of this lectures are courtesy of prof F. Li,
Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection
CSED703R: Deep Learning for Visual Recognition (206S) Lecture 6: CNNs for Detection, Tracking, and Segmentation Object Detection Bohyung Han Computer Vision Lab. [email protected] 2 3 Object detection
A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA
A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA N. Zarrinpanjeh a, F. Dadrassjavan b, H. Fattahi c * a Islamic Azad University of Qazvin - [email protected]
AUTOMATIC CROWD ANALYSIS FROM VERY HIGH RESOLUTION SATELLITE IMAGES
In: Stilla U et al (Eds) PIA11. International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences 38 (3/W22) AUTOMATIC CROWD ANALYSIS FROM VERY HIGH RESOLUTION SATELLITE IMAGES
How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm
IJSTE - International Journal of Science Technology & Engineering Volume 1 Issue 10 April 2015 ISSN (online): 2349-784X Image Estimation Algorithm for Out of Focus and Blur Images to Retrieve the Barcode
Virtual Fitting by Single-shot Body Shape Estimation
Virtual Fitting by Single-shot Body Shape Estimation Masahiro Sekine* 1 Kaoru Sugita 1 Frank Perbet 2 Björn Stenger 2 Masashi Nishiyama 1 1 Corporate Research & Development Center, Toshiba Corporation,
CHAPTER 6 TEXTURE ANIMATION
CHAPTER 6 TEXTURE ANIMATION 6.1. INTRODUCTION Animation is the creating of a timed sequence or series of graphic images or frames together to give the appearance of continuous movement. A collection of
Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang
Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform
LIBSVX and Video Segmentation Evaluation
CVPR 14 Tutorial! 1! LIBSVX and Video Segmentation Evaluation Chenliang Xu and Jason J. Corso!! Computer Science and Engineering! SUNY at Buffalo!! Electrical Engineering and Computer Science! University
EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines
EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation
A Method of Caption Detection in News Video
3rd International Conference on Multimedia Technology(ICMT 3) A Method of Caption Detection in News Video He HUANG, Ping SHI Abstract. News video is one of the most important media for people to get information.
ROBUST COLOR JOINT MULTI-FRAME DEMOSAICING AND SUPER- RESOLUTION ALGORITHM
ROBUST COLOR JOINT MULTI-FRAME DEMOSAICING AND SUPER- RESOLUTION ALGORITHM Theodor Heinze Hasso-Plattner-Institute for Software Systems Engineering Prof.-Dr.-Helmert-Str. 2-3, 14482 Potsdam, Germany [email protected]
A method of generating free-route walk-through animation using vehicle-borne video image
A method of generating free-route walk-through animation using vehicle-borne video image Jun KUMAGAI* Ryosuke SHIBASAKI* *Graduate School of Frontier Sciences, Shibasaki lab. University of Tokyo 4-6-1
Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur
Module 8 VIDEO CODING STANDARDS Version ECE IIT, Kharagpur Lesson H. andh.3 Standards Version ECE IIT, Kharagpur Lesson Objectives At the end of this lesson the students should be able to :. State the
Simultaneous Gamma Correction and Registration in the Frequency Domain
Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong [email protected] William Bishop [email protected] Department of Electrical and Computer Engineering University
HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT
International Journal of Scientific and Research Publications, Volume 2, Issue 4, April 2012 1 HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT Akhil Gupta, Akash Rathi, Dr. Y. Radhika
ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES
ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES B. Sirmacek, R. Lindenbergh Delft University of Technology, Department of Geoscience and Remote Sensing, Stevinweg
A System for Capturing High Resolution Images
A System for Capturing High Resolution Images G.Voyatzis, G.Angelopoulos, A.Bors and I.Pitas Department of Informatics University of Thessaloniki BOX 451, 54006 Thessaloniki GREECE e-mail: [email protected]
Build Panoramas on Android Phones
Build Panoramas on Android Phones Tao Chu, Bowen Meng, Zixuan Wang Stanford University, Stanford CA Abstract The purpose of this work is to implement panorama stitching from a sequence of photos taken
An Energy-Based Vehicle Tracking System using Principal Component Analysis and Unsupervised ART Network
Proceedings of the 8th WSEAS Int. Conf. on ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING & DATA BASES (AIKED '9) ISSN: 179-519 435 ISBN: 978-96-474-51-2 An Energy-Based Vehicle Tracking System using Principal
QUALITY TESTING OF WATER PUMP PULLEY USING IMAGE PROCESSING
QUALITY TESTING OF WATER PUMP PULLEY USING IMAGE PROCESSING MRS. A H. TIRMARE 1, MS.R.N.KULKARNI 2, MR. A R. BHOSALE 3 MR. C.S. MORE 4 MR.A.G.NIMBALKAR 5 1, 2 Assistant professor Bharati Vidyapeeth s college
Tracking performance evaluation on PETS 2015 Challenge datasets
Tracking performance evaluation on PETS 2015 Challenge datasets Tahir Nawaz, Jonathan Boyle, Longzhen Li and James Ferryman Computational Vision Group, School of Systems Engineering University of Reading,
An automatic system for sports analytics in multi-camera tennis videos
Workshop on Activity Monitoring by Multiple Distributed Sensing (AMMDS) in conjunction with 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance An automatic system for
Colour Image Segmentation Technique for Screen Printing
60 R.U. Hewage and D.U.J. Sonnadara Department of Physics, University of Colombo, Sri Lanka ABSTRACT Screen-printing is an industry with a large number of applications ranging from printing mobile phone
Canny Edge Detection
Canny Edge Detection 09gr820 March 23, 2009 1 Introduction The purpose of edge detection in general is to significantly reduce the amount of data in an image, while preserving the structural properties
Machine vision systems - 2
Machine vision systems Problem definition Image acquisition Image segmentation Connected component analysis Machine vision systems - 1 Problem definition Design a vision system to see a flat world Page
3D Model based Object Class Detection in An Arbitrary View
3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/
Removing Moving Objects from Point Cloud Scenes
1 Removing Moving Objects from Point Cloud Scenes Krystof Litomisky [email protected] Abstract. Three-dimensional simultaneous localization and mapping is a topic of significant interest in the research
Service-Oriented Visualization of Virtual 3D City Models
Service-Oriented Visualization of Virtual 3D City Models Authors: Jan Klimke, Jürgen Döllner Computer Graphics Systems Division Hasso-Plattner-Institut, University of Potsdam, Germany http://www.hpi3d.de
Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet
DICTA2002: Digital Image Computing Techniques and Applications, 21--22 January 2002, Melbourne, Australia Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet K. Ramkishor James. P. Mammen
Spatio-Temporally Coherent 3D Animation Reconstruction from Multi-view RGB-D Images using Landmark Sampling
, March 13-15, 2013, Hong Kong Spatio-Temporally Coherent 3D Animation Reconstruction from Multi-view RGB-D Images using Landmark Sampling Naveed Ahmed Abstract We present a system for spatio-temporally
EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM
EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM Amol Ambardekar, Mircea Nicolescu, and George Bebis Department of Computer Science and Engineering University
Open-Set Face Recognition-based Visitor Interface System
Open-Set Face Recognition-based Visitor Interface System Hazım K. Ekenel, Lorant Szasz-Toth, and Rainer Stiefelhagen Computer Science Department, Universität Karlsruhe (TH) Am Fasanengarten 5, Karlsruhe
Kapitel 12. 3D Television Based on a Stereoscopic View Synthesis Approach
Kapitel 12 3D Television Based on a Stereoscopic View Synthesis Approach DIBR (Depth-Image-Based Rendering) approach 3D content generation DIBR from non-video-rate depth stream Autostereoscopic displays
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
A Survey of Video Processing with Field Programmable Gate Arrays (FGPA)
A Survey of Video Processing with Field Programmable Gate Arrays (FGPA) Heather Garnell Abstract This paper is a high-level, survey of recent developments in the area of video processing using reconfigurable
BCC Multi Stripe Wipe
BCC Multi Stripe Wipe The BCC Multi Stripe Wipe is a similar to a Horizontal or Vertical Blind wipe. It offers extensive controls to randomize the stripes parameters. The following example shows a Multi
Convolution. 1D Formula: 2D Formula: Example on the web: http://www.jhu.edu/~signals/convolve/
Basic Filters (7) Convolution/correlation/Linear filtering Gaussian filters Smoothing and noise reduction First derivatives of Gaussian Second derivative of Gaussian: Laplacian Oriented Gaussian filters
Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006
Practical Tour of Visual tracking David Fleet and Allan Jepson January, 2006 Designing a Visual Tracker: What is the state? pose and motion (position, velocity, acceleration, ) shape (size, deformation,
Object tracking & Motion detection in video sequences
Introduction Object tracking & Motion detection in video sequences Recomended link: http://cmp.felk.cvut.cz/~hlavac/teachpresen/17compvision3d/41imagemotion.pdf 1 2 DYNAMIC SCENE ANALYSIS The input to
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 2, February 2014,
High Quality Image Deblurring Panchromatic Pixels
High Quality Image Deblurring Panchromatic Pixels ACM Transaction on Graphics vol. 31, No. 5, 2012 Sen Wang, Tingbo Hou, John Border, Hong Qin, and Rodney Miller Presented by Bong-Seok Choi School of Electrical
Adaptive Image Warping Using Disparity Map-Based Rendering For Hole Prevention In 3D View Synthesis
Adaptive Image Warping Using Disparity Map-Based Rendering For Hole Prevention In 3D View Synthesis Yoganandhan.S Lakshmanan.V VinithaSubashini.B Department Of Computer Science and Engineering [email protected]
Single Depth Image Super Resolution and Denoising Using Coupled Dictionary Learning with Local Constraints and Shock Filtering
Single Depth Image Super Resolution and Denoising Using Coupled Dictionary Learning with Local Constraints and Shock Filtering Jun Xie 1, Cheng-Chuan Chou 2, Rogerio Feris 3, Ming-Ting Sun 1 1 University
Fast and Robust Variational Optical Flow for High-Resolution Images using SLIC Superpixels
Fast and Robust Variational Optical Flow for High-Resolution Images using SLIC Superpixels Simon Donné, Jan Aelterman, Bart Goossens, and Wilfried Philips iminds-ipi-ugent {Simon.Donne,Jan.Aelterman,Bart.Goossens,philips}@telin.ugent.be
Key Terms 2D-to-3D-conversion, depth image-based rendering, hole-filling, image warping, optimization.
Disparity-Map-Based Rendering for Hole Prevention Mr. S. K. Somasundaram, Department of Computer Science & Engineering, Associate Professor, PSNA College of Engineering & Technology, Anna University,Chennai,
Machine Learning for Medical Image Analysis. A. Criminisi & the InnerEye team @ MSRC
Machine Learning for Medical Image Analysis A. Criminisi & the InnerEye team @ MSRC Medical image analysis the goal Automatic, semantic analysis and quantification of what observed in medical scans Brain
Spatio-Temporal Nonparametric Background Modeling and Subtraction
Spatio-Temporal Nonparametric Background Modeling and Subtraction Raviteja Vemulapalli and R. Aravind Department of Electrical engineering Indian Institute of Technology, Madras Background subtraction
3D Interactive Information Visualization: Guidelines from experience and analysis of applications
3D Interactive Information Visualization: Guidelines from experience and analysis of applications Richard Brath Visible Decisions Inc., 200 Front St. W. #2203, Toronto, Canada, [email protected] 1. EXPERT
Efficient Background Subtraction and Shadow Removal Technique for Multiple Human object Tracking
ISSN: 2321-7782 (Online) Volume 1, Issue 7, December 2013 International Journal of Advance Research in Computer Science and Management Studies Research Paper Available online at: www.ijarcsms.com Efficient
Segmentation of building models from dense 3D point-clouds
Segmentation of building models from dense 3D point-clouds Joachim Bauer, Konrad Karner, Konrad Schindler, Andreas Klaus, Christopher Zach VRVis Research Center for Virtual Reality and Visualization, Institute
UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003. Yun Zhai, Zeeshan Rasheed, Mubarak Shah
UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003 Yun Zhai, Zeeshan Rasheed, Mubarak Shah Computer Vision Laboratory School of Computer Science University of Central Florida, Orlando, Florida ABSTRACT In this
Analecta Vol. 8, No. 2 ISSN 2064-7964
EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. [email protected]
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE 1 S.Manikandan, 2 S.Abirami, 2 R.Indumathi, 2 R.Nandhini, 2 T.Nanthini 1 Assistant Professor, VSA group of institution, Salem. 2 BE(ECE), VSA
Interactive person re-identification in TV series
Interactive person re-identification in TV series Mika Fischer Hazım Kemal Ekenel Rainer Stiefelhagen CV:HCI lab, Karlsruhe Institute of Technology Adenauerring 2, 76131 Karlsruhe, Germany E-mail: {mika.fischer,ekenel,rainer.stiefelhagen}@kit.edu
