State-of-the-art Algorithms for Complete 3D Model Reconstruction

Size: px
Start display at page:

Download "State-of-the-art Algorithms for Complete 3D Model Reconstruction"

Transcription

1 State-of-the-art Algorithms for Complete 3D Model Reconstruction Georgios Kordelas 1, Juan Diego Pèrez-Moneo Agapito 1,2, Jesùs M. Vegas Hernandez 2, and Petros Daras 1 1 Informatics & Telematics Institute, 1st km Thermi Panorama Road, 57001, Thermi, Thessaloniki,Greece 2 Computer Science Department, University of Valladolid, Valladolid, Spain kordelas@iti.gr,perez@iti.gr,jvegas@infor.uva.es,daras@iti.gr Abstract. The task of generating fast and accurate 3D models of a scene has applications in various computer vision fields, including robotics, virtual and augmented reality, and entertainment. So far, the computer vision scientific community has provided innovative reconstruction algorithms that exploit variant types of equipment. In this paper a survey of the recent methods that are able to generate complete and dense 3D models is given. The algorithms are classified into categories according to the equipment used to acquire the processed data. Keywords: 3D reconstruction, multi-view, scan registration. 1 Introduction Nowadays, many advanced applications require three-dimensional (3D) information. The third dimension plays a decisive role in the analysis of dynamic or static environments. Fields of application in daily life that may exploit the third dimension include the surveillance and robotic domains that exploit depth information to gain a much better analysis of the environment. In the medical research, new technologies require more and more reliable depth data. In general, the domains of 3D image processing, digital photography, games, multimedia, 3D visualization and augmented reality make an increasing use of real-time 3D information. The existing 3D modeling methods can be classified according to the required input data, while their efficacy is reflected by the variety of scene that can be processed, the fidelity of the final model and the total processing time. According to the user requirements, automated, semi-automated or manual image-based approaches can be selected to produce digital models usable for inspections, visualization or documentation. Automated methods focus mainly on the full automation of the process but generally produce results which are mainly good for nice-looking real-time 3D recording or simple visualization. On the other hand, semi-automated methods try to reach a balance between accuracy and automation and are very useful for precise documentation and restoration planning.

2 2 Lecture Notes in Computer Science The aim of this paper is to identify the most recent and advanced methods and present them according to the equipment they exploit. The rest of this paper is organized as follows. In Section 2 algorithms using the range scanner technology are presented. Section 3 covers multi-view stereo approaches. In Section 4 an overview of the 3D reconstruction lab established for the scope of the 3DLife project [54] is presented, while conclusions are drawn in Section 5. 2 Reconstruction Using Laser Scanner Technology In the last years, laser scanner technology was emerged as a useful and competitive approach for creating 3D reconstructions. The basic advantages of the methods that use this technology are: (i) speed, (ii) accuracy and (iii) resolution of the reconstruction. Moreover, the scanners field of view allows for the reconstruction of objects, which size ranges from a few centimeters to several meters and exist in a short or long distance. Consequently, this technology is suitable for large-size scenes, as the interior and exterior of buildings and therefore, it is generally accepted by the community as a valid support for documentation and conservation of historic buildings, monuments or archaeological sites. Epigrammatically, the reconstruction of a scene using a range scanner requires the following steps: 1. Acquisition of an appropriate number of colour range scans so as to adequately cover the 3D scene. 2. Registertation of the range scans in the same coordinate system (Fig. 1(I)). 3. Data processing for refinement of the final 3D surface model (Fig. 1(II)). The processing stage includes the following steps: Elimination of redundant information (i.e. double data created from registration of overlapping regions) from the registered point cloud and noise removal. Construction of a 3D model that comprises polygonal facets from the point cloud and its complete missing data (hole filling). The placement of the equipment, in order to acquire the range scans is a trivial task, while there are many commercial software packages that are able to process data so as to refine the final 3D model [52, 53]. Therefore, the main problem during the 3D reconstruction procedure lies in the automatic computation of the three-dimensional transformations that align the range data sets in order to extract the complete 3D model. Registering point clouds in the same coordinate system presumes the exploitation of a 3D-to-3D registration method. There are plenty of methods dealing with range data registration [1 9, 13 15, 18 23]. Some methods are automatic and rely on an automated matching procedure of features [4 9, 13 15, 18, 23], while others require the use of markers [1 3]. This review presents the most efficient procedures that can perform accurate registration without the need of special markers. The range scan registration procedure can be divided into two steps: (1) initial registration that provides a good initial guess of the alignment transformation and (2) fine registration that gives the accurate alignment transformation.

3 Lecture Notes in Computer Science 3 Fig. 1. Registration of: (I) two range scans (II) multiple range scans forming a 3D model [4]. 2.1 Initial Registration A wide variety of methods have already been proposed for initial registration of scans. Several methods extract geometric features from the scans to allow feature matching and alignment between scans. In [4], an effective system that integrates automated 3D-to-3D registration based on geometric features is presented. During preprocessing a set of major 3D planes, a set of geometric 3D lines and a set of reflectance 3D lines from each 3D range scan are extracted. The range scans are registered in the same coordinate system via the automated 3D-to-3D feature-based range-scan registration method of [5]. As a result, all range scans are registered with respect to one selected pivot scan. Since there are some cases in which the extracted linear features are inadequate to register the range scans together, the authors use nonlinear features such as circular arcs. Then, a circle-based registration method based on similarity of radii, orientation and relative position between pairs of circles is utilized in the matching phase. From each valid matching circle pairs, a candidate transformation is computed and its correctness is evaluated. Finally, the transformation achieving the smallest average distance between overlapping range points is chosen as the best one. The mean registration error for this method is about 1 cm, while it can register efficiently scans with minimum overlap of about 20%. The work presented in [6] proposes an angular-invariant feature for the 3D registration procedure to perform reliable selection of point correspondence. The angular feature, which is invariant to scale and rotation transformations, improves the convergence and error without any assumptions about the initial transformation. A major criticism against this feature, however, is that it can discover the potential structural information hiding in nearly flat surfaces. The work presented in [4] can efficiently perform automatic range scan registration using simple geometric features (3D

4 4 Lecture Notes in Computer Science lines, circles). Therefore, this algorithm is appropriate when intensity data does not accompany the range data. A class of registration comprises methods that extract descriptors of large scan areas as the basis for registration and 3D object recognition. An early method for free-form surface registration is proposed in [13]. This approach uses the spin image surface representation, which has low discriminating capability because it maps the 3D range image into a 2D histogram. Therefore, the spin image matching procedure results in many ambiguous correspondences which must be processed through a number of filtration stages to prune out incorrect ones making the technique computationally inefficient even for range images of a reasonable size. A novel 3D free-form surface area representation scheme based on third order tensors, is used for surface registration in [14]. More, specifically multiple tensors are used to represent each range scan. Tensors of two range scans are matched to identify correspondences between them. Correspondences are verified and then used for pairwise registration of the range images. The experimental results show that this algorithm is robust to resolution, number of tensors per view, the required amount of overlap and noise. Comparison against [13] proved its registration efficiency. In [15], the scan alignment is based on the correlation of two Extended Gaussian Images (EGIs) [16] in the Fourier domain using the spherical harmonics of the EGI and the rotational Fourier transform [17]. For pairs with low overlap, which fail to satisfy two criteria (the first one is based on the consistency of surface orientations in the overlapping region and the second one on visibility information), the rotational alignment can be obtained by the alignment of constellation images generated from the EGIs. Rotationally aligned sets are matched by correlation using the Fourier transform of volumetric functions. The merit of this method is that it can efficiently align point clouds with arbitrarily large displacements that have very little overlap. The major advantage of [13 15] methods is that they can resister free-form objects contrarily to [4, 6, 5], which use simple geometric features. When calibrated intensity images accompany the range scans, intensity features can be combined with range information to develop efficient registration methods. In the method developed by Wyngaerd and Van Goon [18], the 3D measurements are combined with the texture. In particular, the surface is intersected with small spheres centered at feature points. The intersection line between such a sampling sphere and the surface defines an invariant region in the texture image. The surface texture inside these regions is used for matching regions between different patches. The merit of this approach is that it can be applied in parts of the surface with poor geometry and rich texture or with rich geometry and poor texture. Bendels et al.[9], match 2D SIFT features [11], backproject them onto range data, and then employ RANSAC [10] on these points in 3D to identify an initial 3D transformation. A similar registration system is presented in [7]. During initialization, intensity keypoints and their SIFT descriptors are extracted from the images and backprojected onto the range data. A 3D coordinate system is established for each keypoint using its backprojected intensity gradient direction and its locally computed range surface normal. Key-

5 Lecture Notes in Computer Science 5 points are then matched using their SIFT descriptors and each match provides an initial rigid transformation estimation. An extension of the Dual-Bootstrap ICP algorithm [12] is used for the fine alignment of the scans. The weakest point of [7] is initialization, since an improper initial estimation would cause alignment failure. Seo et al.[8] use a method to correct geometric and illumination variations of the photometric features before performing 2D local feature matching using the SIFT algorithm. Authors claim that this algorithm is faster than most methods that rely on shape information. Methods that exploit intensity information [18, 9, 7, 8] perform well in aligning textured range scans. However, lack of texture would make these approaches useless. 2.2 Fine Registration Initial registration provides a good estimation of the alignment transformation between sets of 3D range scans, while fine registration, which follows initial registrations, aims to optimally align these sets. In the literature, the ICP (Iterative Closest Point) algorithm [19] is a very popular method for the fine registration of 3D data sets, when an initial guess of the relative pose between them is known. The work presented in [20] classifies several ICP variants and evaluates their performance according to time required to reach the correct alignment. Moreover, a combination of ICP variants optimized for high speed is proposed. The study on convergence properties of ICP [21] shows that the use of normal distances is more effective than Euclidean distances and proposes faster convergence with higher order approximations. An alternative approach that combines intensity and geometric attributes to filter closest point matches is studied in [22]. A maximum-likelihood method for registration of scenes with unmatched or missing data, which does not require ICP refinement, is presented in [23]. In this method, correspondences are formed between valid and missing points in each view. The matched points are classified according to their visibility properties into types. Then a generic sensor model, which can be adjusted to match a wide variety of sensors, is used to generate likelihood measures for each point type. Finally, a multistage optimization procedure, based on these likelihood measures, takes place to find a maximum-likelihood registration. The experimental results proved the efficacy of this method to register complex noisy scenes with occlusions, missing data, and varying degrees of overlap. Even if range scanners give promising results, their cost, size, power requirement and the intricate handling of their data are significant drawbacks. Therefore, the availability of methods that use range scanned technology is limited when compared to multi-view stereo methods. 3 Multi-view Stereo Reconstruction In this section a survey on multi-view stereo reconstruction methods that are efficient to provide dense and full 3D reconstructions of objects from multiple views, is given. Thus, binocular, trinocular, and multi-baseline methods that

6 6 Lecture Notes in Computer Science Fig. 2. (I) Intersection of three visual cones [24] and (II) Evolving shape after eroding inconsistent points [29]. reconstruct a single dense map or structure-from-motion are not considered in this survey. There are several types of methods that are used to reconstruct 3D models of objects from a set of images. These methods could be classified into: (1) methods that reconstruct the visual hull of the object, (2) approaches that recover the photo-hull of an object and (3) algorithms that minimize the surface integral of a certain cost function over the surface shape. 3.1 Visual Hull Reconstruction The first class includes methods that exploit silhouette information to generate intersected visual cones (Fig. 2(I)), which then are used to obtain the 3D representation of an object. Silhouette-based methods are popular for use in multicamera environments mainly due to their simplicity and computational efficiency. In [49], a parallel pipeline processing method for reconstructing a dynamic 3-D object shape from multiview video images is proposed. Real-time processing is accomplished through the combination of a plane-based volume intersection algorithm with a parallel pipeline implementation. The quantitative performance evaluations demonstrated that the acceleration and parallelizing algorithms are very efficient in reconstructing a dynamic full 3-D shape in good resolution. A novel framework for multi-view silhouette cue fusion is proposed in [25]. This framework uses a space occupancy grid as a probabilistic 3D representation of scene contents. The idea behind this paper is to consider each camera pixel as a statistical occupancy sensor. All pixel observations are then used jointly to infer where, and how likely, matter is present in the scene. Through this paper optimal scene object localization, and robust volume reconstruction, can be achieved, with no constraint on camera placement and object visibility. An algorithm that eliminates the problems related to dense feature point matching and camera calibration is presented in [28]. This method is based on the projective geometry between the object space and silhouette images taken from multiple

7 Lecture Notes in Computer Science 7 viewpoints. The object shape is reconstructed by establishing a set of hypothetical planes slicing the object volume and estimating the projective geometric relations between the images. Ishikawa et al.[26] include a genuine segmentation method to acquire the silhouettes, but segmentation errors directly affect the visual hull since the segmentation process is absolutely independent to the reconstruction process. Grauman et al. [27] proposed a Bayesian approach to compensate for modeling errors from false segmentation. They modeled prior density using probabilistic principal components analysis and estimated a maximuma-posteriori reconstruction of multi-view contours. This approach reconstructs good error-compensated models from erroneous silhouette information, but it needs prior knowledge about the objects and ground-truth training data. Concluding, shape-from-silhouette approaches can generate full 3D reconstruction of dynamic scenes, but they lack in reconstruction fidelity and are very sensitive to errors in silhouette extraction. Therefore more efficient techniques, in terms of reconstruction quality, are employed to generate 3D models with increased level of detail. 3.2 Space Carving Reconstruction The second class includes the space carving approaches, which take into account the photometric consistency of the surface across the input images and allow for the recovery of the photo-hull that contains all possible photo-consistent reconstructions. Space carving methods generate an initial reconstruction that envelops the object to be reconstructed. The surface of the reconstruction is then eroded at the points that are inconsistent with the input images (Fig. 2(II)). By repeating this process a reconstruction, which is consistent with the input images, emerges. The Space Carving algorithm suggested by Kutulakos and Seitz [29] uses a repeatedly sweeping plane through the scene volume and tests the photo-consistency of voxels on that plane. This approach permits arbitrary camera placement. However, it has the drawback of making hard and irreversible commitments on the removal of voxels. In particular, if a voxel is removed by error, further voxels can be erroneously removed in a cascade effect. This may lead to incorrect reconstruction by creating holes. Thus, space carving recasts are proposed in the literature. A space carving probabilistic framework for analyzing the 3D occupancy computation problem from multiple images is introduced in [30]. This framework enables a complete analysis of the complex probabilistic dependencies inherent in occupancy calculations and provides an expression for the tightest occupancy bound recoverable from noisy images. In [31], two major extensions to the Space Carving framework are presented. The first one is a progressive scheme for better reconstruction of surfaces lacking sufficient textures. The second one is a novel photo-consistency measure that is valid for both specular and diffuse (Lambertian) surfaces, without the need of light calibration. This method, unlike [29, 30], can deal with surfaces lacking sufficient textures. Concluding, the Space Carving framework suffers from several important limitations:

8 8 Lecture Notes in Computer Science The original Space Carving approach [29] makes hard decisions. This limitation is partially overcome in [30]. The choice of the global threshold on the color variance is often problematic [29, 30]. An attempt to alleviate these photometric constraints in presented in [31] The voxel-based representation, used in the Space Carving approaches, disregards the continuity of shape makes it very hard to enforce any kind of spatial coherence. As a result, space carving is sensitive to noise and outliers and may yield to noisy reconstructions. 3.3 Reconstruction via Surface Integral Minimization The third class of methods optimizes the surface integral of a consistency function over the surface shape. Level-set based methods provide a way of minimizing this cost function. In [43], surface reconstruction is achieved by combining both 3D data and 2D image information. This leads to a more robust approach than existing methods that use only pure 2D information or 3D stereo data. For the efficient evolution of surfaces, a bounded regularization method based on levelset methods, is proposed. Additionally, if the silhouette information from the 2D images is available, it can be integrated to improve the pure 3D results. The main limitation of this system is due to the choice of the surface evolution approach, which assumes a closed and smooth surface. Therefore, the surface reconstruction module is not designed for outdoor or polyhedric objects. The algorithm presented in [37], starts with a generic surface, say a large sphere or a smooth cube, and evolves it to best approximate the shape of the scene. This task is performed by numerically integrating systems of partial differential equations using the level set method presented in [41]. In order to deal with non-lambertial surfaces this method uses a model of the radiance that accounts for deviations from Lambertian reflection through an affine subspace constraint on the radiance tensor field. This algorithm does not require strong texture and can handle sharp radiance changes. A novel method for multi-view stereovision that minimizes the prediction error using a global image-based matching score, is presented in [40]. The input image views are wrapped and registered with a user-defined image similarity measure, which can include neighborhood and global intensity information. The surface evolution is implemented in a level set framework [41]. Experiments proved the superiority of this method against the method presented in [37], even for complex non-lambertian images including specularities and translucency. Except for level-set based methods, a second way of minimizing the surface integral is to use graph-cuts. Yu et al.[38] propose a new iterative graph-cuts based algorithm which operates on the Surface Distance Grid, to reduce the minimal surface bias and transform the discretization bias into a controllable degree of surface smoothness (these biases make difficult the recovery of surface extrusions and other details). This algorithm works better than [37] in preserving the edges and corners, which results in a lower volume difference. The drawback of this method is that it is assumed that the initial estimate is already quite close

9 Lecture Notes in Computer Science 9 to the final result. In [39], a direct surface reconstruction approach is proposed, which starts from a continuous geometric functional that is minimized up to a discretization by a global graph-cut algorithm operating on a 3D embedded graph. The whole procedure is consistently incorporated into a voxel representation that handles both occlusions and discontinuities. In [32], an algorithm for reducing the minimal surface bias associated with volumetric graph cuts for 3D reconstruction from multiple calibrated images is presented. The algorithm is based on an iterative graph-cut over narrow bands combined with an accurate surface normal estimation. At each iteration, the normal to each surface patch is optimized in order to obtain a precise value for the photometric consistency measure. Then, a volumetric graph-cut is applied on a narrow band around the current surface estimate to determine the optimal surface inside this band. The reconstruction results, obtained on standard data sets, are more accurate and complete than in [45] and [33] ([47] presents the evolution of this work and is described below). Additionally, this method does not require exact silhouette images (unlike [45]) or the use of a ballooning term (unlike [33]). The octahedral graph structure used in [34] establishes a well defined relationship between the photo-consistency of a voxel and the edge weights of an embedded octahedral subgraph. This specific graph design supports a hierarchical surface extraction, which allows to efficiently process even high volumetric resolutions and a large number of input images. This method achieves high resolution in the region of interest, but it relies heavily on the visual hull being a good approximation of the surface. 3.4 Fusion of Reconstruction Techniques Many recent approaches use a fusion of different reconstruction techniques to accomplish better reconstruction results. The flexibility of the carving approach is combined with the accuracy of graph-cut optimization in [35]. In this algorithm a progressive refinement scheme is used to recover the topology and reason the visibility of the object. Within each voxel, a detailed surface patch is optimally reconstructed using a graph-cut method. The advantage of this technique is its ability to handle complex shape similarly to level sets while enjoying a higher precision. Compared to carving techniques the produced surface does not suffer from aliasing. This work is extended in [36], where a new surface representation method, called patchwork is introduced. A patchwork is the combination of several patches that are built one by one. This design potentially allows for the reconstruction of an object with arbitrarily large dimensions while preserving a fine level of detail. This algorithm outperforms the level-set method presented in [42] and Space Carving [29]. The use of graph-cut optimization to the volumetric multiview stereo problem is introduced in [47], too. Initially, it is defined an occlusion-robust photo-consistency metric, which is then approximated by a discrete flow graph. This metric uses a robust voting scheme that treats pixels from occluded cameras as outliers. Graph-cut optimization can exactly compute the minimal surface that encloses the largest possible volume, where surface area is just a surface integral in this photo-consistency field. However, the ballooning

10 10 Lecture Notes in Computer Science Fig. 3. Provided interface for (I) displaying the frames captured by the cameras in real-time and (II) calibrating the camera network. term used in this method cannot handle thin structures or big concavities. Kolev et al.[44] consider three different energy models for multiview reconstruction, which are based on a common variational template unifying regional volumetric terms and on-surface photoconsistency. While the first two approaches are based on a classical silhouette-based volume subdivision, the third one relies on stereo information to introduce the concept of propagated photoconsistency, thereby addressing some of the shortcomings of classical methodologies. Qualitative and quantitative experiments showed that precise and spatially consistent reconstructions can be computed by minimizing continuous convex functionals. In [45], a graph cut algorithm manages to recover the 3D shape of an object using both silhouette and foreground color information. Initially, a method that is able to deal with silhouette uncertainties arising from background subtraction is used to extract the visual hull of the object. Then, the graph cut algorithm is used for optimization on a color consistency field. Constraints that are added to improve its performance are efficient enough to preserve protrusions and to pursue concavities on the surface of the object. Sinha et al.[46], recover surfaces at high resolution by performing a graph-cut on the dual of an adaptive volumetric mesh created by photo-consistency driven subdivision. This methods does not require good initializations and is not restricted to a specific surface topology. The specific graph-cut formulation enforces silhouette constraints to counter the bias for minimal surfaces. Local shape refinement via surface deformation is used to recover details in the reconstructed surface. However, the solutions that incorporate silhouette constraints [45, 46] are only viable when exact silhouettes are available. 4 The 3D Reconstruction Lab of 3DLife The 3DLife [54] project aims to develop technologies that could make interaction between humans in virtual online environments easier, more reliable and more

11 Lecture Notes in Computer Science 11 realistic. In order to achieve this goal, the integration of recent progress in 3D data acquisition and processing, autonomous avatars, real-time rendering, interaction in virtual worlds and networking are required. Therefore, scenarios that will enable exploitation of recent progress have been developed. A key element in these scenarios is to collect 3D data in real data from moving people. On this scope a 3D reconstruction lab was established. The lab consists of six CCD cameras (Fig. 4(IV)), which are mounted on a cylindrical grid. Each camera is switched to a computer, which are connected to a net switch forming a star network. There is an additional computer used as server, where the Network Time Protocol is installed. This protocol is used to synchronize the clocks of the computers with the server, so that all the computers share the same time. Server s Graphical User Interface (GUI) encompasses the tools used for monitoring and performing the 3D reconstruction process. In particular, video frames from all cameras are depicted in real time through the interface (Fig. 3(I)). Thus, cameras positions and orientations can be readily adjusted to gain better visibility of the scene. The calibration of the camera network is performed via Zhang s algorithm [48], which uses as calibration object a chessboard pattern. The GUI (Fig. 3(II)) allows the user to define the calibration pattern settings (the number and the size of the squares along width and size), while calibration can be performed either automatically or manually. The calibration outcome are the extrinsic and the intrinsic parameters, which are essential to fuse video data across the cameras. The volumetric intersection approach presented in [49] is the basis of computing the visual hull of the 3D object through multiple video inputs. The process of generating a 3D frame in this framework is described by the following steps: 1. Synchronized multiple video capturing: A set of multiple images are captured simultaneously by the camera network (Fig. 4(I)). 2. Silhouette Extraction: A silhouette is extracted per frame by exploiting a background subtraction algorithm [51] (Fig. 4(II)). 3. Silhouette Volume Intersection: A visual cone, encasing the 3D object, is generated per silhouette and cones are intersected with each other to generate the visual hull of the object (voxel representation). 4. Surface Shape Representation: A marching cubes method [50] is applied to convert the voxel representation to a polygonal representation (the outcome of 3,4 is visualized in Fig. 4(III)). 5. Texture Mapping: Color and texture are projected on the generated 3D shape. The reconstruction results for a time frame are depicted in Fig. 5. More specifically, on the left image the 3D object is depicted in a model simulation of the reconstruction lab (the spheres simulate the position of the cameras), while on the right image a closer view of the reconstructed object is provided. The main advantages of this method are the following: It allows real-time dynamic full 3D shape reconstruction.

12 12 Lecture Notes in Computer Science Fig. 4. (I) Video Capture, (II) Silhouette Extraction, (III) 3D Shape Reconstruction and (IV) 3D Reconstruction Lab Overview. It provides good 3D reconstruction results even for poor quality video, as no texture correspondences are needed. Thus, this algorithm fits to the scope of this lab, which is to collect 3D data in real data from moving people. Its main drawback lies on the fact that not high level of detail can be obtained. Other 3D reconstruction methods (Sections: 3.2, 3.3) could provide better reconstruction fidelity, but the time required for the data process is far from being real time. As a sequence, they are not appropriate for the specific application. The ongoing research towards improving the 3D reconstruction algorithm, includes: (a) the employment of a more efficient background subtraction algorithm than [51], which will be more robust to camera sensor noise, ambiguities between objects and background colors, changes in the lighting of the scene (including shadows of objects of interest) and (b) fusion of the existing method with novel approaches that will improve the reconstruction fidelity without adding significantly to the processing time. 5 Conclusions This paper presents a survey on recent reconstruction algorithms that are able to reconstruct dense and full 3D models. The aim of this survey is broad since methods that exploit variant types of equipment are included. However, except for the equipment-based categorization a further classification for each category is given based on the particular techniques that methods use to recover the 3D shape of objects. Acknowledgments. This work was supported by the 3DLifeEU Network of Excellence (NoE) project.

13 Lecture Notes in Computer Science 13 Fig. 5. Reconstruction results for a time frame. References 1. Kim, T., Seo, Y., Lee, S., Yang, Z., Chang, M.: Simultaneous registration of multiple views with markers. Computer-Aided Design, Elsevier, 41(4), , (2009) 2. Akca, D.: Full automatic registration of laser scanner point clouds. Optical 3-D Measurement Techniques VI, 1, , (2003) 3. Bienert, A., Maas, H.: Methods for the Automatic Geometric Registration of Terrestrial Laserscanner Point Clouds in Forest Stands. In ISPRS Workshop, (2009) 4. Stamos, I., Liu, L., Chao, C., Wolberg, G., Yu, G., Zokai, S.: Integrating Automated Range Registration with Multiview Geometry for the Photorealistic Modeling of Large-Scale Scenes. IJCV, Springer, 78(2/3), , (2008) 5. Chen, C., Stamos, I.: Semi-automatic range to range registration: a feature-based method. In the 5th international conference on 3-D digital imaging and modeling, , (2005) 6. Jiang, J., Cheng, J., Chen, X.: Registration for 3-D point cloud using angularinvariant feature. Neurocomputing, Elsevier, 72, (2009) 7. Smith, E., King, B., Stewart, C., Radke, R.: Registration of Combined Range- Intensity Scans: Initialization through Verification. Computer Vision and Image Understanding, 110, (2008) 8. Seo, J.,Sharp, G., Lee, S.: Range data registration using photometric features. In Proc. of CVPR, 2, , (2005) 9. Bendels, G., Degener, P., Wahl, R., Kortgen, M.: Klein R., Image-based registration of 3d-range data using feature surface elements. In Proc. of International Symposium on Virtual Reality, Archaeology and Cultural Heritage VAST, (2004) 10. Fischler, M., Bolles, R.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Graphics and Image Processing 24(6), , (1981) 11. Lowe, D.G.: Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 60, (2004)

14 14 Lecture Notes in Computer Science 12. Stewart, C., Tsai, C., Roysam, B.: The Dual-Bootstrap iterative closest point algorithm with application to retinal image registration. IEEE Trans. Med. Imag., 22(11), , (2003) 13., Johnson, A., Hebert, M.: Surface registration by matching oriented points. In International Conference on Recent Advances in 3-D Imaging and Modelling, , (1997) 14. Mian, A., Bennamoun, M., Owens, R.: A Novel Representation and Feature Matching Algorithm for Automatic Pairwise Registration of Range Images. IJCV, Springer, 66(1), 19 40, (2006) 15. Makadia, A., Patterson, A., Daniilidis, K.: Fully automatic registration of 3d point clouds. In CVPR, (2006) 16. Kang, S., Ikeuchi, K.: The complex EGI: A new representation for 3-D pose determination. TPAMI, 15(7), , (1993) 17. Kostelec, P., Rockmore, D.: FFTs on the Rotation Group. In Working Paper Series, Santa Fe Institute, (2003) 18. Wyngaerd, V., Gool, V.: Combining texture and shape for automatic crude patch registration. In Proc. Fourth Int. Conf. on 3DIM, pages, , (2003) 19. Besl, P., McKay, N.: A method for registration of 3-d shapes. IEEE Trans. Pattern Anal. Mach. Intell, 14(2), , (1992) 20. Rusinkiewicz, S., Levoy, M.: Effcient variants of the ICP algorithm. In Proc. Third Int. Conf. on 3DIM, , (2001) 21. Pottmann, H., Huang, Q., Yang, Y., Hu, S.:Geometry and convergence analysis of algorithms for registration of 3d shapes. Int. J. Comp. Vis., 67(3), , (2006) 22. Okatani, I., Sugimoto, A.: Registration of Range Images that Preserves Local Surface Structures and Color, In 3DPTV, , (2004) 23. Sharp, G., Lee, S., Wehe, D.: Maximum-Likelihood Registration of Range Images with Missing Data. TPAMI, 30(1), , (2008) 24. Matusik, W., Buehler, C., Raskar, R., Gortler, S., McMillan, L.: Image-based visual hulls. In SIGGRAPH Proceedings, , (2000) 25. Franco, J., Boyer, E.: Fusion of multi-view silhouette cues using a space occupancy grid. In ICCV, 2, , (2005) 26. Ishikawa, T., Yamazawa, K., Yokoya, N.: Real-time generation of novel views of a dynamic scene using morphing and visual hull. Proc. ICIP, , (2005) 27. Grauman, K., Shakhnarovich, G., Darrell, T.: A Bayesian Approach to Image- Based Visual Hull Reconstruction. In CVPR, , (2003) 28. Lai, P., Yilmaz, A.: Shape Recovery Using Rotated Slicing Planes, Int. Congress on Image and Signal Processing, (2009) 29. Kutulakos, K., Seitz, S.: A Theory of Shape by Space Carving. In International Journal of Computer Vision, 38(3), , (2000) 30. Bhotika, R., Fleet, D., Kutulakos, K.: A Probabilistic Theory of Occupancy and Emptiness. In Proc. ECCV, 3, , (2002) 31. Yang, R., Pollefeys, M., Welch, G.: Dealing with textureless regions and specular highlights - A progressive space carving scheme using a novel photo-consistency measure. In ICCV, , (2003) 32. Ladikos, A.,Benhimane, S., Navab, N.: Multi-View Reconstruction using Narrow- Band Graph-Cuts and Surface Normal Optimization, In BMVC, (2008) 33. Vogiatzis, G., Torr, P., Cipolla, R.: Multi-view stereo via volumetric graph-cuts. In CVPR, , (2005) 34. Hornung, A., Kobbelt, L.: Hierarchical volumetric multi-view stereo reconstruction of manifold surfaces based on dual graph embedding. In CVPR, 1, , (2006)

15 Lecture Notes in Computer Science Zeng, G., Paris S., Quan, L., Sillion, F.: Progressive surface reconstruction from images using a local prior. In ICCV, , (2005) 36. Zeng, G., Paris S., Quan, L., Sillion, F.: Accurate and scalable surface representation and reconstruction from images. TPAMI, 29(1), , (2007) 37. Jin, H., Soatto, S., Yezzi A.: Multi-view stereo reconstruction of dense shape and complex appearance. International Journal of Computer Vision, 63(3), , (2005) 38. Yu, T., Ahuja, N., Chen, W.: SDG Cut: 3d reconstruction of non-lambertian objects using graph cuts on surface distance grid. In Proc. CVPR, (2006) 39. Paris, S., Sillion, F., Quan, L.: A surface reconstruction method using global graph cut optimization. IJCV, Springer, 66(2), , (2006) 40. Pons, J., Keriven, R., Faugeras, O.: Multi-view stereo reconstruction and scene flow estimation with a global image-based matching score. IJCV, Springer, 72(2), , (2007) 41. Osher, S., Sethian, J.: Fronts propagating with curvature-dependent speed: algorithms based on hamilton-jacobi equations. J. of Comp. Physics, 79, 12 49, (1988) 42. Lhuillier, M., Quan, L.: Surface Reconstruction by Integrating 3D and 2D Data of Multiple Views. In ICCV, (2003) 43. Lhuillier, M., Quan, L.: A Quasi-Dense Approach to Surface Reconstruction from Uncalibrated Images, TPAMI, 27(3), , (2005) 44. Kolev K.,Klodt, M.,Brox, T.,Cremers, D.:Continuous Global Optimization in Multiview 3D Reconstruction. IJCV, Springer, 84, 8096, (2009) 45. Tran, S., Davis, L.: 3d surface reconstruction using graph cuts with surface constraints. In ECCV, 2, , (2006) 46. Sinha, S., Mordohai, P., Pollefeys, M.: Multi-view stereo via graph cuts on the dual of an adaptive tetrahedral mesh. In ICCV, (2007) 47. Vogiatzis, G., C. Hernandez, Torr, P., and Cipolla, R.: Multiview stereo via volumetric graph-cuts and occlusion robust photo-consistency. TPAMI, 29(12), , (2007) 48. Zhang, Z.:A flexible new technique for camera calibration. In IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(11), , (2000) 49. Matsuyama, T., Wu, X., Takai, T., Wada, T.: Real-Time Dynamic 3-D Object Shape Reconstruction and High-Fidelity Texture Mapping for 3-D Video. In IEEE Transactions on Circuits and Systems for Video Technology, 14(3), , (2004) 50. Lorensen, W., Cline, H.: Marching cubes: A high resolution 3d surface construction algorithm. Computer Graphics, 21(4): , Kim, K., Chalidabhongse, T., Harwood, D., Davis L.: Real-time foregroundbackground segmentation using codebook model. Real-Time Imaging, 11, , (2005) 52. Autodesk 3ds Max, VRMesh - For point cloud and triangle mesh processing, DLife-Towards a VCE for Media Internet,

Announcements. Active stereo with structured light. Project structured light patterns onto the object

Announcements. Active stereo with structured light. Project structured light patterns onto the object Announcements Active stereo with structured light Project 3 extension: Wednesday at noon Final project proposal extension: Friday at noon > consult with Steve, Rick, and/or Ian now! Project 2 artifact

More information

3 Image-Based Photo Hulls. 2 Image-Based Visual Hulls. 3.1 Approach. 3.2 Photo-Consistency. Figure 1. View-dependent geometry.

3 Image-Based Photo Hulls. 2 Image-Based Visual Hulls. 3.1 Approach. 3.2 Photo-Consistency. Figure 1. View-dependent geometry. Image-Based Photo Hulls Greg Slabaugh, Ron Schafer Georgia Institute of Technology Center for Signal and Image Processing Atlanta, GA 30332 {slabaugh, rws}@ece.gatech.edu Mat Hans Hewlett-Packard Laboratories

More information

ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES

ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES B. Sirmacek, R. Lindenbergh Delft University of Technology, Department of Geoscience and Remote Sensing, Stevinweg

More information

3D Model based Object Class Detection in An Arbitrary View

3D Model based Object Class Detection in An Arbitrary View 3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/

More information

A Short Introduction to Computer Graphics

A Short Introduction to Computer Graphics A Short Introduction to Computer Graphics Frédo Durand MIT Laboratory for Computer Science 1 Introduction Chapter I: Basics Although computer graphics is a vast field that encompasses almost any graphical

More information

Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006

Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006 Practical Tour of Visual tracking David Fleet and Allan Jepson January, 2006 Designing a Visual Tracker: What is the state? pose and motion (position, velocity, acceleration, ) shape (size, deformation,

More information

Segmentation of building models from dense 3D point-clouds

Segmentation of building models from dense 3D point-clouds Segmentation of building models from dense 3D point-clouds Joachim Bauer, Konrad Karner, Konrad Schindler, Andreas Klaus, Christopher Zach VRVis Research Center for Virtual Reality and Visualization, Institute

More information

Wii Remote Calibration Using the Sensor Bar

Wii Remote Calibration Using the Sensor Bar Wii Remote Calibration Using the Sensor Bar Alparslan Yildiz Abdullah Akay Yusuf Sinan Akgul GIT Vision Lab - http://vision.gyte.edu.tr Gebze Institute of Technology Kocaeli, Turkey {yildiz, akay, akgul}@bilmuh.gyte.edu.tr

More information

A Prototype For Eye-Gaze Corrected

A Prototype For Eye-Gaze Corrected A Prototype For Eye-Gaze Corrected Video Chat on Graphics Hardware Maarten Dumont, Steven Maesen, Sammy Rogmans and Philippe Bekaert Introduction Traditional webcam video chat: No eye contact. No extensive

More information

Automatic Labeling of Lane Markings for Autonomous Vehicles

Automatic Labeling of Lane Markings for Autonomous Vehicles Automatic Labeling of Lane Markings for Autonomous Vehicles Jeffrey Kiske Stanford University 450 Serra Mall, Stanford, CA 94305 jkiske@stanford.edu 1. Introduction As autonomous vehicles become more popular,

More information

Spatio-Temporally Coherent 3D Animation Reconstruction from Multi-view RGB-D Images using Landmark Sampling

Spatio-Temporally Coherent 3D Animation Reconstruction from Multi-view RGB-D Images using Landmark Sampling , March 13-15, 2013, Hong Kong Spatio-Temporally Coherent 3D Animation Reconstruction from Multi-view RGB-D Images using Landmark Sampling Naveed Ahmed Abstract We present a system for spatio-temporally

More information

MetropoGIS: A City Modeling System DI Dr. Konrad KARNER, DI Andreas KLAUS, DI Joachim BAUER, DI Christopher ZACH

MetropoGIS: A City Modeling System DI Dr. Konrad KARNER, DI Andreas KLAUS, DI Joachim BAUER, DI Christopher ZACH MetropoGIS: A City Modeling System DI Dr. Konrad KARNER, DI Andreas KLAUS, DI Joachim BAUER, DI Christopher ZACH VRVis Research Center for Virtual Reality and Visualization, Virtual Habitat, Inffeldgasse

More information

PHOTOGRAMMETRIC TECHNIQUES FOR MEASUREMENTS IN WOODWORKING INDUSTRY

PHOTOGRAMMETRIC TECHNIQUES FOR MEASUREMENTS IN WOODWORKING INDUSTRY PHOTOGRAMMETRIC TECHNIQUES FOR MEASUREMENTS IN WOODWORKING INDUSTRY V. Knyaz a, *, Yu. Visilter, S. Zheltov a State Research Institute for Aviation System (GosNIIAS), 7, Victorenko str., Moscow, Russia

More information

Introduction. C 2009 John Wiley & Sons, Ltd

Introduction. C 2009 John Wiley & Sons, Ltd 1 Introduction The purpose of this text on stereo-based imaging is twofold: it is to give students of computer vision a thorough grounding in the image analysis and projective geometry techniques relevant

More information

CS 534: Computer Vision 3D Model-based recognition

CS 534: Computer Vision 3D Model-based recognition CS 534: Computer Vision 3D Model-based recognition Ahmed Elgammal Dept of Computer Science CS 534 3D Model-based Vision - 1 High Level Vision Object Recognition: What it means? Two main recognition tasks:!

More information

High-accuracy ultrasound target localization for hand-eye calibration between optical tracking systems and three-dimensional ultrasound

High-accuracy ultrasound target localization for hand-eye calibration between optical tracking systems and three-dimensional ultrasound High-accuracy ultrasound target localization for hand-eye calibration between optical tracking systems and three-dimensional ultrasound Ralf Bruder 1, Florian Griese 2, Floris Ernst 1, Achim Schweikard

More information

The Visual Internet of Things System Based on Depth Camera

The Visual Internet of Things System Based on Depth Camera The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed

More information

ENGN 2502 3D Photography / Winter 2012 / SYLLABUS http://mesh.brown.edu/3dp/

ENGN 2502 3D Photography / Winter 2012 / SYLLABUS http://mesh.brown.edu/3dp/ ENGN 2502 3D Photography / Winter 2012 / SYLLABUS http://mesh.brown.edu/3dp/ Description of the proposed course Over the last decade digital photography has entered the mainstream with inexpensive, miniaturized

More information

A unified representation for interactive 3D modeling

A unified representation for interactive 3D modeling A unified representation for interactive 3D modeling Dragan Tubić, Patrick Hébert, Jean-Daniel Deschênes and Denis Laurendeau Computer Vision and Systems Laboratory, University Laval, Québec, Canada [tdragan,hebert,laurendeau]@gel.ulaval.ca

More information

Build Panoramas on Android Phones

Build Panoramas on Android Phones Build Panoramas on Android Phones Tao Chu, Bowen Meng, Zixuan Wang Stanford University, Stanford CA Abstract The purpose of this work is to implement panorama stitching from a sequence of photos taken

More information

Manufacturing Process and Cost Estimation through Process Detection by Applying Image Processing Technique

Manufacturing Process and Cost Estimation through Process Detection by Applying Image Processing Technique Manufacturing Process and Cost Estimation through Process Detection by Applying Image Processing Technique Chalakorn Chitsaart, Suchada Rianmora, Noppawat Vongpiyasatit Abstract In order to reduce the

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

3D Human Face Recognition Using Point Signature

3D Human Face Recognition Using Point Signature 3D Human Face Recognition Using Point Signature Chin-Seng Chua, Feng Han, Yeong-Khing Ho School of Electrical and Electronic Engineering Nanyang Technological University, Singapore 639798 ECSChua@ntu.edu.sg

More information

How To Analyze Ball Blur On A Ball Image

How To Analyze Ball Blur On A Ball Image Single Image 3D Reconstruction of Ball Motion and Spin From Motion Blur An Experiment in Motion from Blur Giacomo Boracchi, Vincenzo Caglioti, Alessandro Giusti Objective From a single image, reconstruct:

More information

Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite

Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite Philip Lenz 1 Andreas Geiger 2 Christoph Stiller 1 Raquel Urtasun 3 1 KARLSRUHE INSTITUTE OF TECHNOLOGY 2 MAX-PLANCK-INSTITUTE IS 3

More information

Architectural Photogrammetry Lab., College of Architecture, University of Valladolid - jgarciaf@mtu.edu b

Architectural Photogrammetry Lab., College of Architecture, University of Valladolid - jgarciaf@mtu.edu b AN APPROACH TO 3D DIGITAL MODELING OF SURFACES WITH POOR TEXTURE BY RANGE IMAGING TECHNIQUES. SHAPE FROM STEREO VS. SHAPE FROM SILHOUETTE IN DIGITIZING JORGE OTEIZA S SCULPTURES J. García Fernández a,

More information

EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM

EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM Amol Ambardekar, Mircea Nicolescu, and George Bebis Department of Computer Science and Engineering University

More information

Multi-View Reconstruction in-between Known Environments

Multi-View Reconstruction in-between Known Environments Multi-View Reconstruction in-between Known Environments Stefan KUHN and Dominik HENRICH Lehrstuhl für Angewandte Informatik III (Robotik und Eingebettete Systeme) Universität Bayreuth, D-95440 Bayreuth,

More information

Consolidated Visualization of Enormous 3D Scan Point Clouds with Scanopy

Consolidated Visualization of Enormous 3D Scan Point Clouds with Scanopy Consolidated Visualization of Enormous 3D Scan Point Clouds with Scanopy Claus SCHEIBLAUER 1 / Michael PREGESBAUER 2 1 Institute of Computer Graphics and Algorithms, Vienna University of Technology, Austria

More information

VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS

VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS Norbert Buch 1, Mark Cracknell 2, James Orwell 1 and Sergio A. Velastin 1 1. Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE,

More information

Colorado School of Mines Computer Vision Professor William Hoff

Colorado School of Mines Computer Vision Professor William Hoff Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Introduction to 2 What is? A process that produces from images of the external world a description

More information

A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA

A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA N. Zarrinpanjeh a, F. Dadrassjavan b, H. Fattahi c * a Islamic Azad University of Qazvin - nzarrin@qiau.ac.ir

More information

Simultaneous Gamma Correction and Registration in the Frequency Domain

Simultaneous Gamma Correction and Registration in the Frequency Domain Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong a28wong@uwaterloo.ca William Bishop wdbishop@uwaterloo.ca Department of Electrical and Computer Engineering University

More information

3D SCANNING: A NEW APPROACH TOWARDS MODEL DEVELOPMENT IN ADVANCED MANUFACTURING SYSTEM

3D SCANNING: A NEW APPROACH TOWARDS MODEL DEVELOPMENT IN ADVANCED MANUFACTURING SYSTEM 3D SCANNING: A NEW APPROACH TOWARDS MODEL DEVELOPMENT IN ADVANCED MANUFACTURING SYSTEM Dr. Trikal Shivshankar 1, Patil Chinmay 2, Patokar Pradeep 3 Professor, Mechanical Engineering Department, SSGM Engineering

More information

Mean-Shift Tracking with Random Sampling

Mean-Shift Tracking with Random Sampling 1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of

More information

siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service

siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service Ahmad Pahlavan Tafti 1, Hamid Hassannia 2, and Zeyun Yu 1 1 Department of Computer Science, University of Wisconsin -Milwaukee,

More information

Removing Moving Objects from Point Cloud Scenes

Removing Moving Objects from Point Cloud Scenes 1 Removing Moving Objects from Point Cloud Scenes Krystof Litomisky klitomis@cs.ucr.edu Abstract. Three-dimensional simultaneous localization and mapping is a topic of significant interest in the research

More information

Efficient Multi-View Reconstruction of Large-Scale Scenes using Interest Points, Delaunay Triangulation and Graph Cuts

Efficient Multi-View Reconstruction of Large-Scale Scenes using Interest Points, Delaunay Triangulation and Graph Cuts Efficient Multi-View Reconstruction of Large-Scale Scenes using Interest Points, Delaunay Triangulation and Graph Cuts Patrick Labatut 1 Jean-Philippe Pons 1,2 Renaud Keriven 1,2 1 Département d informatique

More information

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles mjhustc@ucla.edu and lunbo

More information

How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm

How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm IJSTE - International Journal of Science Technology & Engineering Volume 1 Issue 10 April 2015 ISSN (online): 2349-784X Image Estimation Algorithm for Out of Focus and Blur Images to Retrieve the Barcode

More information

Edge tracking for motion segmentation and depth ordering

Edge tracking for motion segmentation and depth ordering Edge tracking for motion segmentation and depth ordering P. Smith, T. Drummond and R. Cipolla Department of Engineering University of Cambridge Cambridge CB2 1PZ,UK {pas1001 twd20 cipolla}@eng.cam.ac.uk

More information

ASSESSMENT OF VISUALIZATION SOFTWARE FOR SUPPORT OF CONSTRUCTION SITE INSPECTION TASKS USING DATA COLLECTED FROM REALITY CAPTURE TECHNOLOGIES

ASSESSMENT OF VISUALIZATION SOFTWARE FOR SUPPORT OF CONSTRUCTION SITE INSPECTION TASKS USING DATA COLLECTED FROM REALITY CAPTURE TECHNOLOGIES ASSESSMENT OF VISUALIZATION SOFTWARE FOR SUPPORT OF CONSTRUCTION SITE INSPECTION TASKS USING DATA COLLECTED FROM REALITY CAPTURE TECHNOLOGIES ABSTRACT Chris Gordon 1, Burcu Akinci 2, Frank Boukamp 3, and

More information

Model Repair. Leif Kobbelt RWTH Aachen University )NPUT $ATA 2EMOVAL OF TOPOLOGICAL AND GEOMETRICAL ERRORS !NALYSIS OF SURFACE QUALITY

Model Repair. Leif Kobbelt RWTH Aachen University )NPUT $ATA 2EMOVAL OF TOPOLOGICAL AND GEOMETRICAL ERRORS !NALYSIS OF SURFACE QUALITY )NPUT $ATA 2ANGE 3CAN #!$ 4OMOGRAPHY 2EMOVAL OF TOPOLOGICAL AND GEOMETRICAL ERRORS!NALYSIS OF SURFACE QUALITY 3URFACE SMOOTHING FOR NOISE REMOVAL 0ARAMETERIZATION 3IMPLIFICATION FOR COMPLEXITY REDUCTION

More information

A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms

A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms Steven M. Seitz Brian Curless University of Washington James Diebel Stanford University Daniel Scharstein Middlebury College Richard

More information

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. indhubatchvsa@gmail.com

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. indhubatchvsa@gmail.com LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE 1 S.Manikandan, 2 S.Abirami, 2 R.Indumathi, 2 R.Nandhini, 2 T.Nanthini 1 Assistant Professor, VSA group of institution, Salem. 2 BE(ECE), VSA

More information

Palmprint Recognition. By Sree Rama Murthy kora Praveen Verma Yashwant Kashyap

Palmprint Recognition. By Sree Rama Murthy kora Praveen Verma Yashwant Kashyap Palmprint Recognition By Sree Rama Murthy kora Praveen Verma Yashwant Kashyap Palm print Palm Patterns are utilized in many applications: 1. To correlate palm patterns with medical disorders, e.g. genetic

More information

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow , pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices

More information

A NEW SUPER RESOLUTION TECHNIQUE FOR RANGE DATA. Valeria Garro, Pietro Zanuttigh, Guido M. Cortelazzo. University of Padova, Italy

A NEW SUPER RESOLUTION TECHNIQUE FOR RANGE DATA. Valeria Garro, Pietro Zanuttigh, Guido M. Cortelazzo. University of Padova, Italy A NEW SUPER RESOLUTION TECHNIQUE FOR RANGE DATA Valeria Garro, Pietro Zanuttigh, Guido M. Cortelazzo University of Padova, Italy ABSTRACT Current Time-of-Flight matrix sensors allow for the acquisition

More information

A Learning Based Method for Super-Resolution of Low Resolution Images

A Learning Based Method for Super-Resolution of Low Resolution Images A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 emre.ugur@ceng.metu.edu.tr Abstract The main objective of this project is the study of a learning based method

More information

The Big Data methodology in computer vision systems

The Big Data methodology in computer vision systems The Big Data methodology in computer vision systems Popov S.B. Samara State Aerospace University, Image Processing Systems Institute, Russian Academy of Sciences Abstract. I consider the advantages of

More information

Universidad de Cantabria Departamento de Tecnología Electrónica, Ingeniería de Sistemas y Automática. Tesis Doctoral

Universidad de Cantabria Departamento de Tecnología Electrónica, Ingeniería de Sistemas y Automática. Tesis Doctoral Universidad de Cantabria Departamento de Tecnología Electrónica, Ingeniería de Sistemas y Automática Tesis Doctoral CONTRIBUCIONES AL ALINEAMIENTO DE NUBES DE PUNTOS 3D PARA SU USO EN APLICACIONES DE CAPTURA

More information

Optical Tracking Using Projective Invariant Marker Pattern Properties

Optical Tracking Using Projective Invariant Marker Pattern Properties Optical Tracking Using Projective Invariant Marker Pattern Properties Robert van Liere, Jurriaan D. Mulder Department of Information Systems Center for Mathematics and Computer Science Amsterdam, the Netherlands

More information

Dense Matching Methods for 3D Scene Reconstruction from Wide Baseline Images

Dense Matching Methods for 3D Scene Reconstruction from Wide Baseline Images Dense Matching Methods for 3D Scene Reconstruction from Wide Baseline Images Zoltán Megyesi PhD Theses Supervisor: Prof. Dmitry Chetverikov Eötvös Loránd University PhD Program in Informatics Program Director:

More information

Image Segmentation and Registration

Image Segmentation and Registration Image Segmentation and Registration Dr. Christine Tanner (tanner@vision.ee.ethz.ch) Computer Vision Laboratory, ETH Zürich Dr. Verena Kaynig, Machine Learning Laboratory, ETH Zürich Outline Segmentation

More information

3D/4D acquisition. 3D acquisition taxonomy 22.10.2014. Computer Vision. Computer Vision. 3D acquisition methods. passive. active.

3D/4D acquisition. 3D acquisition taxonomy 22.10.2014. Computer Vision. Computer Vision. 3D acquisition methods. passive. active. Das Bild kann zurzeit nicht angezeigt werden. 22.10.2014 3D/4D acquisition 3D acquisition taxonomy 3D acquisition methods passive active uni-directional multi-directional uni-directional multi-directional

More information

Jiří Matas. Hough Transform

Jiří Matas. Hough Transform Hough Transform Jiří Matas Center for Machine Perception Department of Cybernetics, Faculty of Electrical Engineering Czech Technical University, Prague Many slides thanks to Kristen Grauman and Bastian

More information

Incremental Surface Extraction from Sparse Structure-from-Motion Point Clouds

Incremental Surface Extraction from Sparse Structure-from-Motion Point Clouds HOPPE, KLOPSCHITZ, DONOSER, BISCHOF: INCREMENTAL SURFACE EXTRACTION 1 Incremental Surface Extraction from Sparse Structure-from-Motion Point Clouds Christof Hoppe 1 hoppe@icg.tugraz.at Manfred Klopschitz

More information

Accurate and robust image superresolution by neural processing of local image representations

Accurate and robust image superresolution by neural processing of local image representations Accurate and robust image superresolution by neural processing of local image representations Carlos Miravet 1,2 and Francisco B. Rodríguez 1 1 Grupo de Neurocomputación Biológica (GNB), Escuela Politécnica

More information

2003-2012 MVTec Software GmbH.

2003-2012 MVTec Software GmbH. 1 MVTec Software GmbH is a leading international manufacturer of software for machine vision used in all demanding areas of imaging: semi-conductor industry, web inspection, quality control and inspection

More information

Tracking Groups of Pedestrians in Video Sequences

Tracking Groups of Pedestrians in Video Sequences Tracking Groups of Pedestrians in Video Sequences Jorge S. Marques Pedro M. Jorge Arnaldo J. Abrantes J. M. Lemos IST / ISR ISEL / IST ISEL INESC-ID / IST Lisbon, Portugal Lisbon, Portugal Lisbon, Portugal

More information

T-REDSPEED White paper

T-REDSPEED White paper T-REDSPEED White paper Index Index...2 Introduction...3 Specifications...4 Innovation...6 Technology added values...7 Introduction T-REDSPEED is an international patent pending technology for traffic violation

More information

Robot Perception Continued

Robot Perception Continued Robot Perception Continued 1 Visual Perception Visual Odometry Reconstruction Recognition CS 685 11 Range Sensing strategies Active range sensors Ultrasound Laser range sensor Slides adopted from Siegwart

More information

HIGH AND LOW RESOLUTION TEXTURED MODELS OF COMPLEX ARCHITECTURAL SURFACES

HIGH AND LOW RESOLUTION TEXTURED MODELS OF COMPLEX ARCHITECTURAL SURFACES HIGH AND LOW RESOLUTION TEXTURED MODELS OF COMPLEX ARCHITECTURAL SURFACES E. K. Stathopoulou a, A. Valanis a, J. L. Lerma b, A. Georgopoulos a a Laboratory of Photogrammetry, National Technical University

More information

Tracking performance evaluation on PETS 2015 Challenge datasets

Tracking performance evaluation on PETS 2015 Challenge datasets Tracking performance evaluation on PETS 2015 Challenge datasets Tahir Nawaz, Jonathan Boyle, Longzhen Li and James Ferryman Computational Vision Group, School of Systems Engineering University of Reading,

More information

Automated Process for Generating Digitised Maps through GPS Data Compression

Automated Process for Generating Digitised Maps through GPS Data Compression Automated Process for Generating Digitised Maps through GPS Data Compression Stewart Worrall and Eduardo Nebot University of Sydney, Australia {s.worrall, e.nebot}@acfr.usyd.edu.au Abstract This paper

More information

DESIGN & DEVELOPMENT OF AUTONOMOUS SYSTEM TO BUILD 3D MODEL FOR UNDERWATER OBJECTS USING STEREO VISION TECHNIQUE

DESIGN & DEVELOPMENT OF AUTONOMOUS SYSTEM TO BUILD 3D MODEL FOR UNDERWATER OBJECTS USING STEREO VISION TECHNIQUE DESIGN & DEVELOPMENT OF AUTONOMOUS SYSTEM TO BUILD 3D MODEL FOR UNDERWATER OBJECTS USING STEREO VISION TECHNIQUE N. Satish Kumar 1, B L Mukundappa 2, Ramakanth Kumar P 1 1 Dept. of Information Science,

More information

ON THE AUTOMATION OF THE REGISTRATION OF POINT CLOUDS USING THE METROPOLIS ALGORITHM

ON THE AUTOMATION OF THE REGISTRATION OF POINT CLOUDS USING THE METROPOLIS ALGORITHM ON THE AUTOMATION OF THE REGISTRATION OF POINT CLOUDS USING THE METROPOLIS ALGORITHM Axel Wendt Institute for Applied Photogrammetry and Geoinformatics, University of Applied Sciences Oldenburg, Ofener

More information

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express

More information

Template-based Eye and Mouth Detection for 3D Video Conferencing

Template-based Eye and Mouth Detection for 3D Video Conferencing Template-based Eye and Mouth Detection for 3D Video Conferencing Jürgen Rurainsky and Peter Eisert Fraunhofer Institute for Telecommunications - Heinrich-Hertz-Institute, Image Processing Department, Einsteinufer

More information

Character Animation from 2D Pictures and 3D Motion Data ALEXANDER HORNUNG, ELLEN DEKKERS, and LEIF KOBBELT RWTH-Aachen University

Character Animation from 2D Pictures and 3D Motion Data ALEXANDER HORNUNG, ELLEN DEKKERS, and LEIF KOBBELT RWTH-Aachen University Character Animation from 2D Pictures and 3D Motion Data ALEXANDER HORNUNG, ELLEN DEKKERS, and LEIF KOBBELT RWTH-Aachen University Presented by: Harish CS-525 First presentation Abstract This article presents

More information

Segmentation & Clustering

Segmentation & Clustering EECS 442 Computer vision Segmentation & Clustering Segmentation in human vision K-mean clustering Mean-shift Graph-cut Reading: Chapters 14 [FP] Some slides of this lectures are courtesy of prof F. Li,

More information

3D Vision An enabling Technology for Advanced Driver Assistance and Autonomous Offroad Driving

3D Vision An enabling Technology for Advanced Driver Assistance and Autonomous Offroad Driving 3D Vision An enabling Technology for Advanced Driver Assistance and Autonomous Offroad Driving AIT Austrian Institute of Technology Safety & Security Department Christian Zinner Safe and Autonomous Systems

More information

REAL-TIME IMAGE BASED LIGHTING FOR OUTDOOR AUGMENTED REALITY UNDER DYNAMICALLY CHANGING ILLUMINATION CONDITIONS

REAL-TIME IMAGE BASED LIGHTING FOR OUTDOOR AUGMENTED REALITY UNDER DYNAMICALLY CHANGING ILLUMINATION CONDITIONS REAL-TIME IMAGE BASED LIGHTING FOR OUTDOOR AUGMENTED REALITY UNDER DYNAMICALLY CHANGING ILLUMINATION CONDITIONS Tommy Jensen, Mikkel S. Andersen, Claus B. Madsen Laboratory for Computer Vision and Media

More information

A method of generating free-route walk-through animation using vehicle-borne video image

A method of generating free-route walk-through animation using vehicle-borne video image A method of generating free-route walk-through animation using vehicle-borne video image Jun KUMAGAI* Ryosuke SHIBASAKI* *Graduate School of Frontier Sciences, Shibasaki lab. University of Tokyo 4-6-1

More information

Building an Advanced Invariant Real-Time Human Tracking System

Building an Advanced Invariant Real-Time Human Tracking System UDC 004.41 Building an Advanced Invariant Real-Time Human Tracking System Fayez Idris 1, Mazen Abu_Zaher 2, Rashad J. Rasras 3, and Ibrahiem M. M. El Emary 4 1 School of Informatics and Computing, German-Jordanian

More information

How To Fuse A Point Cloud With A Laser And Image Data From A Pointcloud

How To Fuse A Point Cloud With A Laser And Image Data From A Pointcloud REAL TIME 3D FUSION OF IMAGERY AND MOBILE LIDAR Paul Mrstik, Vice President Technology Kresimir Kusevic, R&D Engineer Terrapoint Inc. 140-1 Antares Dr. Ottawa, Ontario K2E 8C4 Canada paul.mrstik@terrapoint.com

More information

Introduction to Computer Graphics

Introduction to Computer Graphics Introduction to Computer Graphics Torsten Möller TASC 8021 778-782-2215 torsten@sfu.ca www.cs.sfu.ca/~torsten Today What is computer graphics? Contents of this course Syllabus Overview of course topics

More information

Tracking in flussi video 3D. Ing. Samuele Salti

Tracking in flussi video 3D. Ing. Samuele Salti Seminari XXIII ciclo Tracking in flussi video 3D Ing. Tutors: Prof. Tullio Salmon Cinotti Prof. Luigi Di Stefano The Tracking problem Detection Object model, Track initiation, Track termination, Tracking

More information

A Theory of Shape by Space Carving Kiriakos N. Kutulakos èkyros@cs.rochester.eduè Department of Computer Science and Department of Dermatology, Computer Studies Building, University of Rochester, Rochester,

More information

More Local Structure Information for Make-Model Recognition

More Local Structure Information for Make-Model Recognition More Local Structure Information for Make-Model Recognition David Anthony Torres Dept. of Computer Science The University of California at San Diego La Jolla, CA 9093 Abstract An object classification

More information

Projection Center Calibration for a Co-located Projector Camera System

Projection Center Calibration for a Co-located Projector Camera System Projection Center Calibration for a Co-located Camera System Toshiyuki Amano Department of Computer and Communication Science Faculty of Systems Engineering, Wakayama University Sakaedani 930, Wakayama,

More information

Low-resolution Character Recognition by Video-based Super-resolution

Low-resolution Character Recognition by Video-based Super-resolution 2009 10th International Conference on Document Analysis and Recognition Low-resolution Character Recognition by Video-based Super-resolution Ataru Ohkura 1, Daisuke Deguchi 1, Tomokazu Takahashi 2, Ichiro

More information

A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms

A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms H. Kandil and A. Atwan Information Technology Department, Faculty of Computer and Information Sciences, Mansoura University,El-Gomhoria

More information

Part-Based Recognition

Part-Based Recognition Part-Based Recognition Benedict Brown CS597D, Fall 2003 Princeton University CS 597D, Part-Based Recognition p. 1/32 Introduction Many objects are made up of parts It s presumably easier to identify simple

More information

Robust NURBS Surface Fitting from Unorganized 3D Point Clouds for Infrastructure As-Built Modeling

Robust NURBS Surface Fitting from Unorganized 3D Point Clouds for Infrastructure As-Built Modeling 81 Robust NURBS Surface Fitting from Unorganized 3D Point Clouds for Infrastructure As-Built Modeling Andrey Dimitrov 1 and Mani Golparvar-Fard 2 1 Graduate Student, Depts of Civil Eng and Engineering

More information

False alarm in outdoor environments

False alarm in outdoor environments Accepted 1.0 Savantic letter 1(6) False alarm in outdoor environments Accepted 1.0 Savantic letter 2(6) Table of contents Revision history 3 References 3 1 Introduction 4 2 Pre-processing 4 3 Detection,

More information

Face Recognition in Low-resolution Images by Using Local Zernike Moments

Face Recognition in Low-resolution Images by Using Local Zernike Moments Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie

More information

The Advantages of Using a Fixed Stereo Vision sensor

The Advantages of Using a Fixed Stereo Vision sensor Proc. of International Conference on Industrial & Engineering Applications of Artificial Intelligence & Expert Systems (IEA/AIE), 2005 Real-Time People Localization and Tracking through Fixed Stereo Vision

More information

REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4

REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4 REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4 S. Heymann, A. Smolic, K. Mueller, Y. Guo, J. Rurainsky, P. Eisert, T. Wiegand Fraunhofer Institute

More information

3-D Object recognition from point clouds

3-D Object recognition from point clouds 3-D Object recognition from point clouds Dr. Bingcai Zhang, Engineering Fellow William Smith, Principal Engineer Dr. Stewart Walker, Director BAE Systems Geospatial exploitation Products 10920 Technology

More information

An Approach for Utility Pole Recognition in Real Conditions

An Approach for Utility Pole Recognition in Real Conditions 6th Pacific-Rim Symposium on Image and Video Technology 1st PSIVT Workshop on Quality Assessment and Control by Image and Video Analysis An Approach for Utility Pole Recognition in Real Conditions Barranco

More information

Vision based Vehicle Tracking using a high angle camera

Vision based Vehicle Tracking using a high angle camera Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu gramos@clemson.edu dshu@clemson.edu Abstract A vehicle tracking and grouping algorithm is presented in this work

More information

A typical 3D modeling system involves the phases of 1. Individual range image acquisition from different viewpoints.

A typical 3D modeling system involves the phases of 1. Individual range image acquisition from different viewpoints. Efficient Model Creation of Large Structures based on Range Segmentation 2nd International Symposium on 3D Data Processing, Visualization & Transmission, September 2004, Thessaloniki, Greece. Ioannis Stamos

More information

Probabilistic Latent Semantic Analysis (plsa)

Probabilistic Latent Semantic Analysis (plsa) Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg Rainer.Lienhart@informatik.uni-augsburg.de www.multimedia-computing.{de,org} References

More information

Off-line Model Simplification for Interactive Rigid Body Dynamics Simulations Satyandra K. Gupta University of Maryland, College Park

Off-line Model Simplification for Interactive Rigid Body Dynamics Simulations Satyandra K. Gupta University of Maryland, College Park NSF GRANT # 0727380 NSF PROGRAM NAME: Engineering Design Off-line Model Simplification for Interactive Rigid Body Dynamics Simulations Satyandra K. Gupta University of Maryland, College Park Atul Thakur

More information

AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION

AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION Saurabh Asija 1, Rakesh Singh 2 1 Research Scholar (Computer Engineering Department), Punjabi University, Patiala. 2 Asst.

More information

Analecta Vol. 8, No. 2 ISSN 2064-7964

Analecta Vol. 8, No. 2 ISSN 2064-7964 EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,

More information

Colour Image Segmentation Technique for Screen Printing

Colour Image Segmentation Technique for Screen Printing 60 R.U. Hewage and D.U.J. Sonnadara Department of Physics, University of Colombo, Sri Lanka ABSTRACT Screen-printing is an industry with a large number of applications ranging from printing mobile phone

More information

INTRODUCTION TO RENDERING TECHNIQUES

INTRODUCTION TO RENDERING TECHNIQUES INTRODUCTION TO RENDERING TECHNIQUES 22 Mar. 212 Yanir Kleiman What is 3D Graphics? Why 3D? Draw one frame at a time Model only once X 24 frames per second Color / texture only once 15, frames for a feature

More information

An Adaptive Hierarchical Next-Best-View Algorithm for 3D Reconstruction of Indoor Scenes

An Adaptive Hierarchical Next-Best-View Algorithm for 3D Reconstruction of Indoor Scenes Technical Report TR06-003, Department of Computer Science, University of North Carolina at Chapel Hill, January 2006. An Adaptive Hierarchical Next-Best-View Algorithm for 3D Reconstruction of Indoor Scenes

More information