|
|
|
- Gervais Gardner
- 10 years ago
- Views:
Transcription
1 Shape from Shading: A Survey Ruo Zhang, Ping-Sing Tsai, James Edwin Cryer and Mubarak Shah Computer Vision Lab School of Computer Science University of Central Florida Orlando, FL [email protected] Abstract Since the rst shape-from-shading (SFS) technique was developed by Horn in the early 1970s, many dierent approaches have emerged. In this paper, six well-known SFS algorithms are implemented and compared. The performance of the algorithms was analyzed on synthetic images using mean and standard deviation of depth (Z) error, mean of surface gradient (p, q) error and CPU timing. Each algorithm works well for certain images, but performs poorly for others. In general, minimization approaches are more robust, while the other approaches are faster. The implementation of these algorithms in C, and images used inthispaper, are available by anonymous ftp under the pub=tech paper=survey directory at eustis:cs:ucf:edu ( ). These are also part of the electronic version of paper. Keywords: Shape from shading, analysis of algorithms, Lambertian model, survey of shape from shading algorithms. This work was supported by NSF grants CDA and CDA
2 2 1 Introduction Shape recovery is a classic problem in computer vision. The goal is to derive a 3-D scene description from one or more 2-D images. The recovered shape can be expressed in several ways: depth Z(x y), surface normal (n x n y n z ), surface gradient (p q), and surface slant,, and tilt,. The depth can be considered either as the relative distance from camera to surface points, or the relative surface height above the x-y plane. The surface normal is the orientation of a vector perpendicular to the tangent plane on the object surface. The surface gradient, (p q) ), is the rate of change of depth in the x and y directions. The surface slant,, and tilt,, are related to the surface normal as (n x n y n z ) = (l sin cos l sin sin l cos ), where l is the magnitude of the surface normal. In computer vision, the techniques to recover shape are called shape-from-x techniques, where X can be shading, stereo, motion, texture, etc. Shape-from-shading (SFS) deals with the recovery of shape from a gradual variation of shading in the image. Artists have long exploited lighting and shading to convey vivid illusions of depth in paintings. To solve the SFS problem, it is important to study how the images are formed. A simple model of image formation is the Lambertian model, in which the gray level at a pixel in the image depends on the light source direction and the surface normal. In SFS, given a gray level image, the aim is to recover the light source and the surface shape at each pixel in the image. However, real images do not always follow the Lambertian model. Even if we assume Lambertian reectance and known light source direction, and if the brightness can be described as a function of surface shape and light source direction, the problem is still not simple. This is because if the surface shape is described in terms of the surface normal, we have a linear equation with three unknowns, and if the surface shape is described in terms of the surface gradient, we have a non-linear equation with two unknowns. Therefore, nding a unique solution to SFS is dicult it requires additional constraints. This paper is about the comparison and performance analysis of SFS techniques. We have implemented six well-known SFS algorithms, and compared them in terms of timing (CPU time) and accuracy (mean and standard deviation of the depth error and mean of the surface gradient error), in order to analyze the advantages and disadvantages of these approaches.
3 3 Two synthetic images with two dierent light sources each and three real images were used in our experiments. We found that none of the algorithms has consistent performance for all images, since they work well for certain images, but perform poorly for others. In general, minimization approaches are more robust, while the other approaches are faster. The implementation of these algorithms in C, and images used in this paper, are available by anonymous ftp. 2 Literature Review Shading plays an important role in human perception of surface shape. Researchers in human vision have attempted to understand and simulate the mechanisms by which our eyes and brains actually use the shading information to recover the 3-D shapes. Ramachandran [45] demonstrated that the brain recovers the shape information not only by the shading, but also by the outlines, elementary features, and the visual system's knowledge of objects. The extraction of SFS by visual system is also strongly aected by stereoscopic processing. Barrow and Tenenbaum discovered that it is the line drawing of the shading pattern that seems to play a central role in the interpretation of shaded patterns [2]. Mingolla and Todd's study of human visual system based on the perception of solid shape [30] indicated that the traditional assumptions in SFS{Lambertian reectance, known light source direction, and local shape recovery{are not valid from psychology point of view. One can observe from the above discussion that human visual system uses SFS dierently than computer vision normally does. Recently, Horn, Szeliski and Yuille [19] discovered that some impossibly shaded images exist, which could not be shading images of any smooth surface under the assumption of uniform reectance properties and lighting. For this kind of image, SFS will not provide a correct solution, so it is necessary to detect impossibly shaded images. SFS techniques can be divided into four groups: minimization approaches, propagation approaches, local approaches and linear approaches (see gure 1). Minimization approaches obtain the solution by minimizing an energy function. Propagation approaches propagate the shape information from a set of surface points (e.g., singular points) to the whole image. Local approaches derive shape based on the assumption of surface type. Linear approaches
4 4 Minimization Propagation Local Linear Ikeuchi & Horn(81) Horn (70) Pentland (84) Pentland(88) Brooks & Horn(85) Rouy & Tourin(92) Lee & Rosefeld(85) Tsai & Shah (92) Frankot & Chellappa (88) Dupuis & Oliensis(92) Horn(89) Kimmel & Bruckstein (92) Malik & Maydan(89) Bichsel& Pentland(92) Szeliski(91) Zheng & Chellappa(91) Lee & Kuo(91) Leclerc & Bobick(93) Vega &Yang(93) Figure 1: Four categories of shape from shading approaches. The number in parentheses besides authors' names is the year of publication. compute the solution based on the linearization of the reectance map. 2.1 Minimization Approaches One of the earlier minimization approaches, which recovered the surface gradients, was by Ikeuchi and Horn [21]. Since each surface point has two unknowns for the surface gradient and each pixel in the image provides one gray value, we have an underdetermined system. To overcome this, they introduced two constraints: the brightness constraint and the smoothness constraint. The brightness constraint requires that the reconstructed shape produce the same brightness as the input image at each surface point, while the smoothness constraint ensures a smooth surface reconstruction. The shape was computed by minimizing an energy function which consists of the abovetwo constraints. To ensure a correct convergence, the shape at the occluding boundary was given for the initialization. Since the gradient atthe the occluding boundary has at least one innite component, stereographic projection was used to transform the error function to a dierent space. Also using these two constraints, Brooks and Horn [5] minimized the same energy function, in terms of the surface normal. Frankot and Chellappa [11] enforced integrability in Brooks and Horn's algorithm in order to recover integrable
5 5 surfaces (surfaces for which z xy = z yx ). Surface slope estimates from the iterativescheme were expressed in terms of a linear combination of a nite set of orthogonal Fourier basis functions. The enforcement of integrability was done by projecting the nonintegrable surface slope estimates onto the nearest (in terms of distance) integrable surface slopes. This projection was fullled by nding the closest set of coecients which satisfy integrabilityinthe linear combination. Their results showed improvements in both accuracy and eciency over Brooks and Horn's algorithm [5]. Later, Horn also [18] replaced the smoothness constraint in his approach with an integrability constraint. The major problem with Horn's method is its slow convergence. Szeliski [48] sped it up using a hierarchical basis pre-conditioned conjugate gradient descent algorithm. Based on the geometrical interpretation of Brooks and Horn's algorithm, Vega and Yang [51] applied heuristics to the variational approach in an attempt to improve thestability of Brooks and Horn's algorithm. Instead of the smoothness constraint, Zheng and Chellappa [54] introduced an intensity gradient constraint, which species that the intensity gradients of the reconstructed image and the input image are close to each other in both the x and y directions. All of the above techniques use variational calculus. Leclerc and Bobick [25] solved directly for depth by using a discrete formulation and employing a conjugate gradient technique. The brightness constraint and smoothness constraint were applied to ensure convergence, and a stereo depth map was used as an initial estimate. Recently, Lee and Kuo [28] also proposed an approach to recover depth using the brightness and the smoothness constraint. They approximated surfaces by a union of triangular patches. This approach did not require the depth initialization. The approaches described so far deal with a single smooth surface. Malik and Maydan [29] developed a solution for piecewise smooth surfaces. They combined the line drawing and shading constraints in an energy function, and recovered both surface normal and line labeling through the minimization of the energy function. 2.2 Propagation Approaches Horn's characteristic strip method [17] is essentially a propagation method. Acharacteristic strip is a line in the image along which the surface depth and orientation can be computed if these quantities are known at the starting point of the line. Horn's method constructs
6 6 initial surface curves around the neighborhoods of singular points (singular points are the points with maximum intensity) using a spherical approximation. The shape information is propagated simultaneously along the characteristic strips outwards, assuming no crossover of adjacent strips. The direction of characteristic strips is identied as the direction of intensity gradients. In order to get a dense shape map, new strips have to be interpolated when neighboring strips are not close to each other. Rouy and Tourin [46] presented a solution to SFS based on Hamilton-Jacobi-Bellman equations and viscosity solutions theories in order to obtain a unique solution. A link between viscosity solutions and optimal control theories was given via dynamic programming. Moreover, conditions for the existence of both continuous and smooth solutions were provided. Oliensis [33] observed that the surface shape can be reconstructed from singular points instead of the occluding boundary. Based on this idea, Dupuis and Oliensis [9, 34] formulated SFS as an optimal control problem, and solved it using numerical methods. Bichsel and Pentland [3] simplied Dupuis and Oliensis's approach and proposed a minimum downhill approach for SFS which converged in less than ten iterations. Similar to Horn's, and Dupuis and Oliensis's approaches, Kimmel and Bruckstein [23] reconstructed the surface through layers of equal heightcontours from an initial closed curve. Their method applied techniques in dierential geometry, uid dynamics, and numerical analysis, which enabled the good recovery of non-smooth surfaces. The algorithm used a closed curve in the areas of singular points for initialization. 2.3 Local Approaches Pentland's local approach [37] recovered shape information from the intensity, and its rst and second derivatives. He used the assumption that the surface is locally spherical at each point. Under the same spherical assumption, Lee and Rosenfeld [26] computed the slant and tilt of the surface in the light source coordinate system using the rst derivative of the intensity. The approaches by Pentland, and Tsai and Shah are linear approaches, which linearize the reectance map and solve for shape.
7 7 2.4 Linear Approaches Pentland [38] used the linear approximation of the reectance function in terms of the surface gradient, and applied a Fourier transform to the linear function to get a closed form solution for the depth at each point. Tsai and Shah [50] applied the discrete approximation of the gradient rst, then employed the linear approximation of the reectance function in terms of the depth directly. Their algorithm recovered the depth at each point using a Jacobi iterative scheme. Interreections None of the above methods deals with interreections { the mutual illumination between surface facets. Nayar, Ikeuchi, and Kanade [31, 32] addressed the shape-from-interreection problem using photometric stereo. They observed that the erroneous shape extracted by shape-from-photometric-stereo algorithms in the presence of interreections was shallower than the real shape. Therefore, they proposed a method to iteratively rene the shape. Similar formulation of interreection was also discussed by Forsyth and Zisserman [10]. Convergence, Uniqueness and Existence Little work has been done on proving the uniqueness or existence of a solution to SFS. The uniqueness of SFS can be proven under the condition that the light source direction is equal to, or symmetric around, the viewing direction [33]. With an initial known curve, the method of characteristic strips yields a unique solution if the rst derivative of surface depth is continuous. For general cases, the uniqueness is unknown. However, Lee and Kuo [28] showed that, given the depth at a reference point, the addition of the smoothness constraint and successive linearization of the reectance map (based on the local gradients obtained from the previous iteration) provides a unique solution for their approach, in most cases. Blake & Zisserman, Bruss, and Rouy & Tourin also provided sucient conditions for uniqueness [4, 6, 46]. Their conditions contain either the singular point or the occluding boundary. If we consider local uniqueness instead of global uniqueness over the entire image, the uniqueness of a solution can be easily determined at singular points, and occluding boundaries, provided that the reectance map is given. These are the points at which we can
8 8 determine the surface orientation directly from the image brightness. The brightness pattern in any arbitrary region could arise from an innite number of dierent surfaces. However, the information at singular points and at occluding boundaries can be used to constrain the possible solutions. The organization of the paper The organization of the remainder of this paper is as follows: The next section introduces background knowledge related to reectance models. Section four describes six SFS approaches studied in detail in this paper. Section ve is devoted to the description of the synthetic and real images used in this study. We present our experimental results for six dierent SFS algorithms in section six. The error analysis is presented in section seven. Section eight summarizes the CPU timing of all six algorithms for dierent images. Finally, conclusions and future research are covered in section nine. 3 Reectance Models Depending on their physical properties, surfaces can be categorized as pure Lambertian, pure specular, hybrid, or more sophisticated surfaces. In this section, we will describe the reectance models and discuss their properties related to shape from shading. 3.1 Lambertian and Specular Reectance Models Lambertian surfaces are surfaces having only diuse reectance, i.e., surfaces which reect light in all directions. The brightness of a Lambertian surface is proportional to the energy of the incident light. The amount oflight energy falling on a surface element is proportional to the area of the surface element as seen from the light source position (the foreshortened area). The foreshortened area is a cosine function of the angle between the surface orientation and the light source direction. Therefore, the Lambertian surface can be modeled as the product of the strength of the light source A, the albedo of the surface, and the foreshortened area cos i as follows: I L = R = A cos i (1)
9 9 N S θ i Figure 2: Lambertian Reection Geometry. where R is the reectance map and i is the angle between the surface normal N ~ =(n x n y n z ) and the source direction S ~ =(s x s y s z ) (See Figure 2). If we let the surface normal and the light source direction both be unit vectors, we can rewrite the above formula as: I L = A N ~ S ~ (2) where '' represents dot product. Specularity only occurs when the incident angle of the light source is equal to the reected angle. It is formed by two components: the specular spike and the specular lobe. The specular spike is zero in all directions except for a very narrow range around the direction of specular reection. The specular lobe spreads around the direction of specular reection. The simplest model for specular reection is described by the following delta function: I S = B( s ; 2 r ) (3) where I S is the specular brightness, B is the strength of the specular component, s is the angle between the light source direction and the viewing direction, and r is the angle between the surface normal and the viewing direction. This model assumes that the highlight caused by specular reection is only a single point, but in real life this assumption is not true. Another model developed by Phong [40] represents the specular component of reection as powers of the cosine of the angle between the perfect specular direction and the viewing direction. This model is capable of predicting specularities which extend beyond
10 10 H N S V ( Viewer ) α θ r θ s Figure 3: Specular Reection Geometry. a single point however, the parameters have no physical meaning. A more rened model, the Torrance-Sparrow model [49], assumes that a surface is composed of small, randomly oriented, mirror-like facets. It describes the specular brightness as the product of four components: energy of incident light, Fresnel coecient, facet orientation distribution function, and geometrical attenuation factor adjusted for foreshortening. On the basis of the Torrance- Sparrow model, Healey and Binford [15] derived a simplied model by using the Gaussian distribution as the facet orientation function, and considering the other components as constants. It can be described as: I S = Ke ;( m )2 (4) where K is a constant, is the angle between the surface normal N ~ and the bisector H of the viewing direction and source direction, and m indicates the surface roughness (Figure 3). Most surfaces in the real world are neither purely Lambertian, nor purely specular, they are a combination of both. That is, they are hybrid surfaces. One straightforward equation for a hybrid surface is: I =(1;!)I L +!I S (5) where I is the total brightness for the hybrid surface, I S, I L are the specular brightness and Lambertian brightness respectively, and! is the weight of the specular component. Nayar, Ikeuchi and Kanade [32] proposed a reectance model which consists of three
11 11 components: diuse lobe, specular lobe, and specular spike. The Lambertian model was used to represent the diuse lobe, the specular component of the Torrance-Sparrow model was used to model the specular lobe, and the spike component of the Beckmann-Spizzichino model was used to describe the specular spike. The resulting hybrid model is given as: I = K dl cos i + K sl e ; K ss ( i ; r )( r ): (6) where K dl, K sl and K ss are the strengths of the three components, is the angle between the surface normal of a micro-facet on a patch and the mean normal of this surface patch, and is its standard derivation. If we consider the surface normal being in the Z direction, then, ( i i ) is the direction of incidence light interms of the slant and tilt in 3-D, ( r r ) is the direction of reected light. Although the Lambertian model is widely used because of its simplicity, it is a poor approximation to the diuse component of rough surfaces. See [20, 24, 7, 35] for more sophisticated approaches. 4 Selected Shape from Shading Algorithms Most SFS algorithms assume that the light source direction is known. In the case of the unknown light source direction, there are algorithms [36, 26, 54] which can estimate the light source direction without the knowledge of the surface shape. However, some assumptions about the surface shape are required, such as the local spherical surface, uniform and isotropic distribution of the surface orientation. Once the light source direction is known, 3-D shape can be estimated. We have implemented two minimization, one propagation, one local, and two linear methods. Ease of nding/making an implementation was an important criterion. In addition, this selection was guided by several other reasons. First, we have attempted to focus on more recent algorithms. Since authors improve their own algorithms or others' algorithms, new algorithms, in general, perform better than older algorithms. Second, some papers deal with theoretical issues related to SFS, but do not provide any particular algorithm. Such papers have not been dealt with in detail in this paper. Third, some approaches combine shape from shading with stereo, or line drawings, etc. We have mentioned such approaches in the
12 12 review section, for the sake of completeness, but have not discussed them in detail. Finally, since papers related to interreection and specularity consider special and more complex situations of shape from shading, they have not been dealt with in detail. 4.1 Minimization Approaches Minimization approaches compute the solution which minimizes an energy function over the entire image. The function can involve the brightness constraint, and other constraints, such as the smoothness constraint, the integrability constraint, the gradient constraint, and the unit normal constraint. In this subsection, rst, we briey describe these constraints, and then discuss SFS methods which use these constraints. The Brightness constraint is derived directly from the image irradiance (Equation 2). It indicates the total brightness error of the reconstructed image compared with the input image, and is given by ZZ (I ; R) 2 dx dy (7) where I is the measured intensity, and R is the estimated reectance map. The Smoothness constraint ensures a smooth surface in order to stabilize the convergence to auniquesolution, and is given by ZZ (p 2 x + p 2 y + q 2 x + q 2 y) dx dy (8) here p and q are surface gradients along the x and y directions. Another version of the smoothness term is less restrictive by requiring constant change of depth only in x and y directions: ZZ (p 2 x + q 2 y) dx dy: (9) The smoothness constraint can also be described in terms of the surface normal ~ N: ZZ (k ~ N x k 2 + k ~ N y k 2 ) dx dy: (10) This means that the surface normal should change gradually. The Integrability constraint ensures valid surfaces, that is, Z x y = Z y x. It can be described by either ZZ (p y ; q x ) 2 dx dy (11)
13 13 or ZZ ((Z x ; p) 2 +(Z y ; q) 2 )) dx dy: (12) The Intensity Gradient constraint requires that the intensity gradient of the reconstructed image be close to the intensity gradient of the input image in both the x and y directions: ZZ ((R x ; I x ) 2 +(R y ; I y ) 2 ) dx dy: (13) The Unit Normal constraint forces the recovered surface normals to be unit vectors: ZZ (k ~ Nk 2 ; 1) dx dy: (14) Zheng and Chellappa (91) Zheng and Chellappa [54] applied the intensity gradient constraint, instead of a smoothness constraint (used by Ikeuchi and Horn [21], Brooks and Horn [5], and Horn [18]), therefore, their energy function became: ZZ ((I ; R) 2 +((R x ; I x ) 2 +(R y ; I y ) 2 )+((Z x ; p) 2 +(Z y ; q) 2 )) dx dy: (15) The minimization of the above function was done through variational calculus. In variational calculus, the Euler equations, which are dierential equations, are rst computed, and then solved discretely to minimize the energy function. Zheng and Chellappa simplied the Euler equations by taking the rst-order Taylor series of the reectance map and representing the depth, gradient and their derivatives in discrete form. Then, a new iterative scheme was derived, which updates depth and gradients simultaneously. The algorithm was implemented using a hierarchical structure (pyramid) in order to speed up the computation. There was no special requirement for the initialization of the boundary. The initial values for both depth and gradient can be zero. The implementation of Zheng and Chellappa's method is very straightforward. We used the forward dierence approximation to compute the partial derivatives [54]. For the boundary points, where the forward approximation could not be applied, we switched to the backward dierence approximation for the rst order partial derivatives and set the second order partial derivatives to zero. In all the experiments, we set to 1 as suggested by the authors [54].
14 Lee and Kuo (91) Lee and Kuo [28] used the brightness constraint and the smoothness constraint. In their approach, surfaces were approximated by the union of triangular surface patches. The vertices of the triangles were called nodal points, and only nodal depths were recovered. Depths at the pixels, which are not nodal points, were obtained through interpolation. For each triangular patch, the intensity of the triangle was taken as the average intensity of all pixels in the triangle, and the surface gradient of the triangle was approximated by the cross product of any two adjacent edges of the triangle. This established a relationship between the triangle's intensity and the depth at its three nodal points. Linearizing the reectance map in terms of the surface gradient (p q), a linear relationship between the intensity and depth at the nodal points was derived. The surface depths at the nodal points were computed using optimization. The optimization problem was reduced to the solution of a sparse linear system, and, a multigrid computational algorithm was applied. There was no initialization required. Lee and Kuo's algorithm was implemented using the V-cycle multigrid scheme to solve the linear system, as reported in [28]. We used Gauss-Seidel relaxation as the smoothing operator, and as the exact solver for the nest grid. Full-weighting restriction was applied to transfer the residual from ner grids to coarser grids, and bi-linear interpolation was applied to make the prolongation from the coarser grid to ner grids. The same template was used for the smoothness term as given in their paper. The nodal points in the nest grids were chosen to be the image pixels. Successive linearizations were done through a maximum of 10 successive iterations, and the number of V-cycles was set to 10 for the rst iteration, 2 for the second, 1forthe rest. The initial values for depth of the nest grid, and corrections for the coarser grids, were all zero. Since the algorithm does not work for light source direction (0 0 1), we used (0:001 0:001 1:0) as the input light source direction instead. We used smoothing factor of 2000, and computed the level of grids by L = log(m) ; 1, where M is the size of the image. Therefore, we have 7 levels for 256 by 256 images, and 6 levels for 128 by 128 images. The depth maps, after the rst iteration, contain more detail but have a smaller range. After 10 iterations, details are smoothed out, but the depth range is wider. This means that more iterations will provide more low frequency information,
15 15 which overtakes the high frequency information from the initial iterations. 4.2 Propagation Approaches Propagation approaches start from a single reference surface point, or a set of surface points where the shape either is known or can be uniquely determined (such as singular points), and propagate the shape information across the whole image. We discuss one algorithm in this section Bichsel and Pentland (92) Following the main idea of Dupuis and Oliensis, Bichsel and Pentland [3] developed an ecient minimum downhill approach which directly recovers depth and guarantees a continuous surface. Given initial values at the singular points (brightest points), the algorithm looks in eight discrete directions in the image and propagates the depth information away from the light source to ensure the proper termination of the process. Since slopes at the surface points in low brightness regions are close to zero for most directions (except the directions which form a very narrow angle with the illumination direction), the image was initially rotated to align the light source direction with one of the eight directions. The inverse rotation was performed on the resulting depth map in order to get the original orientation back. Assuming the constraint of parallel slope, the surface gradient, (p q), was pre-computed by taking the derivative of the reectance map with respect to q in the rotated coordinate system, setting it to zero, and then solving for p and q. The solutions for p and q were given by: q p = ;s xs z (1 ; R 2 )(R 2 ; s 2 y) (16) R 2 ; s 2 ; s 2 x y q = ps ys x ; s y s z : (17) R 2 ; s 2 y where (s x s y s z ) is the light source direction and R is the reectance map as previously dened. In the implementation of Bichsel and Pentland's algorithm the initial depth values for the singular points were assigned a xed positive value (55 in our case this number should be related to the maximum height of object), and the depth values for the other points
16 16 were initialized to large negative values (-1.0e10). Instead of computing the distance to the light source, only the local surface height is computed and maximized, in order to select the minimum downhill direction. This is based on the fact that the distance to the light source is a monotonically increasing function of the height when the angle between the light source direction and the optical axis (z-axis here) is less than 90 degrees. Height values are updated with a Gauss-Seidel iterative scheme and the convergence is accelerated by altering the direction of the pass at each iteration. 4.3 Local Approaches Local approaches derive the shape by assuming local surface type. They use the intensity derivative information and assume spherical surface. Here, we describe Lee and Rosenfeld's [26] approach Lee and Rosenfeld (85) Lee and Rosenfeld [26] approximated the local surface regions by spherical patches. The slant and tilt of the surface were rst computed in the light source coordinate, then transformed back to the viewer coordinate. They proved that the tilt of the surface normal could be obtained from: = arctan I y cos S ; I x sin S I x cos S cos S + I y cos S sin S (18) where I x and I y are intensity derivatives along the x and y directions, S is the slant ofthe light source, and S is the tilt of the light source. If the surface has uniform reectance, and if the reectance map is given by I = ~ N ~ S (Lambertian), then the brightest point has its surface normal pointing toward the light source, and the cosine value of surface slant can be obtained by the ratio of its intensity and. This approach is an improvement of Pentland's rst approach, since it involves only the rst derivatives of the intensity rather than the second derivatives. This makes it less sensitive to noise. However, the local spherical assumption of the surface limits its application. The major part in the implementation of Lee and Rosenfeld's algorithm is the rotation of the image from the viewer coordinates to the light source coordinates, and the computation
17 17 of the intensity gradient in the light source coordinates. parameters which needed to be determined. For this method, there are no 4.4 Linear Approaches Linear approaches reduce the non-linear problem into a linear through the linearization of the reectance map. The idea is based on the assumption that the lower order components in the reectance map dominate. Therefore, these algorithms only work well under this assumption Pentland (88) Pentland's second approach [38] used the linear approximation of the reectance map in p and q. The reectance function can be expressed as follows: I(x y) =R(p q) = cos S + p cos S sin S + q sin S sin S p1+p2 + q 2 (19) where S and S are respectively the slant and tilt of light source. By taking the Taylor series expansion of the reectance function about p = p 0, q = q 0, and ignoring the high order terms, we have I(x y) =R(p 0 q 0 )+(p ; p (p 0 q 0 )+(q ; q (p 0 q 0 ): (20) For Lambertian reectance, the above equation at p 0 = q 0 = 0, reduces to I(x y) = cos S + p cos S sin S + q sin S sin S : Next, Pentland takes the Fourier transform of both sides of the equation. Since the rst term on the right isa DC term, it can be dropped. Using Z(x y)! F Z(! 1! 2 )(;i! 1 Z(x y)! F Z(! 1! 2 )(;i! 2 ) (22) where F Z is the Fourier transform of Z(x y), we get: F I = F Z (! 1! 2 )(;i! 1 )cos S sin S + F Z (! 1! 2 )(;i! 2 ) sin S sin S (23)
18 18 where F I is the Fourier transform of the image I(x y). The depth map Z(x y) can be computed by rearranging the terms in the above equation, and then taking the inverse Fourier transform. This algorithm gives a non-iterative, closed-form solution using Fourier transform. The problem lies in the linear approximation of the reectance map, which causes trouble when the non-linear terms are large. As pointed out by Pentland, when the quadratic terms in the reectance map dominate, the \frequency doubling" occurs, in this case, the recovered shape will not be consistent with the illumination conditions. The implementation of Pentland's algorithm was done using the fast Fourier transform and the inverse fast Fourier transform (see equation 23) Tsai and Shah (92) Tsai and Shah [50] employed the discrete approximations of p and q using nite dierences in order to linearize the reectance map in terms of Z. The reectance function for Lambertian surfaces is: R(p i j q i j )= ;s xp i j ; s y q i j + s q z (24) 1+p 2 i j + q 2 i j Using the following discrete approximations for p and q, p = Z i j ; Z i;1 j and q = Z i j ; Z i j;1, thereectance equation can be rewritten as: 0=f(I i j Z i j Z i;1 j Z i j;1) =I i j ; R(Z i j ; Z i;1 j Z i j ; Z i j;1): (25) For a xed point (i j) andagiven image I, a linear approximation (Taylor series expansion up through the rst order terms) of the function f (Equation (25)) about a given depth map Z n;1 is: 0 = f (Ii j Zi j Zi;1 j Zi j;1) f (Ii j Z i j n;1 Z i;1 j n;1 Zn;1 i j;1 )+(Z i j ; Z i j @Zi;1 j f (Ii j Z i j n;1 Z i;1 j n;1 Zn;1 i j;1 )+(Z i j;1 ; Z n;1 i j;1 ) f (Ii j Z i j n;1 Z i;1 j n;1 Zn;1 i j;1 )+(Z i;1 j ; Z i;1 j j;1 f (Ii j Z i j n;1 Z i;1 j n;1 Zn;1 i j;1 ): (26) For an M by M image, there are M 2 such equations, which will form a linear system. This system can be solved easily using the Jacobi iterative scheme, which simplies equation (26) into the following equation: 0=f(Z i j ) f(z n;1 i j )+(Z i j ; Z n;1 i j ) d f(z n;1 ): (27) i j dz i j
19 19 Then for Z i j = Z n i j, the depth map at the n-th iteration, can be solved directly: The initial estimate of Z 0 i j Z n i j = Z n;1 i j + ;f(z n;1 d dzi j i j ) f(z n;1 i j ) : (28) is set to zero for all pixels, Gaussian smoothing is applied to the nal depth map to get a smoother result. Note that depth is computed here by a simple division without any matrix inverse (Equation 28). This is a simple but ecient algorithm. However, it has problems with self-shadows. The implementation of Tsai and Shah's algorithm is straightforward. The depth is updated iteratively using equation 28. However, special care is needed to avoid division by zero [50]. 5 Experimental Images It is very dicult to choose good test images for SFS algorithms. A good test image must match the assumptions of the algorithms, e.g. Lambertian reectance model, constant albedo value, and innite point source illumination. It is not dicult to satisfy these assumptions for synthetic images, but no real scene perfectly satises these conditions. In real images there will be errors to the extent that these assumptions are not matched. In this section, we describe the images chosen to test the SFS algorithms. 5.1 Synthetic Images The synthetic images were generated using true depth maps, or range data obtained from a laser range nder. We simply computed the surface gradient ) using the forward discrete approximation of the depth, Z, and generated shaded images using the Lambertian reectance model. There are at least two advantages of using synthetic images. First, we can generate shaded images with dierent light source directions for the same surface. Second, with the true depth information, we can compute the error and compare the performance. However, the disadvantage of using synthetic images is that performance on synthetic images can not be used reliably to predict performance on real images. In our study we used ve synthetic surfaces (Synthetic Vase, Sphere, Mozart, Penny and Sombrero), with three dierent light sources ((0 0 1), (1 0 1), (5 5 7)). These images
20 20 (a) (b) Figure 4: Depth maps for the Synthetic Images: (a) Synthetic Vase. (b) Mozart. are available through our web site. In this paper, we focus our discussion on Synthetic Vase and Mozart with two light sources (0 0 1) and (1 0 1). See [53] for detailed discussion on all ve images. The Synthetic Vase was generated using the formula provided by Ascher and Carter [1] as follow: Z(x y) = q f(y) 2 ; x 2 where f(y) = 0:15 ; 0:1 y (6y +1) 2 (y ; 1) 2 (3y ; 2) ;0:5 x 0:5 and 0:0 y 1:0 : This yields a maximum depth value of approximately In order to generate a depth map with the proper size and scale, we map the x and y ranges to [0, 127], and scale Z by a factor of 128. The depth map for Mozart was provided by Professor Kuo of USC. The depth maps for both synthetic surfaces are shown in Figure 4. The synthetic images generated from these surfaces, with light source directions (0 0 1), and (1 0 1) are shown in Figure 5. The images are either reduced or expanded to make their sizes power of 2.
21 21 (a) (b) (c) (d) Figure 5: Synthetic images generated using two dierent light sources: (a) Synthetic Vase (0 0 1). (b) Mozart (0 0 1). (c) Synthetic Vase (1 0 1). (d) Mozart (1 0 1). (a) (b) (c)) Figure 6: Real images: (a) Lenna. (b) Pepper. (c) Vase. 5.2 Real Images In our study we used ve real images (Vase, Lenna, Pepper, Mannequin and Vase). These images are available through our web site. In this paper, we focus our discussion on Lenna, Pepper and Vase, shown in Figure 6. See [53] for detailed discussion on all ve images. The light source direction for Vase was estimated by the Lee and Rosenfeld method, and the rest were provided with the images: Lenna: Light source direction is (1:5 0:866 1). Pepper: Light source direction is (0:766 0:642 1). Vase: Light source direction is (;0:939 1:867 1:0).
22 22 The Vase image was provided by Professor Woodham of UBC. The other two images were provided by Professor Kuo of USC. Note that the light source direction is one possible source of uncertainty in these algorithms. 6 Experimental Results It is dicult to chose the proper control parameters for each algorithm because they may vary from image to image. It is even harder to chose similar parameters for dierent algorithms. We manually selected parameters for each algorithm in order to obtain the best results. This causes the variation of parameters for dierent algorithms and dierent images. All algorithms were treated equally when tuning the parameters. With respect to the initial conditions for dierent algorithms, we tried to provide the same initial condition for all the algorithms in order to obtain a fair comparison. Therefore, only singular points (points with the highest gray level) were used in the initialization of Bichsel and Pentland's algorithm, since singular points can be automatically detected. The depth maps computed from both synthetic and real images using each algorithm are shown in Figures 7 through 18. Below we analyze the results using the 3-D plots of depth maps. 6.1 Zheng and Chellappa The results for synthetic images are shown in Figure 7, and the results for real images are shown in Figure 13. The basic shapes for Lenna and Pepper are recovered with enough details, however, some errors can be seen around the mouth and on the cheeks in Lenna. Their method also has a problem with light source (0 0 1), which will zero out most of the terms in the iterative equations ([54] page 688). In order to get reasonable results for the images with light source (0 0 1), we used (0:01 0:01 1) instead as the light source direction. Their results also showed some error along the light source direction.
23 Lee and Kuo The results for synthetic images are shown in Figure 8, and the results for real images are shown in Figure 14. It can be seen that the algorithm performs ok, even when the light source is from the side, except in the cases of Vase which create the self-shadows. The recovered surfaces are well outlined, but lack details and have a tendency to be oversmoothed. Although dierent smoothing factors can be used for dierent images in order to get the best results, small changes in the smoothing factor will not aect the results very much. 6.3 Bichsel and Pentland The results for synthetic images are shown in Figure 9, and the results for real images are shown in Figure 15. All results were obtained after 8 iterations. The algorithm provides the best results for the cases when the light source is on the side. However, the algorithm does not give good results for real images considered in this study except for Pepper. This, we think, is due to the inaccuracy of the initial singular points. The algorithm is very fast usually only 5 iterations are required to provide reasonable results. 6.4 Lee and Rosenfeld The results for synthetic images are shown in Figure 10, and the results for real images are shown in Figure 16. Their method estimates the depth using local spherical assumption and intensity derivatives. This makes the algorithm unsuitable for non-spherical surfaces, and very sensitive to noise, which is clear from the depth maps obtained for the real images and some synthetic images, such as Mozart. The intensity of the real images varies slightly, causing the depth estimation to falter, while the synthetic images yield good depth maps, due to the smooth surfaces. However, the results can be improved by pre-ltering the input images in order to reduce the noise. 6.5 Pentland The results for synthetic images are shown in Figure 11, and the results for real images are shown in Figure 17. Pentland's algorithm produces reasonable results on most surfaces
24 24 (a1) (b1) (a2) (b2) Figure 7: Results for Zheng and Chellappa's method on synthetic images: (a) Vase. (b) Mozart. (a1) and (b1) show the results for test images with light source (0 0 1). (a2) and (b2) show the results for test images with light source (1 0 1). where the reectance changes linearly with respect to the surface shape, even if the surface is naturally varying such as a person's face. However, this algorithm has diculty when the reectance changes in a non-linear manner. For real images, the algorithm produces adequate results for all images except for Vase. 6.6 Tsai and Shah The results for synthetic images are shown in Figure 12, and the results for real images are shown in Figure 18. From Figure 12, we can clearly see that their method works well on smooth objects with the light source close to the viewing direction. However, it is sensitive to the noise, such as the black hole on the nose of Mozart image or the shadow areas. The results for real images are reasonable for Vase, but noisy for Lenna and Pepper, especially in the top and bottom regions of Pepper, and the nose, eyes, and hat regions of Lenna. 7 Error Analysis In the previous section, we reported results for synthetic and real images, and qualitatively analyzed the results by considering the 3-D plots of the depth map. In this section, we will quantitatively analyze the results for the synthetic images for which the true depth maps are available to be used in error measurements. There are several ways to report the error
25 25 (a21 (b1) (a2) (b2) Figure 8: Results for Lee and Kuo's method on synthetic images: (a) Vase. (b) Mozart. (a1) and (b1) show the results for test images with light source (0 0 1). (a2) and (b2) show the results for test images with light source (1 0 1). (a1) (b1) (a2) (b2) Figure 9: Results for Bichsel and Pentland's method on synthetic images: (b) Vase. (c) Mozart. (a1) and (b1) show the results for test images with light source (0 0 1). (a2) and (b2) show the results for test images with light source (1 0 1). (a1) (b1) (a2) (b2) Figure 10: Results for Lee and Rosenfeld's method on synthetic images: (a) Vase. (b) Mozart. (a1) and (b1) show the results for test images with light source (0 0 1). (a2) and (b2) show the results for test images with light source (1 0 1).
26 26 (a1) (b1) (a2) (b2) Figure 11: Results for Pentland's method on synthetic images: (b) Vase. (c) Mozart. (a1) and (b1) show the results for test images with light source (0 0 1). (a2) and (b2) show the results for test images with light source (1 0 1). (a1) (b1) (a2) (b2) Figure 12: Results for Tsai and Shah's method on synthetic images: (a) Vase. (b) Mozart. (a1) and (b1) show the results for test images with light source (0 0 1). (a2) and (b2) show the results for test images with light source (1 0 1). (a) (b) (c) Figure 13: Results for Zheng and Chellappa's method on real images: (a) Lenna. (b) Pepper. (c) Vase.
27 27 (a) (b) (c) Figure 14: Results for Lee and Kuo's method on real images: (a) Lenna. (b) Pepper. (c) Vase. (a) (b) (c) Figure 15: Results for Bichsel and Pentland's method on real images: (a) Lenna. (b) Pepper. (c) Vase. (a) (b) (c) Figure 16: Results for Lee and Rosenfeld's method on real images: (a) Lenna. (b) Pepper. (c) Vase.
28 28 (a) (b) (c) Figure 17: Results for Pentland's method on real images: (a) Lenna. (b) Pepper. (c) Vase. (a) (b) (c) Figure 18: Results for Tsai and Shah's method on real images: (a) Lenna. (b) Pepper. (c) Vase.
29 29 behavior. We use the following two: Mean and standard deviation of depth error (Tables 1 and 2). For each algorithm, we compared the recovered depth with the true depth from the range image. The output depth from each algorithm was rst normalized according to the true depth, then compared with the true depth for mean and standard deviation of depth error. Mean gradient error (Table 3). This indicates the error in the surface orientation. The standard deviation is not used here, since it does not have any physical meaning. The forward discrete approximation was used to compute the gradient from the recovered depth. For those algorithms which compute the surface gradient together with the depth, we still use the discrete approximation of the depth to calculate the surface gradient in the gradient error table, in order to be consistent with the other algorithms. There are several ways to analyze the error tables 1{3. One way to is to consider for each algorithm the sum of errors for four images, and rank algorithms based on this measure. According to the sum of average error the ranking of algorithms (table 4) is as follows: Lee and Kuo, Zheng and Chellappa, Bichsel and Pentland, Pentland, Lee and Rosenfeld, and Tsai and Shah. We have also tried to rank these algorithms based on sum of standard deviation of depth errors, sum of gradient (p-q) errors, and nally based on their performance on each image, the ranking is pretty much similar to the above ranking. 8 Timing CPU timing (Table 5) is computed on a SUN SPARC 4.Disk I/O time is not included, and only computational time is considered. All the synthetic images are 128 by 128. All the real images are 256 by 256. From the timing table (Table 5), we see that the minimization approaches are signicantly slower than other approaches. For the minimization approaches, time depends not only on the size of the input image, but also varies from scene to scene. For the other approaches, time depends only on the size of the input image. The order of the algorithms according to CPU time, from the slowest to the fastest, is Lee & Kuo's algorithm
30 30 Table 1: Average Z error for synthetic images. Images Methods Vase Mozart S1 S2 S1 S2 Zheng & Chellappa Lee & Kuo Bichsel & Pentland Lee &Rosenfeld Pentland Tsai & Shah then followed by Zheng & Chellappa's, Bichsel & Pentland's, Pentland's, Lee & Rosenfeld's, and Tsai & Shah. 9 Conclusions and Future Research SFS techniques recover the 3-D description of an object from a single view of the object. In this paper, we discussed several existing algorithms and grouped them into four dierent categories: Minimization techniques, propagation techniques, local techniques, and linear techniques. These groupings are based on the conceptual dierences among the algorithms. Six representative algorithms were implemented in order to compare their performance in terms of accuracy and time. This comparison was carried out on two synthetic surfaces, each was used to generate two synthetic images using dierent light source directions. To analyze the accuracy, the output for the synthetic images were compared with the true surface shapes and the results of comparison were shown in the forms of the average depth error, the average gradient error and the standard deviation of depth error. The output for real images was only analyzed and compared visually. The conclusion drawn from the analysis are as follows: 1. all the SFS algorithms produce generally poor results when given synthetic data, 2. results are even worse on real images, and
31 31 Table 2: Standard deviation of Z error for synthetic images. Images Methods Vase Mozart S1 S2 S1 S2 Zheng&Chellappa Lee &Kuo Bichsel & Pentland Lee &Rosenfeld Pentland Tsai & Shah S1, and S2 stand for two dierent light sources, (0 0 1), and (1 0 1). Table 3: Average p-q error for synthetic images. Images Methods Vase Mozart S1 S2 S1 S2 Zheng & Chellappa Lee &Kuo Bichsel & Pentland Lee &Rosenfeld Pentland Tsai & Shah
32 32 Table 4: Ranking based on sum of average depth errors (using Table 1) for synthetic images. Methods Sum of error ranking Zheng & Chellappa Lee & Kuo Bichsel & Pentland Lee &Rosenfeld Pentland Tsai & Shah Table 5: CPU time (in seconds) for synthetic images. Images Methods Vase Mozart S1 S2 S1 S2 Zheng & Chellappa Lee & Kuo Bichsel & Pentland Lee & Rosenfeld Pentland Tsai & Shah S1 and S2 stand for two dierent light sources, (0 0 1) and (1 0 1).
33 33 3. results on synthetic data are not generally predictive of results on real data. There are several possible directions for future research. As we noted, reectance models used in SFS methods are too simplistic recently, more sophisticated models have been proposed ([20, 24, 7, 35]). This not only includes more accurate models for Lambertian, specular, and hybrid reectance, but also includes replacing the assumption of orthographic projection with perspective projection, which is a more realistic model of cameras in the real world. The traditional simplication of lighting conditions, assuming an innite point light source, can also be eliminated by either assuming a non-innite point light source, or simulating lighting conditions using a set of point sources. This trend will continue. SFS methods employing more sophisticated models will be developed to provide more accurate, and realistic, results. Another direction is the combination of shading with some other cues. One can use the results of stereo or range data to improve the results of SFS (such as [25] and [41]), or use the results of SFS or range data to improve theresults of stereo. A dierent approach is to directly combine results from shading and stereo (such as [8]). Multiple images can also be employed by moving either the viewer (as in [16]) or the light source (as in [52]) in order to successively rene the shape. The successive renement can improve the quality of estimates by combining estimates between image frames, and reduce the computation time since the estimates from the previous frame can be used as the initial values for the next frame, which may be closer to the correct solution. By using successive renement, the process can be easily started at any frame, stopped at any frame, and restarted if new frames become available. The advantage of moving the light source over moving the viewer is the elimination of the mapping of the depth map (warping) between image frames. One problem with SFS is that the shape information in the shadow areas is not recovered, since shadow areas do not provide enough intensity information. This can be solved if we make use of the information available from shape-from-shadow (shape-from-darkness) [12, 13, 14, 22, 43, 44, 47] and combine it with the results from SFS. The depth values on the shadow boundaries from SFS can be used either as the initial values for shape-from-shadow, or as constraints for the shape-from-shadow algorithm. In the case of multiple image frames, the information recovered from shadow in the previous frame can also be used for SFS in
34 34 the next frame. Acknowledgments Thanks to Dr. Bichsel, Prof. Dupuis, Dr. Lee, Prof. Healey, Prof. Kuo, and Dr. Zheng for their helpful discussions.
35 35 References [1] U. M. Ascher and P. M. Carter. In SIAM J. Numer. Anal., Vol. 30, No. 1, pages 102{115, Feb., [2] H. G. Barrow and J. M. Tenenbaum. Retrospective on \Interpreting line drawings as three-dimensional surfaces". Articial Intelligence, Vol. 59, pages 71{80, [3] M. Bichsel and A. P. Pentland. A simple algorithm for shape from shading. In IEEE Proceedings of Computer Vision and Pattern Recognition, pages 459{465, [4] A. Blake and A. Zisserman. Surface descriptions from stereo and shading. Image and Vision Computing, 3(2):183{191, [5] M. J. Brooks and B. K. P. Horn. Shape and source from shading. In Proceedings of International Joint Conference on Articial Intelligence, pages932{936, [6] A. R. Bruss. The Eikonal equation: Some results applicable to computer vision. Journal of Mathematical Physics, 23(5):890{896, [7] J. J. Clark. Active photometric stereo. In IEEE Proceedings of Computer Vision and Pattern Recognition, pages 29{34, [8] J. Cryer, P. Tsai, and M. Shah. Integration of shape from x modules: Combining stereo and shading. In IEEE Proceedings of Computer Vision and Pattern Recognition, pages 720{721, June [9] P. Dupuis and J. Oliensis. Direct method for reconstructing shape from shading. In IEEE Proceedings of Computer Vision and Pattern Recognition, pages453{458, [10] D. Forsyth and A. Zisserman. Mutual illumination. In IEEE Proceedings of Computer Vision and Pattern Recognition, pages 466{473, [11] R. T. Frankot and R. Chellappa. A method for enforcing integrability in shape from shading algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 10:439{451, 1988.
36 36 [12] L. N. Hambrick and M. H. Loew. Entry-Exit Method of Shadow Boundary Segmentation. IEEE Proceedings of Computer Vision and Pattern Recognition, 656{658, [13] M. Hatzitheodorou and J. R. Kender. An Optimal Algorithm for the Derivation of Shape from Shadows. IEEE Proceedings of Computer Vision and Pattern Recognition, 486{491, [14] M. Hatzitheodorou. The Derivation of 3-D Surface Shape from Shadows. Image Understanding Workshop, 1012{1020, [15] G. Healey and T. O. Binford. Local shape from specularity. Computer Vision, Graphics, and Image Processing, 42:62{86, [16] J. Heel. Dynamic motion vision. Robotics and Autonomous Systems, 6(1), [17] B. K. P. Horn. Shape from Shading: A Method for Obtaining the Shape of a Smooth Opaque Object from One View. PhD thesis, MIT, [18] B. K. P. Horn. Height and gradient from shading. International Journal of Computer Vision, pages 37{75, [19] B. K. P. Horn, R. S. Szeliski, and A. L. Yuille. Impossible shaded images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(2):166{170, [20] D. R. Hougen and N. Ahuja. Estimation of the light source distribution and its use in integrated shape recovery from stereo and shading. In Proceedings of International Conference on Computer Vision, pages 29{34, [21] K. Ikeuchi and B.K.P. Horn. Numerical shape from shading and occluding boundaries. Articial Intelligence, 17(1-3):141{184, [22] C. Jiang and M. O. Ward. Shadow Identication. IEEE Proceedings of Computer Vision and Pattern Recognition, 606{612, [23] R. Kimmel and A. M. Bruckstein. Shape from shading via Level Sets. Israel Institute of Technology, CIS Report 9209, 1992.
37 37 [24] M. S. Langer and S. W. Zucker. Diuse shading, visibility elds, and the geometry of ambient light. In Proceedings of International Conference on Computer Vision, pages 29{34, [25] Y. G. Leclerc and A. F. Bobick. The direct computation of height from shading. In IEEE Proceedings of Computer Vision and Pattern Recognition, pages552{558, [26] C.H. Lee and A. Rosenfeld. Improved methods of estimating shape from shading using the light source coordinate system. Articial Intelligence, 26:125{143, [27] D. Lee. Aprovably convergent algorithm for shape from shading. In IEEE Proceedings of Computer Vision and Pattern Recognition, pages 478{485, [28] K. M. Lee and C. C. J. Kuo. Shape from shading with a linear triangular element surface model. IEEE Transaction on Pattern Analysis and Machine Intelligence, 15(8):815{822, [29] J. Malik and D. Maydan. Recovering three dimensional shape from a single image of curved objects. IEEE Transactions on Pattern Analysis and Machine Intelligence, 11, [30] E. Mingolla and J. T. Todd. Perception of solid shape from shading. Biological Cybernetics, Vol. 53, pages , [31] S. K. Nayar, K. Ikeuchi and T. Kanade. Shape from interreections. In Proceedings of International Conference on Computer Vision, pages 1{11, [32] S. K. Nayar, K. Ikeuchi and T. Kanade. Surface reection: physical and geometrical perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(7):611{634, [33] J. Oliensis. Shape from shading as a partially well-constrained problem. Computer Vision, Graphics, and Image Processing: Image Understanding, 54:163{183, [34] J. Oliensis and P. Dupuis. A global algorithm for Shape from shading. In Proceedings of International Conference on Computer Vision, pages , 1993.
38 38 [35] M. Oren and S. K. Nayar. Diuse reectance from rough surfaces. In IEEE Proceedings of Computer Vision and Pattern Recognition, pages 763{764, [36] A. P. Pentland. Finding the illuminant direction. J. Optical Society of America, pages 448{455, [37] A. P. Pentland. Local shading analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6:170{187, [38] A. Pentland. Shape information from shading: a theory about human perception. In Proceedings of International Conference on Computer Vision, pages 404{413, [39] A. Pentland. Photometric motion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13:879{890, [40] B. T. Phong. Illumination for computer generated pictures. Comm. of ACM, 18:311{ 317, [41] H. Pien and J. Gauch. A variational approach to sensor fusion using registered range and intensity data. In SPIE OE/Aerospace Sensing, April [42] W. H. Press, B. P. Flannery, S. A. Teukolsky, and W. T. Vetterling. Numerical Recipes in C. Cambridge University Press, [43] D. Raviv and Y. H. Pao and K. A. Loparo. Segmentation Between Overlapping Parts: The Moving Shadows Approach. IEEE Transactions on Systems, Man, and Cybernetics, 19(4):880{883, [44] D. Raviv and Y. H. Pao and K. A. Loparo. Reconstruction of Three-Dimensional Surfaces from Two-Dimensional Binary Images. IEEE Transactions on Robotics and Automation, 5(5):701{710, [45] V. S. Ramachandran Perceiving shape from shading. Scientic American, 159:76{83, [46] E. Rouy and A. Tourin. A Viscosity solutions approach to shape from shading. SIAM Journal on Numerical Analysis, 29(3):867{884, 1992.
39 39 [47] S. A. Shafer and T. Kanade. Using Shadows in Finding Surface Orientations. Computer Vision, Graphics, and Image Processing, 22:145{176, [48] R. Szeliski. Fast shape from shading. Computer Vision, Graphics, Image Processing: Image Understanding, 53:129{153, [49] K. E. Torrance and E. M. Sparrow. Theory for o-specular reection from roughened surfaces. J. Of Optical Society Of America, 57:1105{1114, [50] P.S. Tsai and M. Shah. Shape from shading using linear approximation. Image and Vision Computing Journal, 12(8):487{498, [51] O. E. Vega and Y. H. Yang. Shading logic: a heuristic approach to recover shape from shading. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(6):592{ 597, [52] Zhang, R., Tsai, P., and Shah, M. Photomotion. Computer Vision and Image Understanding, 63(2), March [53] Zhang, R., Tsai, P., Cryer, J., and Shah, M. A survey of shape from shading methods. Technical Report CS-TR-97-15, Computer Science Department University of Central Florida, [54] Q. Zheng and R. Chellappa. Estimation of illuminant direction, albedo, and shape from shading. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(7):680{ 702, 1991.
40 40 Ruo Zhang received her Ph.D. in 1995 from University of Central Florida. Her research interests are shape from shading, shape from photomotion, software engineering, and computer networks. Currently she is working in the industry. Ping-Sing Tsai was born in Taipei, Taiwan, R.O.C.. He received the B.S. degree in information and computer engineering from Chung-Yuan Christian University in 1985, and his Ph.D. in computer science at the University of Central Florida in Dr. Tsai is currently working at Intel Corporation, Digital Imaging and Video Division, as a senior system engineer. James Cryer received his B.S and M.S. degrees from University of Central Florida. He participated in Research Experience for Undergraduates (REU) during His research interests are computer vision, image processing, and parallell processing. Mubarak Shah received his BE degree in 1979 in electronics from Dawood College of Engineering and Technology, Karachi, Pakistan. He spent 1980 at Philips International Institute of Technology, Eindhoven, The Netherlands, where he completed the EDE diploma. Dr. Shah received his MS and Ph.D. degrees both in computer engineering from Wayne State University, Detroit, Michigan, in 1982 and 1986, respectively. Since 1986, he has been with the University of Central Florida, where he is currently professor of computer science and the director of the Computer Vision Lab. Prof. Shah spent year at the Information Systems Division of Harris Corporation, working with their Video Research and Development group as a technical consultant. Dr. Shah has received the Teaching Incentive Program Award, IEEE Distinguished Visitors Program Speaker Award, IEEE Outstanding Engineering Educator Award, and two TOKTEN awards (in 1995 and 1997) by UNDP. Prof. Shah has published one book with Ramesh Jain (Motion-Based Recognition, Kluwer Academic Publishers) and over 70 research papers in refereed journals and conferences on topics including optical ow, structure from motion, motion-based recognition, gesture recognition, activity recognition, lipreading, snakes, object recognition, edge and contour detection, multisensor fusion, shape from shading and stereo, and hardware algorithms for computer vision. Prof. Shah is an associate editor of IEEE Transactions on PAMI and Pattern Recognition journal. He has served on the program committees, and chaired
41 41 sessions of several conferences. He was an organizer of "Workshop on Recent Advances in Computer Vision," held in Karachi, Pakistan, Jan 1-2, 1998, and "Workshop on Computer Vision" held in Islamabad, Pakistan, January 3{5, Both workshops were supported by grants from NSF. Prof. Shah has served as a project director for the national site for REU, Research Experience for Undergraduates in Computer Vision, funded by the National Science Foundation since He has received over 2 millions in grants from NSF, DARPA, Army and the State of Florida.
Feature Tracking and Optical Flow
02/09/12 Feature Tracking and Optical Flow Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Many slides adapted from Lana Lazebnik, Silvio Saverse, who in turn adapted slides from Steve
A Short Introduction to Computer Graphics
A Short Introduction to Computer Graphics Frédo Durand MIT Laboratory for Computer Science 1 Introduction Chapter I: Basics Although computer graphics is a vast field that encompasses almost any graphical
Subspace Analysis and Optimization for AAM Based Face Alignment
Subspace Analysis and Optimization for AAM Based Face Alignment Ming Zhao Chun Chen College of Computer Science Zhejiang University Hangzhou, 310027, P.R.China [email protected] Stan Z. Li Microsoft
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
Tracking Moving Objects In Video Sequences Yiwei Wang, Robert E. Van Dyck, and John F. Doherty Department of Electrical Engineering The Pennsylvania State University University Park, PA16802 Abstract{Object
An Iterative Image Registration Technique with an Application to Stereo Vision
An Iterative Image Registration Technique with an Application to Stereo Vision Bruce D. Lucas Takeo Kanade Computer Science Department Carnegie-Mellon University Pittsburgh, Pennsylvania 15213 Abstract
MA 323 Geometric Modelling Course Notes: Day 02 Model Construction Problem
MA 323 Geometric Modelling Course Notes: Day 02 Model Construction Problem David L. Finn November 30th, 2004 In the next few days, we will introduce some of the basic problems in geometric modelling, and
Reflection and Refraction
Equipment Reflection and Refraction Acrylic block set, plane-concave-convex universal mirror, cork board, cork board stand, pins, flashlight, protractor, ruler, mirror worksheet, rectangular block worksheet,
Common Core Unit Summary Grades 6 to 8
Common Core Unit Summary Grades 6 to 8 Grade 8: Unit 1: Congruence and Similarity- 8G1-8G5 rotations reflections and translations,( RRT=congruence) understand congruence of 2 d figures after RRT Dilations
Solving Simultaneous Equations and Matrices
Solving Simultaneous Equations and Matrices The following represents a systematic investigation for the steps used to solve two simultaneous linear equations in two unknowns. The motivation for considering
INTRODUCTION TO RENDERING TECHNIQUES
INTRODUCTION TO RENDERING TECHNIQUES 22 Mar. 212 Yanir Kleiman What is 3D Graphics? Why 3D? Draw one frame at a time Model only once X 24 frames per second Color / texture only once 15, frames for a feature
Monash University Clayton s School of Information Technology CSE3313 Computer Graphics Sample Exam Questions 2007
Monash University Clayton s School of Information Technology CSE3313 Computer Graphics Questions 2007 INSTRUCTIONS: Answer all questions. Spend approximately 1 minute per mark. Question 1 30 Marks Total
Mean-Shift Tracking with Random Sampling
1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of
Automatic Labeling of Lane Markings for Autonomous Vehicles
Automatic Labeling of Lane Markings for Autonomous Vehicles Jeffrey Kiske Stanford University 450 Serra Mall, Stanford, CA 94305 [email protected] 1. Introduction As autonomous vehicles become more popular,
1996 IFAC World Congress San Francisco, July 1996
1996 IFAC World Congress San Francisco, July 1996 TRAJECTORY GENERATION FOR A TOWED CABLE SYSTEM USING DIFFERENTIAL FLATNESS Richard M. Murray Mechanical Engineering, California Institute of Technology
Static Environment Recognition Using Omni-camera from a Moving Vehicle
Static Environment Recognition Using Omni-camera from a Moving Vehicle Teruko Yata, Chuck Thorpe Frank Dellaert The Robotics Institute Carnegie Mellon University Pittsburgh, PA 15213 USA College of Computing
Practice Final Math 122 Spring 12 Instructor: Jeff Lang
Practice Final Math Spring Instructor: Jeff Lang. Find the limit of the sequence a n = ln (n 5) ln (3n + 8). A) ln ( ) 3 B) ln C) ln ( ) 3 D) does not exist. Find the limit of the sequence a n = (ln n)6
DYNAMIC RANGE IMPROVEMENT THROUGH MULTIPLE EXPOSURES. Mark A. Robertson, Sean Borman, and Robert L. Stevenson
c 1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or
Introduction to Logistic Regression
OpenStax-CNX module: m42090 1 Introduction to Logistic Regression Dan Calderon This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 Abstract Gives introduction
11.1. Objectives. Component Form of a Vector. Component Form of a Vector. Component Form of a Vector. Vectors and the Geometry of Space
11 Vectors and the Geometry of Space 11.1 Vectors in the Plane Copyright Cengage Learning. All rights reserved. Copyright Cengage Learning. All rights reserved. 2 Objectives! Write the component form of
Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard
Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express
Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition
Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition 1. Image Pre-Processing - Pixel Brightness Transformation - Geometric Transformation - Image Denoising 1 1. Image Pre-Processing
Face Model Fitting on Low Resolution Images
Face Model Fitting on Low Resolution Images Xiaoming Liu Peter H. Tu Frederick W. Wheeler Visualization and Computer Vision Lab General Electric Global Research Center Niskayuna, NY, 1239, USA {liux,tu,wheeler}@research.ge.com
Highlight Removal by Illumination-Constrained Inpainting
Highlight Removal by Illumination-Constrained Inpainting Ping Tan Stephen Lin Long Quan Heung-Yeung Shum Microsoft Research, Asia Hong Kong University of Science and Technology Abstract We present a single-image
Texture. Chapter 7. 7.1 Introduction
Chapter 7 Texture 7.1 Introduction Texture plays an important role in many machine vision tasks such as surface inspection, scene classification, and surface orientation and shape determination. For example,
Geometry of Vectors. 1 Cartesian Coordinates. Carlo Tomasi
Geometry of Vectors Carlo Tomasi This note explores the geometric meaning of norm, inner product, orthogonality, and projection for vectors. For vectors in three-dimensional space, we also examine the
Three-Dimensional Data Recovery Using Image-Based Modeling
Three-Dimensional Data Recovery Using Image-Based Modeling Jeremy W. Cannon Jonathan C. Derryberry Vitaly Y. Kulikov [email protected] [email protected] [email protected] Abstract 6.837: Introduction to Computer
Epipolar Geometry. Readings: See Sections 10.1 and 15.6 of Forsyth and Ponce. Right Image. Left Image. e(p ) Epipolar Lines. e(q ) q R.
Epipolar Geometry We consider two perspective images of a scene as taken from a stereo pair of cameras (or equivalently, assume the scene is rigid and imaged with a single camera from two different locations).
2.2 Creaseness operator
2.2. Creaseness operator 31 2.2 Creaseness operator Antonio López, a member of our group, has studied for his PhD dissertation the differential operators described in this section [72]. He has compared
Accurate and robust image superresolution by neural processing of local image representations
Accurate and robust image superresolution by neural processing of local image representations Carlos Miravet 1,2 and Francisco B. Rodríguez 1 1 Grupo de Neurocomputación Biológica (GNB), Escuela Politécnica
Daniel F. DeMenthon and Larry S. Davis. Center for Automation Research. University of Maryland
Model-Based Object Pose in 25 Lines of Code Daniel F. DeMenthon and Larry S. Davis Computer Vision Laboratory Center for Automation Research University of Maryland College Park, MD 20742 Abstract In this
Path Tracking for a Miniature Robot
Path Tracking for a Miniature Robot By Martin Lundgren Excerpt from Master s thesis 003 Supervisor: Thomas Hellström Department of Computing Science Umeå University Sweden 1 Path Tracking Path tracking
Relating Vanishing Points to Catadioptric Camera Calibration
Relating Vanishing Points to Catadioptric Camera Calibration Wenting Duan* a, Hui Zhang b, Nigel M. Allinson a a Laboratory of Vision Engineering, University of Lincoln, Brayford Pool, Lincoln, U.K. LN6
Face detection is a process of localizing and extracting the face region from the
Chapter 4 FACE NORMALIZATION 4.1 INTRODUCTION Face detection is a process of localizing and extracting the face region from the background. The detected face varies in rotation, brightness, size, etc.
Basic Problem: Map a 3D object to a 2D display surface. Analogy - Taking a snapshot with a camera
3D Viewing Basic Problem: Map a 3D object to a 2D display surface Analogy - Taking a snapshot with a camera Synthetic camera virtual camera we can move to any location & orient in any way then create a
TESLA Report 2003-03
TESLA Report 23-3 A multigrid based 3D space-charge routine in the tracking code GPT Gisela Pöplau, Ursula van Rienen, Marieke de Loos and Bas van der Geer Institute of General Electrical Engineering,
Robot Perception Continued
Robot Perception Continued 1 Visual Perception Visual Odometry Reconstruction Recognition CS 685 11 Range Sensing strategies Active range sensors Ultrasound Laser range sensor Slides adopted from Siegwart
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
Section 1.1. Introduction to R n
The Calculus of Functions of Several Variables Section. Introduction to R n Calculus is the study of functional relationships and how related quantities change with each other. In your first exposure to
Advanced Computer Graphics. Rendering Equation. Matthias Teschner. Computer Science Department University of Freiburg
Advanced Computer Graphics Rendering Equation Matthias Teschner Computer Science Department University of Freiburg Outline rendering equation Monte Carlo integration sampling of random variables University
3. Interpolation. Closing the Gaps of Discretization... Beyond Polynomials
3. Interpolation Closing the Gaps of Discretization... Beyond Polynomials Closing the Gaps of Discretization... Beyond Polynomials, December 19, 2012 1 3.3. Polynomial Splines Idea of Polynomial Splines
Linear Threshold Units
Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear
We can display an object on a monitor screen in three different computer-model forms: Wireframe model Surface Model Solid model
CHAPTER 4 CURVES 4.1 Introduction In order to understand the significance of curves, we should look into the types of model representations that are used in geometric modeling. Curves play a very significant
So which is the best?
Manifold Learning Techniques: So which is the best? Todd Wittman Math 8600: Geometric Data Analysis Instructor: Gilad Lerman Spring 2005 Note: This presentation does not contain information on LTSA, which
Math 241, Exam 1 Information.
Math 241, Exam 1 Information. 9/24/12, LC 310, 11:15-12:05. Exam 1 will be based on: Sections 12.1-12.5, 14.1-14.3. The corresponding assigned homework problems (see http://www.math.sc.edu/ boylan/sccourses/241fa12/241.html)
Vision based Vehicle Tracking using a high angle camera
Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu [email protected] [email protected] Abstract A vehicle tracking and grouping algorithm is presented in this work
Adding vectors We can do arithmetic with vectors. We ll start with vector addition and related operations. Suppose you have two vectors
1 Chapter 13. VECTORS IN THREE DIMENSIONAL SPACE Let s begin with some names and notation for things: R is the set (collection) of real numbers. We write x R to mean that x is a real number. A real number
EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines
EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation
The Fourth International DERIVE-TI92/89 Conference Liverpool, U.K., 12-15 July 2000. Derive 5: The Easiest... Just Got Better!
The Fourth International DERIVE-TI9/89 Conference Liverpool, U.K., -5 July 000 Derive 5: The Easiest... Just Got Better! Michel Beaudin École de technologie supérieure 00, rue Notre-Dame Ouest Montréal
AP Physics 1 and 2 Lab Investigations
AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks
From 3D to 2D: Orthographic and Perspective Projection Part 1
From 3D to 2D: Orthographic and Perspective Projection Part 1 History Geometrical Constructions Types of Projection Projection in Computer Graphics Andries van Dam September 13, 2001 3D Viewing I 1/34
Manual for simulation of EB processing. Software ModeRTL
1 Manual for simulation of EB processing Software ModeRTL How to get results. Software ModeRTL. Software ModeRTL consists of five thematic modules and service blocks. (See Fig.1). Analytic module is intended
Colorado School of Mines Computer Vision Professor William Hoff
Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Introduction to 2 What is? A process that produces from images of the external world a description
Segmentation of building models from dense 3D point-clouds
Segmentation of building models from dense 3D point-clouds Joachim Bauer, Konrad Karner, Konrad Schindler, Andreas Klaus, Christopher Zach VRVis Research Center for Virtual Reality and Visualization, Institute
6 Space Perception and Binocular Vision
Space Perception and Binocular Vision Space Perception and Binocular Vision space perception monocular cues to 3D space binocular vision and stereopsis combining depth cues monocular/pictorial cues cues
The Lindsey-Fox Algorithm for Factoring Polynomials
OpenStax-CNX module: m15573 1 The Lindsey-Fox Algorithm for Factoring Polynomials C. Sidney Burrus This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 2.0
Simultaneous Gamma Correction and Registration in the Frequency Domain
Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong [email protected] William Bishop [email protected] Department of Electrical and Computer Engineering University
Curriculum Map by Block Geometry Mapping for Math Block Testing 2007-2008. August 20 to August 24 Review concepts from previous grades.
Curriculum Map by Geometry Mapping for Math Testing 2007-2008 Pre- s 1 August 20 to August 24 Review concepts from previous grades. August 27 to September 28 (Assessment to be completed by September 28)
Chapter 111. Texas Essential Knowledge and Skills for Mathematics. Subchapter B. Middle School
Middle School 111.B. Chapter 111. Texas Essential Knowledge and Skills for Mathematics Subchapter B. Middle School Statutory Authority: The provisions of this Subchapter B issued under the Texas Education
Prentice Hall Algebra 2 2011 Correlated to: Colorado P-12 Academic Standards for High School Mathematics, Adopted 12/2009
Content Area: Mathematics Grade Level Expectations: High School Standard: Number Sense, Properties, and Operations Understand the structure and properties of our number system. At their most basic level
Part-Based Recognition
Part-Based Recognition Benedict Brown CS597D, Fall 2003 Princeton University CS 597D, Part-Based Recognition p. 1/32 Introduction Many objects are made up of parts It s presumably easier to identify simple
SO(3) Camillo J. Taylor and David J. Kriegman. Yale University. Technical Report No. 9405
Minimization on the Lie Group SO(3) and Related Manifolds amillo J. Taylor and David J. Kriegman Yale University Technical Report No. 945 April, 994 Minimization on the Lie Group SO(3) and Related Manifolds
Performance Level Descriptors Grade 6 Mathematics
Performance Level Descriptors Grade 6 Mathematics Multiplying and Dividing with Fractions 6.NS.1-2 Grade 6 Math : Sub-Claim A The student solves problems involving the Major Content for grade/course with
Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf. Flow Visualization. Image-Based Methods (integration-based)
Visualization and Feature Extraction, FLOW Spring School 2016 Prof. Dr. Tino Weinkauf Flow Visualization Image-Based Methods (integration-based) Spot Noise (Jarke van Wijk, Siggraph 1991) Flow Visualization:
Space Perception and Binocular Vision
Space Perception and Binocular Vision Space Perception Monocular Cues to Three-Dimensional Space Binocular Vision and Stereopsis Combining Depth Cues 9/30/2008 1 Introduction to Space Perception Realism:
Finite cloud method: a true meshless technique based on a xed reproducing kernel approximation
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING Int. J. Numer. Meth. Engng 2001; 50:2373 2410 Finite cloud method: a true meshless technique based on a xed reproducing kernel approximation N.
Optical Flow. Shenlong Wang CSC2541 Course Presentation Feb 2, 2016
Optical Flow Shenlong Wang CSC2541 Course Presentation Feb 2, 2016 Outline Introduction Variation Models Feature Matching Methods End-to-end Learning based Methods Discussion Optical Flow Goal: Pixel motion
Applications to Data Smoothing and Image Processing I
Applications to Data Smoothing and Image Processing I MA 348 Kurt Bryan Signals and Images Let t denote time and consider a signal a(t) on some time interval, say t. We ll assume that the signal a(t) is
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
3D Model based Object Class Detection in An Arbitrary View
3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/
Two vectors are equal if they have the same length and direction. They do not
Vectors define vectors Some physical quantities, such as temperature, length, and mass, can be specified by a single number called a scalar. Other physical quantities, such as force and velocity, must
Computer Graphics. Introduction. Computer graphics. What is computer graphics? Yung-Yu Chuang
Introduction Computer Graphics Instructor: Yung-Yu Chuang ( 莊 永 裕 ) E-mail: [email protected] Office: CSIE 527 Grading: a MatchMove project Computer Science ce & Information o Technolog og Yung-Yu Chuang
How To Analyze Ball Blur On A Ball Image
Single Image 3D Reconstruction of Ball Motion and Spin From Motion Blur An Experiment in Motion from Blur Giacomo Boracchi, Vincenzo Caglioti, Alessandro Giusti Objective From a single image, reconstruct:
Nonlinear Iterative Partial Least Squares Method
Numerical Methods for Determining Principal Component Analysis Abstract Factors Béchu, S., Richard-Plouet, M., Fernandez, V., Walton, J., and Fairley, N. (2016) Developments in numerical treatments for
Computer Graphics. Geometric Modeling. Page 1. Copyright Gotsman, Elber, Barequet, Karni, Sheffer Computer Science - Technion. An Example.
An Example 2 3 4 Outline Objective: Develop methods and algorithms to mathematically model shape of real world objects Categories: Wire-Frame Representation Object is represented as as a set of points
Edge tracking for motion segmentation and depth ordering
Edge tracking for motion segmentation and depth ordering P. Smith, T. Drummond and R. Cipolla Department of Engineering University of Cambridge Cambridge CB2 1PZ,UK {pas1001 twd20 cipolla}@eng.cam.ac.uk
Geometric Camera Parameters
Geometric Camera Parameters What assumptions have we made so far? -All equations we have derived for far are written in the camera reference frames. -These equations are valid only when: () all distances
Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary
Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals
The Role of Size Normalization on the Recognition Rate of Handwritten Numerals Chun Lei He, Ping Zhang, Jianxiong Dong, Ching Y. Suen, Tien D. Bui Centre for Pattern Recognition and Machine Intelligence,
Experiment #1, Analyze Data using Excel, Calculator and Graphs.
Physics 182 - Fall 2014 - Experiment #1 1 Experiment #1, Analyze Data using Excel, Calculator and Graphs. 1 Purpose (5 Points, Including Title. Points apply to your lab report.) Before we start measuring
Rendering Area Sources D.A. Forsyth
Rendering Area Sources D.A. Forsyth Point source model is unphysical Because imagine source surrounded by big sphere, radius R small sphere, radius r each point on each sphere gets exactly the same brightness!
Derive 5: The Easiest... Just Got Better!
Liverpool John Moores University, 1-15 July 000 Derive 5: The Easiest... Just Got Better! Michel Beaudin École de Technologie Supérieure, Canada Email; [email protected] 1. Introduction Engineering
PASSIVE DRIVER GAZE TRACKING WITH ACTIVE APPEARANCE MODELS
PASSIVE DRIVER GAZE TRACKING WITH ACTIVE APPEARANCE MODELS Takahiro Ishikawa Research Laboratories, DENSO CORPORATION Nisshin, Aichi, Japan Tel: +81 (561) 75-1616, Fax: +81 (561) 75-1193 Email: [email protected]
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
Thnkwell s Homeschool Precalculus Course Lesson Plan: 36 weeks
Thnkwell s Homeschool Precalculus Course Lesson Plan: 36 weeks Welcome to Thinkwell s Homeschool Precalculus! We re thrilled that you ve decided to make us part of your homeschool curriculum. This lesson
Metrics on SO(3) and Inverse Kinematics
Mathematical Foundations of Computer Graphics and Vision Metrics on SO(3) and Inverse Kinematics Luca Ballan Institute of Visual Computing Optimization on Manifolds Descent approach d is a ascent direction
Projection Center Calibration for a Co-located Projector Camera System
Projection Center Calibration for a Co-located Camera System Toshiyuki Amano Department of Computer and Communication Science Faculty of Systems Engineering, Wakayama University Sakaedani 930, Wakayama,
H.Calculating Normal Vectors
Appendix H H.Calculating Normal Vectors This appendix describes how to calculate normal vectors for surfaces. You need to define normals to use the OpenGL lighting facility, which is described in Chapter
NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS
NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document
1 Example of Time Series Analysis by SSA 1
1 Example of Time Series Analysis by SSA 1 Let us illustrate the 'Caterpillar'-SSA technique [1] by the example of time series analysis. Consider the time series FORT (monthly volumes of fortied wine sales
Measurement with Ratios
Grade 6 Mathematics, Quarter 2, Unit 2.1 Measurement with Ratios Overview Number of instructional days: 15 (1 day = 45 minutes) Content to be learned Use ratio reasoning to solve real-world and mathematical
VECTORAL IMAGING THE NEW DIRECTION IN AUTOMATED OPTICAL INSPECTION
VECTORAL IMAGING THE NEW DIRECTION IN AUTOMATED OPTICAL INSPECTION Mark J. Norris Vision Inspection Technology, LLC Haverhill, MA [email protected] ABSTRACT Traditional methods of identifying and
Wii Remote Calibration Using the Sensor Bar
Wii Remote Calibration Using the Sensor Bar Alparslan Yildiz Abdullah Akay Yusuf Sinan Akgul GIT Vision Lab - http://vision.gyte.edu.tr Gebze Institute of Technology Kocaeli, Turkey {yildiz, akay, akgul}@bilmuh.gyte.edu.tr
Computational Optical Imaging - Optique Numerique. -- Deconvolution --
Computational Optical Imaging - Optique Numerique -- Deconvolution -- Winter 2014 Ivo Ihrke Deconvolution Ivo Ihrke Outline Deconvolution Theory example 1D deconvolution Fourier method Algebraic method
DSP First Laboratory Exercise #9 Sampling and Zooming of Images In this lab we study the application of FIR ltering to the image zooming problem, where lowpass lters are used to do the interpolation needed
Video stabilization for high resolution images reconstruction
Advanced Project S9 Video stabilization for high resolution images reconstruction HIMMICH Youssef, KEROUANTON Thomas, PATIES Rémi, VILCHES José. Abstract Super-resolution reconstruction produces one or
3D Scanner using Line Laser. 1. Introduction. 2. Theory
. Introduction 3D Scanner using Line Laser Di Lu Electrical, Computer, and Systems Engineering Rensselaer Polytechnic Institute The goal of 3D reconstruction is to recover the 3D properties of a geometric
