Analyzing Facial Expressions for Virtual Conferencing
|
|
|
- Sheryl Owens
- 10 years ago
- Views:
Transcription
1 Animating Virtual Humans Analzing Facial Epressions for Virtual Conferencing Video coding applications such as video conferencing, telepresence, and teleteaching have attracted considerable interest in recent ears. Transmitting image sequences tpicall imposes high demands on storage, processing power, and network bandwidth. Common networks like POTS (Plain Old Telephone Sstem), ISDN (Integrated Services Digital Network), and man computer networks provide onl low bit rates We present a model-based and cannot handle transmitting uncompressed video. Therefore, algorithm that estimates 3D encoding the sequences proves essential. Highl efficient coding motion and facial schemes like H.263 and MPEG-1 and 2 achieve compression factors epressions from 2D image of about 1:1 with acceptable qualit. sequences showing head Using model-based coding techniques further improves compression while simultaneousl providing and shoulder scenes tpical us with the possibilit of manipulating a scene s content interactivel. 1,2 of video telephone and Model-based coding aims to create teleconferencing a virtual 3D world with distinct 3D objects described b their shape and applications. teture. In this world we onl have to specif how the objects move and deform. The objects shape transmits onl once, and we can then describe the scene in all the following frames with few parameters specifing the objects position and deformation. We reconstruct the 2D image sequence b rendering the virtual world using computer graphics techniques. Peter Eisert and Bernd Girod Universit of Erlangen Model-based coding of head and shoulder scenes Model-based coding of image sequences requires 3D models for all objects in the scene. Therefore, we must put some restrictions on the scene content. In this article we focus on head and shoulder scenes for video telephone and conferencing applications. Let s see how this process works. A speaking person sits in front of the sstem while a single camera records video frames. The encoder analzes the incoming frames and estimates the person s 3D motion and facial epressions. We obtain a set of facial-epression parameters that describe (together with the 3D model) the person s current appearance. We onl need to encode and transmit a few parameters, leading to ver low data rates (tpicall less than 1 Kbps). At the decoder we use the parameters to deform our head model according to the person s facial epressions and approimate the original camera frame b rendering our 3D model. Figure 1 shows the model-based coder s principal structure. We can easil manipulate and interact with the scene content once segmented into single 3D objects. For eample, the ee contact problem that occurs in conventional video telephone sstems resolves if we position the virtual camera inside the monitor that displas the image of the person with whom we re talking. Or, we can look around objects if we move the snthetic camera according to the ee trajector the coder estimates. Virtual conferencing, 3,4 another application for model-based coding, etends the video telephone framework. Here s how it works. Several people are connected to a network talking to each other. A 3D model of each person specifies the person s individual shape, color, and motion characteristics. From the video frames, facial animation parameters (FAPs) are estimated and distributed to all participants. Given the head models and their corresponding motion parameters, we can put all people at a table in a virtual conference room and generate individual views for them. Here we present a method for estimating a speaking person s FAPs from two successive video frames. This algorithm serves as the basis for the applications mentioned above. First, we show the generic head model used to describe a person s appearance. This 3D model also restricts the possible motions in the face, simplifing the facial parameter estimation. We then propose our gradient-based linear motion estimator, which works hierarchicall with low computational compleit. Finall, we validate the algorithm with eperimental results for snthetic and real image sequences. 7 September/October /98/$ IEEE
2 Coder about 1 Kbps Decoder Video Analsis Estimation of motion and facial epressions Parameter coding Parameter decoding Snthesis Rendering of the head model Video Model Shape Teture Illumination Dnamics 1 Basic structure of a modelbased coder. 3D head model To estimate motion, we can use a simple virtual scene consisting of a head model and camera model. If we want to generate snthetic views of a virtual conference, we have to combine several head models with other 3D models of objects in the room during rendering. However, for estimating FAPs, modeling one person suffices. The head model in the virtual world specifies a speaking person s 3D shape and color, but also constrains the facial motion caused b epressions. Like other well-known facial models, 5,6 (a) our proposed model consists of a triangular mesh and teture is mapped onto the surface to obtain a photorealistic appearance. The use of secondorder triangular B-splines 7 for the definition of the surface s shape facilitates the modeling of facial epressions. Our face model is a generic 3D model consisting of 11 triangular B-spline patches for the face and neck area. It also contains 36 rigid patches on the back of the head. Teeth and the mouth s interior form separate 3D objects. The patches topolog (see Figure 2a) based on the Candide model 6 and the definition of the FAP units remains fied for all persons. To adjust the model to an individual person, we need onl change the teture and control points position using information from 3D laser scans. To simplif the rendering, we approimate the adjusted model s spline surface b a number of planar triangles, which results in a triangular mesh. Figure 2b shows an eample of an individuall adapted and subdivided mesh. Triangular B-spline-based surface modeling The set of vertices that define our model s shape has a ver large number of degrees of freedom. Therefore, (b) we can model comple objects with sharp edges and discontinuities. A person s face, however, has a smooth surface. Thus facial epressions result in smooth movements of surface points due to the anatomical properties of tissue and muscles. These restrictions on curvature and motion can be modeled b splines, which satisf specific continuit constraints. Hoch et al. 8 have shown that B-splines model facial skin well. This tpe of spline ehibits some interesting properties useful for implementing the head model: Smoothness: A B-spline of order n is C n 1 continuous. For our model we use second-order splines leading to C 1 continuit of the surface. Local control: Movement of a single control point influences the surface just in a local neighborhood, which simplifies modeling facial epressions. Affine invariance: An affine transformation of the surface results from appling the same transformation on the control points. Facial movements are now defined b the transformation of a small number of 2 (a) Topolog of the triangular B-patches. (b) Individuall adapted triangular mesh used for rendering and motion estimation. IEEE Computer Graphics and Applications 71
3 Animating Virtual Humans 3 Teture and depth information from a 3D laser scanner. 4 Control points of the shaped spline surface. control points instead of appling transformations on each verte, which reduces the computational compleit. Conventional B-splines suffer from one well-known drawback: since the re defined on a rectangular topolog, the don t refine curved areas locall. To overcome this restriction, we use triangular B- splines. 7 This spline scheme, based on triangular patches (shown in Figure 2a), can easil be refined while still preserving the above mentioned properties of normal B-splines. For rendering, we approimate the smooth spline and subdivide each patch into a number of planar triangles, leading to a mesh as illustrated in Figure 2b. We can var the number of triangles to get either a good approimation or higher frame rate during rendering. The resulting triangular mesh is defined b a discrete set of vertices on the mathematicall defined spline surface. We need to compute the B-spline basis functions 7 onl at the discrete verte positions on the surface (this can be done offline). Once we have determined the basis functions, we calculate the position of the vertices v j b v = N c j ji i i I j (1) with N ji the precalculated basis functions of verte j with N ji = 1 i I j (2) where I j is the inde set of verte j and c i the ith control point. The inde set I j usuall contains three to si indices for the control points that influence the verte s position. To individualize the generic model, we use a 3D laser scan of the person with a neutral epression. The scan provides us with information about color and depth as shown in Figure 3. We map the teture on the 3D model and optimize the control points position to adapt the spline surface to the measured data. After the optimization, shape and teture coincide with the person s appearance. The model can now be deformed to show other epressions rather than the neutral one of the scan. 5 Different snthesized epressions. Modeling facial epressions To estimate the FAPs requires animating our model to create different facial epressions. We simplified this task b using splines because the alread constrain the neighboring vertices motion. To parameterize the facial epressions, we adapted the MPEG-4 Snthetic-Natural Hbrid Coding (SNHC) group s proposal. 9 According to that scheme, ever facial epression can be generated b a superposition of 68 action units. These include both global motion (such as head rotation) and local motion (ee or mouth movement). Currentl, 46 of these 68 FAPs are implemented. To model the local movements, our generic face model contains a table describing how the mesh s con- 72 September/October 1998
4 trol points are translated or rotated for each FAP. This is done onl for the small number of control points shown in Figure 4. For a given set of FAPs that specif a person s epression, all positions of the control points are updated. Then the verte locations are computed according to Equation 1 b a linear combination of the control points. Figure 5 shows eamples of rendered epressions for different persons. Camera model The camera model describes the relation between our virtual 3D world and the 2D video frames. It s used for both rendering and parameter estimation. We use the perspective projection as shown in Figure 6, where the 3D coordinates of an object point [ z] T are projected into the image plane according to z Y X, Y X Object Image plane 6 Camera model and its associated coordinate sstems. X = X f Y = Y f z z (3) Here, f and f denote the focal length multiplied b scaling factors in the - and -directions, respectivel. These scaling factors transform the image coordinates into piel coordinates X and Y. In addition, the allow the use of non-square piel geometries. The two parameters X and Y describe the image center and its translation from the optical ais due to inaccurate placement of the charge-coupled device (CCD) sensor in the camera. To estimate the FAPs, we must model the real camera used for recording the video sequence in our snthetic world. The four parameters f, f, X, and Y are therefore obtained from an initial camera calibration using Tsai s algorithm. 1 At the decoder we can then use arbitrar camera parameter settings if we don t want to reconstruct the original image eactl. This lets us zoom into the scene if desired. Motion estimation and facial epression analsis Our motion estimation algorithm analzes a speaking person s facial epressions b estimating changes of FAPs. We use the whole face image for the estimation, in contrast to feature-based approaches that eploit the information of discrete feature point correspondences. Estimating 3D motion vectors from 2D images proves difficult. The errors of the estimates become ver large if we do not determine eactl the position of the small number of features. Therefore, we set up equations at each piel location of the image, leading to a large number of correspondences used for parameter estimation. To keep the computational compleit low, we developed a linear algorithm for the estimation that solves the large set of equations in a least-squares sense. The small errors that arise due to linearization are corrected using a feedback structure in a hierarchical framework. Feedback loop The motion estimation algorithm presented here esti- Camera image Snthetic image I(n) Î(n 1) Model Analsis Shape Teture Epressions Snthesis Facial parameters Model update mates the FAPs from two successive frames using the shape and teture information from the 3D model. An eas wa of doing this is to estimate the motion between two camera frames and move the 3D head model according to these estimates in the virtual world. However, small errors in the estimation result in placing the 3D model whose shape information proves necessar for computing the motion incorrectl. In the net frame, the facial parameters become even worse because the shape does not recover from the model eactl. To avoid error accumulation in the long-term parameter estimation, we use a feedback loop 11 as depicted in Figure 7. The head s model moves according to the parameters estimated from two successive frames. Rendering the model with a modified shape and position generates a snthetic image. The estimation is then performed between the actual camera frame I(n) and the snthetic image of the previous frame ^ I(n 1), which assures that no misalignment of the model occurs. An error in the estimated parameters again leads to placing the model incorrectl. However, the error can be removed during the estimation in the net frame because no mismatch eists between the 3D model and the sntheticall rendered image ^ I(n 1). 7 Feedback structure of the coder. IEEE Computer Graphics and Applications 73
5 Animating Virtual Humans 8 Transformation from FAPs to image points. FAPs 3D motion equation Estimating the 3D motion of fleible bodies from 2D images proves difficult. However, we can constrain the deformation of most objects to simplif the estimation process. In our algorithm we use a parameterized head model whose local deformations are a function of the 68 unknown FAPs. For each 3D object point of the surface, we can set up one linear 3D motion equation that determines how the point moves in the 3D space if the FAPs change. Instead of the si parameters specifing a rigid bod motion, we now have 68 unknowns. Due to the large number of equations for the parameter estimation, we still have an over-determined linear sstem of equations that we can solve easil. To set up the 3D motion equation, we must combine several transformations, shown in Figure 8, to take into account the dependencies between the FAPs and the model s 3D object points. First, the surface s control points are moved according to the given FAPs. Using the basis functions of the splines, the algorithm calculates the position of the vertices from the control points. Three vertices form a triangle, and the 3D motion of all object points inside this triangle specified b their barcentric coordinates is determined. Finall, the 2D image point is obtained b projecting the 3D point into the image plane. We incorporated these transformations all linear ecept for the projection into our parameter estimation algorithm. The new control point position c i can be determined from the position c i in the previous frame b where 3D model 9 Approimation of rotations. Control points Basis functions c = c + FAP d i i k ik k FAPk = FAP k FAPk Vertices Barcentric coordinates d ik α k r i 3D object points (4) (5) is the change of the facial animation parameter k between the two frames and d ik the 3D direction vector of the corresponding movement. Strictl speaking, Equation 4 is valid onl for translations. If a number of control points rotate around given O Camera model α k 2D object points aes b some action units, the description for the motion of control points becomes more complicated due to the combination of rotation (defined b rotation matrices) and translation. The order of these operations can no longer be echanged, and the use of matri multiplication results in a set of equations that is nonlinear in the parameters that must be estimated. However, we can also use the linear description from Equation 4 for rotation if we assume that the rotation angles between two successive frames are small. Then, the trajector of a control point i that rotates around the center O can be approimated b its tangent d ik as shown in Figure 9. This tangent differs for all object points, but we have to set up Equation 4 for all points individuall anhow because of local deformations in the surface. For a rotation b the angle α k, α k = FAP k s k (6) defined b the facial animation parameter changes FAP k and the corresponding scaling factor s k, the length of the translation vector d ik can be calculated b d ik = FAP k r i s k (7) Here, r i is the distance between the object point and the given rotation ais. With this assumption (the direction of d ik is specified b the direction of the tangent), Equation 4 can also be used for rotation, leading to a simple linear description for all FAPs. Additionall, we can estimate both global and local motion simultaneousl. The small error caused b the approimation is compensated after some iterations in the feedback structure shown in Figure 7. Having modeled the shift in control points, we can determine the motion of the triangular mesh s vertices using Equation 1. The local motion of an object point is calculated from that using 2 2 = λmv m = λm Nmj c m= (8) where λ m are the barcentric coordinates of the object point in the triangle that encloses that point. The motion equation for a surface point can be represented as = + FAP k t k = + T FAP k j J m= (9) where t k s are the new direction vectors to the corresponding FAP calculated from d k b appling the linear transforms of Equations 1 and 8. T combines all the vectors in a single matri and FAP is the vector of all FAP changes. The matri T can be derived from the 3D model. Equation 9 then describes the change of the 3D point location as a linear function of FAP changes FAP. Due to the presence of local deformations, we cannot describe the transformation with the same matri T for all object points (as in the case for rigid bod motion) but have to set T up for each point independentl. j 74 September/October 1998
6 . Motion estimation For motion estimation, we use the whole face image b setting up the optical flow constraint equation I X u+ I Y v+ I = t (1) where [I X, I Y] is the gradient of the intensit at point [X, Y], uand vthe velocit in - and -direction, and I tthe intensit gradient in the temporal direction. For gralevel images I represents the luminance using color images leads to three independent equations for the three-color components. Since u and v can take on arbitrar values for each point [X,Y], this set of equations is under-determined, and we need additional constraints to compute a unique solution. Instead of determining the optical flow field b using smoothness constraints and then etracting the motion parameter set from this flow field, we estimate the FAPs from Equation 1 together with the 3D motion equations of the head s object points. 12,13 This technique resembles the one described b DeCarlo and Metaas. 14 One main difference is that we estimate the motion from snthetic frames and camera images using a feedback loop as shown in Figure 7. This allows the use of a hierarchical framework that can handle larger motion vectors between two successive frames. Beond that, we don t need edge forces to avoid an error accumulation (as described b DeCarlo and Metaas 14 ) with the rendering feedback loop. Another difference between the two approaches is that we use a tetured 3D model, which lets us generate new snthetic views of our virtual scene after estimating the motion. Writing Equation 9 in its single components leads to = t FAP = t FAP (11) (12) 1 [ z I X f t + I Y f t + I X ( X X ) + IY ( Y Y ) t z ] FAP It (16) ( ) = with z being the depth information coming from the model. We obtain an over-determined sstem that can be solved in a least-squares sense with low computational compleit. The sstem s size depends directl on the number of FAP. Outlier removal At each piel position of the object, we can set up Equation 16 leading to a large number of equations. We need at least the same number of equations as the number of estimated FAPs, but due to the large number of face piels each of which contributes one additional equation we can discard some possible outliers. These outliers can be detected b analzing the partial derivatives of the intensit and the motion model. The optical flow constraint of Equation 1 is onl valid for small displacement vectors due to the linearization of the intensit values. If the estimate of the displacement vector length for the piel at position [X,Y], ( ) = 2 ^ It d XY, 2 2 I X + I Y (17) is larger than a threshold G, we classif the piel as an outlier and don t use it for motion estimation. Hierarchical motion estimation The inherent linearization of the intensit in the optical flow constraint and the approimations used to obtain a linear solution prevent dealing with large displacement vectors between two successive video frames. To overcome this limitation, we use a hierarchical scheme for the motion estimation as shown in Figure 1. First, an approimation for the motion parameters between frame n and n 1 is computed from low-pass z = z t z z FAP (13) t, t, and t zare the row vectors of matri T. Dividing Equations 11 and 12 b Equation 13, inserting the camera model from Equation 3, and using a first-order approimation leads to ( ) 1 u = X X (14) z f t + X X t FAP ( ) 1 v = Y Y (15) z f t + Y Y t FAP ( ) ( ) z This equation serves as the motion constraint in the 2D image plane. Together with Equation 1 a linear equation at each piel position can be set up: Frame n Frame n Frame n Camera frames Motion estimation Snthetic frames Render 3D model z Motion Render estimation 3D model Frame n 1 Motioncompensated frame Motioncompensated frame 1 Image pramid of the hierarchical motion estimation scheme. IEEE Computer Graphics and Applications 75
7 . Animating Virtual Humans 11 PSNR after global and global plus local motion compensation. PSNR (decibels) Global + local motion Global motion Frame time, the threshold G used to detect outliers for the motion estimation reduces from 5 (first level) to.5 (highest resolution), which means that at higher levels more piels become classified as outliers. Eperiments with this hierarchical scheme showed that it can estimate displacements of up to 3 piels between two frames. Eperimental results To validate the estimation algorithm, we performed eperiments with snthetic and real video. To create the snthetic sequences, we rendered the 3D model with known FAPs, which let us compare the estimated FAPs with original ones. For real sequences recorded with a camera, we had to individualize the generic 3D model to the person in the video using a 3D laser scan of that person. Unfortunatel, we couldn t determine the accurac of the estimated FAP values for real image sequences, since we didn t know their correct values. Therefore, we could onl compare the reconstructed snthetic image with the original camera frame. The error measure that we used for that purpose is the peak signal noise ratio (PSNR), defined as 12 Camera frames (left) and corresponding reconstructed snthetic frames (right). PSNR = 1 log N 1 i= ( Iorig, i Isnth, i ) (18) In this equation I orig,i is the luminance component of piel i of the original camera image with values from to 255, and I snth,i is the corresponding value in the snthetic image. N denotes the number of piels in the image. filtered and subsampled versions of the camera frame I(n) and the snthetic frame ^ I(n 1). Due to using lowresolution images, the linear intensit assumption remains valid over a wider range. For subsampling, simple moving average filters reduces aliasing. A Gauss filter to smooth the edges before estimating the motion further filters the resulting images. The estimated parameter set generates a motion-compensated image b moving the 3D model and rendering it at the new position. Due to the motion compensation, the differences between the new snthetic image and camera frame decrease. Then, the procedure is repeated at higher resolutions, each time ielding a more accurate motion parameter set. Note that we use the same camera frame in all levels, and that the snthetic image is changed from ^ I(n 1) to ^ I(n) iterativel. In our current implementation we use three levels of resolution starting from piels. For each new level the resolution doubles in both directions, leading to a final Common Intermediate Format (CIF) resolution of piels. At the same Snthetic sequence In the first eperiment we created a snthetic sequence (1 frames) with a resolution of piels (CIF resolution). Rendering the 3D model with well-defined FAPs and a viewing angle of about 25 degrees achieves this. We estimated 1 FAPs for ever fifth frame using the proposed algorithm and compared the results to the correct values. Table 1 shows the relative error averaged over all estimates. The values in the table were measured relative to the maimum of each value that corresponded to an etreme epression. In this eperiment we used the FAPs for global motion of the head (FAPs 48, 49, 5), opening of the jaw (FAP 3), and movements of eelids (FAPs 19, 2), eebrows (FAPs 35, 36), and lip corners (FAPs 12, 13). The accurate values for the estimated FAPs also resulted in an ecellent reconstruction of the original frames. Because we used the same model for creating the sequence and estimating the parameters, we achieved a nearl perfect reconstruction. You can see this in Figure 11, where we plot the measured PSNR between the original and snthetic image. Averaging the PSNR values computed onl in the facial area and not for the background leads to a value of about 7 decibels. For comparison, Figure 11 also shows the PSNR for the reconstructed sequence we generated using onl global motion parameters (head translation and rotation). 76 September/October 1998
8 Table 1. Relative average error of the estimated FAPs (in percentages). FAP Relative error Camera sequences In a second eperiment, we recorded a video of a talking person with a camera in CIF resolution and a frame rate of 12.5 Hz. We estimated the FAPs for all 23 frames and rendered the corresponding snthetic sequence. We estimated 17 parameters including global head motion (si parameters) and movements of eebrows (four parameters) and mouth motion (seven parameters). The total processing time for each frame was about 1.8 seconds on a 175-MHz Silicon Graphics O2 workstation; 3.3 seconds of the time was spent for the motion estimation algorithm, including setting up the equations and solving for the unknowns. Rendering the model and accessing files took up the remaining time. Figure 12 shows three frames of the camera sequence and the snthesized frames from the estimated parameters. Since we used real camera sequences, we had to measure the estimation s accurac b comparing the 2D images from the camera and the renderer. The PSNR between original and snthetic images computed in the facial area averaged 31.6 db. Figure 13 shows the PSNR for each frame when using 17 parameters for the estimation (blue) or when performing onl global motion estimation (si parameters). Comparable results are also achieved for other sequences. Figure 14 shows the original and snthetic views for a second sequence. To estimate the bit rate needed for transmitting a head and shoulders sequence over a network, we used a Huffman coder to encode the FAPs. The 17 FAPs are predicted from the previous frame while the prediction error is quantized with 7 bits and then Huffman coded. This leads to an average bit rate of about.58 Kbps, which corresponds to about 47 bits per frame. This calculation assumes that the decoder has alread received the model. Once we have estimated the FAPs for our 3D model, we can use them to create virtual environments. Figure 15 shows an eample for such a manipulation. Here, we recorded a sequence of a speaking person (left side) and estimated the FAPs. These parameters were transmitted with an average bit rate of about.57 Kbps. The decoder then rendered the 3D model together with a description of the background leading to the images shown on the right side of Figure 15. Conclusions In this article we presented a method for the estimation of FAPs from 2D image sequences. The combination of the optical flow constraint with 3D motion models lead to a robust and low compleit algorithm. Eperiments with real video sequences showed that we can achieve bit rates as low as.6 Kbps at 31.6 db PSNR. Currentl, we re investigating the influence of illumination variation that violates the optical flow constraint. To reduce this influence, we add illumination models to our virtual scene and estimate both motion PSNR (decibels) Global + local motion Global motion Frame and illumination parameters like light direction and intensit. 15 Also, we re working on adding temporal 3D motion constraints to the estimation framework to further increase its robustness. 13 PSNR of a coded talking head sequence. 14 Camera frames (left) and corresponding reconstructed snthetic frames (right). 15 Camera frames (left) and corresponding reconstructed snthetic frames in a virtual environment (right). IEEE Computer Graphics and Applications 77
9 Animating Virtual Humans References 1. D.E. Pearson, Developments in Model-Based Video Coding, Proc. IEEE, Vol. 83, No. 6, June 1995, pp W.J. Welsh, S. Searsb, and J.B. Waite, Model-Based Image Coding, British Telecom Technolog J., Vol. 8, No. 3, Jul 199, pp K. Aizawa and T.S. Huang, Model-Based Image Coding: Advanced Video Coding Techniques for Ver Low Bit-Rate Applications, Proc. IEEE, Vol. 83, No. 2, Feb. 1995, pp I.S. Pandzic et al., Towards Natural Communication in Networked Collaborative Virtual Environments, Proc. Framework for Immersive Environments (FIVE) 96, 1996, 5. F.I. Parke, Parameterized Models for Facial Animation, IEEE CG&A, Vol. 2, No. 9, 1982, pp M. Rdfalk, Candide: A Parameterized Face, LiTH-ISY-I- 866, Image Coding Group, Linköping Univ., Linköping, Sweden, Oct., G. Greiner and H.-P. Seidel, Modeling with Triangular B- Splines, Proc. ACM/IEEE Solid Modeling Smp. 93, 1993, ACM Press, New York, pp M. Hoch, G. Fleischmann, and B. Girod, Modeling and Animation of Facial Epressions Based on B-Splines, Visual Computer, Vol. 11, 1994, pp SNHC Sstems Verification Model 4., ISO/IEC JTC1/SC29/WG11 N1666, Bristol, Great Britain, April R.Y. Tsai, A Versatile Camera Calibration Technique for High-Accurac 3D Machine Vision Metrolog Using Offthe-Shelf TV Cameras and Lenses, IEEE J. of Robotics and Automation, Vol. RA-3, No. 4, Aug. 1987, pp R. Koch, Dnamic 3D Scene Analsis through Snthesis Feedback Control, IEEE Trans. PAMI, Vol. 15, No. 6, June 1993, pp H. Li, P. Roivainen, and R. Forchheimer, 3D Motion Estimation in Model-Based Facial Image Coding, IEEE Trans. PAMI, Vol. 15, No. 6, June 1993, pp J. Ostermann, Object-Based Analsis-Snthesis Coding (OBACS) Based on the Source Model of Moving Fleible 3D Objects, IEEE Trans. Image Processing, Vol. 3, No. 5, Sept. 1994, pp D. DeCarlo and D. Metaas, The Integration of Optical Flow and Deformable Models with Applications to Human Face Shape and Motion Estimation, Proc. Computer Vision and Pattern Recognition (CVPR) 96, IEEE CS Press, Los Alamitos, Calif., 1996, pp P. Eisert and B. Girod, Model-Based Coding of Facial Image Sequences at Varing Illumination Conditions, Proc. 1th Image and Multidimensional Digital Signal Processing Workshop 98, IEEE Press, Piscatawa, N.J., Jul 1998, pp Peter Eisert joined the image communication group at the Telecommunications Institute of the Universit of Erlangen-Nuremberg, German in As a member of the Center of Ecellence s 3D Image Analsis and Snthesis group, he is currentl working on his PhD thesis. His research interests include modelbased video coding, image communication, and computer vision. He received his diploma in electrical engineering from the Universit of Karlsruhe, German in Bernd Girod is a chaired professor of telecommunications in the Electrical Engineering Department of the Universit of Erlangen-Nuremberg. His research interests are in image communication, 3D image analsis and snthesis, and multimedia sstems. He has been the director of the Center of Ecellence s 3D Image Analsis and Snthesis group in Erlangen since He received an MS from Georgia Institute of Technolog, Atlanta, and a PhD in engineering from the Universit of Hannover, German. Readers ma contact the authors at the Telecommunications Laborator, Universit of Erlangen, Cauerstrasse 7, D-9158, Erlangen, German, {eisert, girod}@ nt.e-technik.uni-erlangen.de. International Smposium on Visualization Ma 1999 Vienna, Austria The IEEE Technical Committee on Computer Graphics is cosponsoring its first visualization conference in Europe. In conjunction with Eurographics, the TCCG will cosponsor the joint Eurographics-IEEE TCCG Smposium on Visualization on Ma 1999 in Vienna, Austria. A Program Committee will review both papers and case stud submissions. Attendance is open to all with strong representation epected from both sides of the Atlantic. There will also be other eciting events coordinated with the conference. For more information visit or the TCCG Web page at 78 September/October 1998
10 IEEE Computer Graphics and Applications 79
Analyzing Facial Expressions for Virtual Conferencing
IEEE Computer Graphics & Applications, pp. 70-78, September 1998. Analyzing Facial Expressions for Virtual Conferencing Peter Eisert and Bernd Girod Telecommunications Laboratory, University of Erlangen,
Affine Transformations
A P P E N D I X C Affine Transformations CONTENTS C The need for geometric transformations 335 C2 Affine transformations 336 C3 Matri representation of the linear transformations 338 C4 Homogeneous coordinates
Template-based Eye and Mouth Detection for 3D Video Conferencing
Template-based Eye and Mouth Detection for 3D Video Conferencing Jürgen Rurainsky and Peter Eisert Fraunhofer Institute for Telecommunications - Heinrich-Hertz-Institute, Image Processing Department, Einsteinufer
3D Arm Motion Tracking for Home-based Rehabilitation
hapter 13 3D Arm Motion Tracking for Home-based Rehabilitation Y. Tao and H. Hu 13.1 Introduction This paper presents a real-time hbrid solution to articulated 3D arm motion tracking for home-based rehabilitation
Introduction to polarization of light
Chapter 2 Introduction to polarization of light This Chapter treats the polarization of electromagnetic waves. In Section 2.1 the concept of light polarization is discussed and its Jones formalism is presented.
View Sequence Coding using Warping-based Image Alignment for Multi-view Video
View Sequence Coding using Warping-based mage Alignment for Multi-view Video Yanwei Liu, Qingming Huang,, Wen Gao 3 nstitute of Computing Technology, Chinese Academy of Science, Beijing, China Graduate
A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA
A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA N. Zarrinpanjeh a, F. Dadrassjavan b, H. Fattahi c * a Islamic Azad University of Qazvin - [email protected]
Very Low Frame-Rate Video Streaming For Face-to-Face Teleconference
Very Low Frame-Rate Video Streaming For Face-to-Face Teleconference Jue Wang, Michael F. Cohen Department of Electrical Engineering, University of Washington Microsoft Research Abstract Providing the best
VIRTUAL VIDEO CONFERENCING USING 3D MODEL-ASSISTED IMAGE-BASED RENDERING
VIRTUAL VIDEO CONFERENCING USING 3D MODEL-ASSISTED IMAGE-BASED RENDERING Peter Eisert Fraunhofer Institute for Telecommunications, Heinrich-Hertz-Institute Image Processing Department Einsteinufer 37,
A Study on Intelligent Video Security Surveillance System with Active Tracking Technology in Multiple Objects Environment
Vol. 6, No., April, 01 A Stud on Intelligent Video Securit Surveillance Sstem with Active Tracking Technolog in Multiple Objects Environment Juhun Park 1, Jeonghun Choi 1, 1, Moungheum Park, Sukwon Hong
This week. CENG 732 Computer Animation. Challenges in Human Modeling. Basic Arm Model
CENG 732 Computer Animation Spring 2006-2007 Week 8 Modeling and Animating Articulated Figures: Modeling the Arm, Walking, Facial Animation This week Modeling the arm Different joint structures Walking
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
Mathematics Placement Packet Colorado College Department of Mathematics and Computer Science
Mathematics Placement Packet Colorado College Department of Mathematics and Computer Science Colorado College has two all college requirements (QR and SI) which can be satisfied in full, or part, b taking
LINEAR FUNCTIONS OF 2 VARIABLES
CHAPTER 4: LINEAR FUNCTIONS OF 2 VARIABLES 4.1 RATES OF CHANGES IN DIFFERENT DIRECTIONS From Precalculus, we know that is a linear function if the rate of change of the function is constant. I.e., for
3 Optimizing Functions of Two Variables. Chapter 7 Section 3 Optimizing Functions of Two Variables 533
Chapter 7 Section 3 Optimizing Functions of Two Variables 533 (b) Read about the principle of diminishing returns in an economics tet. Then write a paragraph discussing the economic factors that might
1. a. standard form of a parabola with. 2 b 1 2 horizontal axis of symmetry 2. x 2 y 2 r 2 o. standard form of an ellipse centered
Conic Sections. Distance Formula and Circles. More on the Parabola. The Ellipse and Hperbola. Nonlinear Sstems of Equations in Two Variables. Nonlinear Inequalities and Sstems of Inequalities In Chapter,
Functions and Graphs CHAPTER INTRODUCTION. The function concept is one of the most important ideas in mathematics. The study
Functions and Graphs CHAPTER 2 INTRODUCTION The function concept is one of the most important ideas in mathematics. The stud 2-1 Functions 2-2 Elementar Functions: Graphs and Transformations 2-3 Quadratic
EQUILIBRIUM STRESS SYSTEMS
EQUILIBRIUM STRESS SYSTEMS Definition of stress The general definition of stress is: Stress = Force Area where the area is the cross-sectional area on which the force is acting. Consider the rectangular
Chapter 8. Lines and Planes. By the end of this chapter, you will
Chapter 8 Lines and Planes In this chapter, ou will revisit our knowledge of intersecting lines in two dimensions and etend those ideas into three dimensions. You will investigate the nature of planes
Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)
Project Proposal Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Sumedha Phatak-1000731131- [email protected] Objective: A study, implementation and comparison of
Zeros of Polynomial Functions. The Fundamental Theorem of Algebra. The Fundamental Theorem of Algebra. zero in the complex number system.
_.qd /7/ 9:6 AM Page 69 Section. Zeros of Polnomial Functions 69. Zeros of Polnomial Functions What ou should learn Use the Fundamental Theorem of Algebra to determine the number of zeros of polnomial
A Reliability Point and Kalman Filter-based Vehicle Tracking Technique
A Reliability Point and Kalman Filter-based Vehicle Tracing Technique Soo Siang Teoh and Thomas Bräunl Abstract This paper introduces a technique for tracing the movement of vehicles in consecutive video
When I was 3.1 POLYNOMIAL FUNCTIONS
146 Chapter 3 Polnomial and Rational Functions Section 3.1 begins with basic definitions and graphical concepts and gives an overview of ke properties of polnomial functions. In Sections 3.2 and 3.3 we
2D Geometrical Transformations. Foley & Van Dam, Chapter 5
2D Geometrical Transformations Fole & Van Dam, Chapter 5 2D Geometrical Transformations Translation Scaling Rotation Shear Matri notation Compositions Homogeneous coordinates 2D Geometrical Transformations
COMPLEX STRESS TUTORIAL 3 COMPLEX STRESS AND STRAIN
COMPLX STRSS TUTORIAL COMPLX STRSS AND STRAIN This tutorial is not part of the decel unit mechanical Principles but covers elements of the following sllabi. o Parts of the ngineering Council eam subject
Study and Implementation of Video Compression standards (H.264/AVC, Dirac)
Study and Implementation of Video Compression standards (H.264/AVC, Dirac) EE 5359-Multimedia Processing- Spring 2012 Dr. K.R Rao By: Sumedha Phatak(1000731131) Objective A study, implementation and comparison
SECTION 2.2. Distance and Midpoint Formulas; Circles
SECTION. Objectives. Find the distance between two points.. Find the midpoint of a line segment.. Write the standard form of a circle s equation.. Give the center and radius of a circle whose equation
Higher. Polynomials and Quadratics 64
hsn.uk.net Higher Mathematics UNIT OUTCOME 1 Polnomials and Quadratics Contents Polnomials and Quadratics 64 1 Quadratics 64 The Discriminant 66 3 Completing the Square 67 4 Sketching Parabolas 70 5 Determining
Data Mining Cluster Analysis: Basic Concepts and Algorithms. Clustering Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining
Data Mining Cluster Analsis: Basic Concepts and Algorithms Lecture Notes for Chapter 8 Introduction to Data Mining b Tan, Steinbach, Kumar Clustering Algorithms K-means and its variants Hierarchical clustering
Video compression: Performance of available codec software
Video compression: Performance of available codec software Introduction. Digital Video A digital video is a collection of images presented sequentially to produce the effect of continuous motion. It takes
Physics 53. Kinematics 2. Our nature consists in movement; absolute rest is death. Pascal
Phsics 53 Kinematics 2 Our nature consists in movement; absolute rest is death. Pascal Velocit and Acceleration in 3-D We have defined the velocit and acceleration of a particle as the first and second
Classification of Fingerprints. Sarat C. Dass Department of Statistics & Probability
Classification of Fingerprints Sarat C. Dass Department of Statistics & Probability Fingerprint Classification Fingerprint classification is a coarse level partitioning of a fingerprint database into smaller
SERVO CONTROL SYSTEMS 1: DC Servomechanisms
Servo Control Sstems : DC Servomechanisms SERVO CONTROL SYSTEMS : DC Servomechanisms Elke Laubwald: Visiting Consultant, control sstems principles.co.uk ABSTRACT: This is one of a series of white papers
PHOTOGRAMMETRIC TECHNIQUES FOR MEASUREMENTS IN WOODWORKING INDUSTRY
PHOTOGRAMMETRIC TECHNIQUES FOR MEASUREMENTS IN WOODWORKING INDUSTRY V. Knyaz a, *, Yu. Visilter, S. Zheltov a State Research Institute for Aviation System (GosNIIAS), 7, Victorenko str., Moscow, Russia
Tracking Moving Objects In Video Sequences Yiwei Wang, Robert E. Van Dyck, and John F. Doherty Department of Electrical Engineering The Pennsylvania State University University Park, PA16802 Abstract{Object
CS 534: Computer Vision 3D Model-based recognition
CS 534: Computer Vision 3D Model-based recognition Ahmed Elgammal Dept of Computer Science CS 534 3D Model-based Vision - 1 High Level Vision Object Recognition: What it means? Two main recognition tasks:!
DEFORMATION ANALYSIS USING SHEAROGRAPHY
DEFORMATION ANALYSIS USING SHEAROGRAPHY b Francisco Javier Casillas Rodríguez Submitted in partial fulfillment of the requirements for the degree of doctor in sciences (Optics at Centro de Investigaciones
Connecting Transformational Geometry and Transformations of Functions
Connecting Transformational Geometr and Transformations of Functions Introductor Statements and Assumptions Isometries are rigid transformations that preserve distance and angles and therefore shapes.
Automatic Labeling of Lane Markings for Autonomous Vehicles
Automatic Labeling of Lane Markings for Autonomous Vehicles Jeffrey Kiske Stanford University 450 Serra Mall, Stanford, CA 94305 [email protected] 1. Introduction As autonomous vehicles become more popular,
REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4
REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4 S. Heymann, A. Smolic, K. Mueller, Y. Guo, J. Rurainsky, P. Eisert, T. Wiegand Fraunhofer Institute
Introduction to Computer Graphics
Introduction to Computer Graphics Torsten Möller TASC 8021 778-782-2215 [email protected] www.cs.sfu.ca/~torsten Today What is computer graphics? Contents of this course Syllabus Overview of course topics
EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines
EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation
Parametric Comparison of H.264 with Existing Video Standards
Parametric Comparison of H.264 with Existing Video Standards Sumit Bhardwaj Department of Electronics and Communication Engineering Amity School of Engineering, Noida, Uttar Pradesh,INDIA Jyoti Bhardwaj
Performance Analysis and Comparison of JM 15.1 and Intel IPP H.264 Encoder and Decoder
Performance Analysis and Comparison of 15.1 and H.264 Encoder and Decoder K.V.Suchethan Swaroop and K.R.Rao, IEEE Fellow Department of Electrical Engineering, University of Texas at Arlington Arlington,
Solving Quadratic Equations by Graphing. Consider an equation of the form. y ax 2 bx c a 0. In an equation of the form
SECTION 11.3 Solving Quadratic Equations b Graphing 11.3 OBJECTIVES 1. Find an ais of smmetr 2. Find a verte 3. Graph a parabola 4. Solve quadratic equations b graphing 5. Solve an application involving
Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet
DICTA2002: Digital Image Computing Techniques and Applications, 21--22 January 2002, Melbourne, Australia Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet K. Ramkishor James. P. Mammen
WHITE PAPER. Are More Pixels Better? www.basler-ipcam.com. Resolution Does it Really Matter?
WHITE PAPER www.basler-ipcam.com Are More Pixels Better? The most frequently asked question when buying a new digital security camera is, What resolution does the camera provide? The resolution is indeed
Downloaded from www.heinemann.co.uk/ib. equations. 2.4 The reciprocal function x 1 x
Functions and equations Assessment statements. Concept of function f : f (); domain, range, image (value). Composite functions (f g); identit function. Inverse function f.. The graph of a function; its
We can display an object on a monitor screen in three different computer-model forms: Wireframe model Surface Model Solid model
CHAPTER 4 CURVES 4.1 Introduction In order to understand the significance of curves, we should look into the types of model representations that are used in geometric modeling. Curves play a very significant
15.1. Exact Differential Equations. Exact First-Order Equations. Exact Differential Equations Integrating Factors
SECTION 5. Eact First-Order Equations 09 SECTION 5. Eact First-Order Equations Eact Differential Equations Integrating Factors Eact Differential Equations In Section 5.6, ou studied applications of differential
How To Decode On A Computer Game On A Pc Or Mac Or Macbook
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 N2202 Tokyo, March 1998 INFORMATION
3D Human Face Recognition Using Point Signature
3D Human Face Recognition Using Point Signature Chin-Seng Chua, Feng Han, Yeong-Khing Ho School of Electrical and Electronic Engineering Nanyang Technological University, Singapore 639798 [email protected]
Efficient Storage, Compression and Transmission
Efficient Storage, Compression and Transmission of Complex 3D Models context & problem definition general framework & classification our new algorithm applications for digital documents Mesh Decimation
How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm
IJSTE - International Journal of Science Technology & Engineering Volume 1 Issue 10 April 2015 ISSN (online): 2349-784X Image Estimation Algorithm for Out of Focus and Blur Images to Retrieve the Barcode
Motion Planning for Dynamic Variable Inertia Mechanical Systems with Non-holonomic Constraints
Motion Planning for Dnamic Variable Inertia Mechanical Sstems with Non-holonomic Constraints Elie A. Shammas, Howie Choset, and Alfred A. Rizzi Carnegie Mellon Universit, The Robotics Institute Pittsburgh,
Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 [email protected]
Video Coding Standards Yao Wang Polytechnic University, Brooklyn, NY11201 [email protected] Yao Wang, 2003 EE4414: Video Coding Standards 2 Outline Overview of Standards and Their Applications ITU-T
7.3 Parabolas. 7.3 Parabolas 505
7. Parabolas 0 7. Parabolas We have alread learned that the graph of a quadratic function f() = a + b + c (a 0) is called a parabola. To our surprise and delight, we ma also define parabolas in terms of
Figure 1: Relation between codec, data containers and compression algorithms.
Video Compression Djordje Mitrovic University of Edinburgh This document deals with the issues of video compression. The algorithm, which is used by the MPEG standards, will be elucidated upon in order
Geometric Constraints
Simulation in Computer Graphics Geometric Constraints Matthias Teschner Computer Science Department University of Freiburg Outline introduction penalty method Lagrange multipliers local constraints University
A Short Introduction to Computer Graphics
A Short Introduction to Computer Graphics Frédo Durand MIT Laboratory for Computer Science 1 Introduction Chapter I: Basics Although computer graphics is a vast field that encompasses almost any graphical
HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT
International Journal of Scientific and Research Publications, Volume 2, Issue 4, April 2012 1 HANDS-FREE PC CONTROL CONTROLLING OF MOUSE CURSOR USING EYE MOVEMENT Akhil Gupta, Akash Rathi, Dr. Y. Radhika
Segmentation of building models from dense 3D point-clouds
Segmentation of building models from dense 3D point-clouds Joachim Bauer, Konrad Karner, Konrad Schindler, Andreas Klaus, Christopher Zach VRVis Research Center for Virtual Reality and Visualization, Institute
3D Scanner using Line Laser. 1. Introduction. 2. Theory
. Introduction 3D Scanner using Line Laser Di Lu Electrical, Computer, and Systems Engineering Rensselaer Polytechnic Institute The goal of 3D reconstruction is to recover the 3D properties of a geometric
Shape Measurement of a Sewer Pipe. Using a Mobile Robot with Computer Vision
International Journal of Advanced Robotic Systems ARTICLE Shape Measurement of a Sewer Pipe Using a Mobile Robot with Computer Vision Regular Paper Kikuhito Kawasue 1,* and Takayuki Komatsu 1 1 Department
LESSON EIII.E EXPONENTS AND LOGARITHMS
LESSON EIII.E EXPONENTS AND LOGARITHMS LESSON EIII.E EXPONENTS AND LOGARITHMS OVERVIEW Here s what ou ll learn in this lesson: Eponential Functions a. Graphing eponential functions b. Applications of eponential
INVESTIGATIONS AND FUNCTIONS 1.1.1 1.1.4. Example 1
Chapter 1 INVESTIGATIONS AND FUNCTIONS 1.1.1 1.1.4 This opening section introduces the students to man of the big ideas of Algebra 2, as well as different was of thinking and various problem solving strategies.
5.2 Inverse Functions
78 Further Topics in Functions. Inverse Functions Thinking of a function as a process like we did in Section., in this section we seek another function which might reverse that process. As in real life,
Efficient Coding Unit and Prediction Unit Decision Algorithm for Multiview Video Coding
JOURNAL OF ELECTRONIC SCIENCE AND TECHNOLOGY, VOL. 13, NO. 2, JUNE 2015 97 Efficient Coding Unit and Prediction Unit Decision Algorithm for Multiview Video Coding Wei-Hsiang Chang, Mei-Juan Chen, Gwo-Long
Lines and Planes 1. x(t) = at + b y(t) = ct + d
1 Lines in the Plane Lines and Planes 1 Ever line of points L in R 2 can be epressed as the solution set for an equation of the form A + B = C. The equation is not unique for if we multipl both sides b
also describes the method used to collect the data for the faces. These techniques could be used to animate other flexible surfaces.
Computer Generated Animation of Faces Frederick I. Parke, University of Utah This paper describes the representation, animation and data collection techniques that have been used to produce "realistic"
Classifying Manipulation Primitives from Visual Data
Classifying Manipulation Primitives from Visual Data Sandy Huang and Dylan Hadfield-Menell Abstract One approach to learning from demonstrations in robotics is to make use of a classifier to predict if
Single Depth Image Super Resolution and Denoising Using Coupled Dictionary Learning with Local Constraints and Shock Filtering
Single Depth Image Super Resolution and Denoising Using Coupled Dictionary Learning with Local Constraints and Shock Filtering Jun Xie 1, Cheng-Chuan Chou 2, Rogerio Feris 3, Ming-Ting Sun 1 1 University
Spin-lattice and spin-spin relaxation
Spin-lattice and spin-spin relaation Sequence of events in the NMR eperiment: (i) application of a 90 pulse alters the population ratios, and creates transverse magnetic field components (M () ); (ii)
Client Based Power Iteration Clustering Algorithm to Reduce Dimensionality in Big Data
Client Based Power Iteration Clustering Algorithm to Reduce Dimensionalit in Big Data Jaalatchum. D 1, Thambidurai. P 1, Department of CSE, PKIET, Karaikal, India Abstract - Clustering is a group of objects
Building an Advanced Invariant Real-Time Human Tracking System
UDC 004.41 Building an Advanced Invariant Real-Time Human Tracking System Fayez Idris 1, Mazen Abu_Zaher 2, Rashad J. Rasras 3, and Ibrahiem M. M. El Emary 4 1 School of Informatics and Computing, German-Jordanian
Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences
Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences Byoung-moon You 1, Kyung-tack Jung 2, Sang-kook Kim 2, and Doo-sung Hwang 3 1 L&Y Vision Technologies, Inc., Daejeon,
B4 Computational Geometry
3CG 2006 / B4 Computational Geometry David Murray [email protected] www.robots.o.ac.uk/ dwm/courses/3cg Michaelmas 2006 3CG 2006 2 / Overview Computational geometry is concerned with the derivation
PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO
PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO V. Conotter, E. Bodnari, G. Boato H. Farid Department of Information Engineering and Computer Science University of Trento, Trento (ITALY)
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
w = COI EYE view direction vector u = w ( 010,, ) cross product with y-axis v = w u up vector
. w COI EYE view direction vector u w ( 00,, ) cross product with -ais v w u up vector (EQ ) Computer Animation: Algorithms and Techniques 29 up vector view vector observer center of interest 30 Computer
Understanding Compression Technologies for HD and Megapixel Surveillance
When the security industry began the transition from using VHS tapes to hard disks for video surveillance storage, the question of how to compress and store video became a top consideration for video surveillance
Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur
Module 8 VIDEO CODING STANDARDS Version ECE IIT, Kharagpur Lesson H. andh.3 Standards Version ECE IIT, Kharagpur Lesson Objectives At the end of this lesson the students should be able to :. State the
Bernice E. Rogowitz and Holly E. Rushmeier IBM TJ Watson Research Center, P.O. Box 704, Yorktown Heights, NY USA
Are Image Quality Metrics Adequate to Evaluate the Quality of Geometric Objects? Bernice E. Rogowitz and Holly E. Rushmeier IBM TJ Watson Research Center, P.O. Box 704, Yorktown Heights, NY USA ABSTRACT
More Equations and Inequalities
Section. Sets of Numbers and Interval Notation 9 More Equations and Inequalities 9 9. Compound Inequalities 9. Polnomial and Rational Inequalities 9. Absolute Value Equations 9. Absolute Value Inequalities
Overview: Video Coding Standards
Overview: Video Coding Standards Video coding standards: applications and common structure Relevant standards organizations ITU-T Rec. H.261 ITU-T Rec. H.263 ISO/IEC MPEG-1 ISO/IEC MPEG-2 ISO/IEC MPEG-4
EXPANDING THE CALCULUS HORIZON. Hurricane Modeling
EXPANDING THE CALCULUS HORIZON Hurricane Modeling Each ear population centers throughout the world are ravaged b hurricanes, and it is the mission of the National Hurricane Center to minimize the damage
SECTION 7-4 Algebraic Vectors
7-4 lgebraic Vectors 531 SECTIN 7-4 lgebraic Vectors From Geometric Vectors to lgebraic Vectors Vector ddition and Scalar Multiplication Unit Vectors lgebraic Properties Static Equilibrium Geometric vectors
MetropoGIS: A City Modeling System DI Dr. Konrad KARNER, DI Andreas KLAUS, DI Joachim BAUER, DI Christopher ZACH
MetropoGIS: A City Modeling System DI Dr. Konrad KARNER, DI Andreas KLAUS, DI Joachim BAUER, DI Christopher ZACH VRVis Research Center for Virtual Reality and Visualization, Virtual Habitat, Inffeldgasse
H 261. Video Compression 1: H 261 Multimedia Systems (Module 4 Lesson 2) H 261 Coding Basics. Sources: Summary:
Video Compression : 6 Multimedia Systems (Module Lesson ) Summary: 6 Coding Compress color motion video into a low-rate bit stream at following resolutions: QCIF (76 x ) CIF ( x 88) Inter and Intra Frame
Automatic Restoration Algorithms for 35mm film
P. Schallauer, A. Pinz, W. Haas. Automatic Restoration Algorithms for 35mm film. To be published in Videre, Journal of Computer Vision Research, web: http://mitpress.mit.edu/videre.html, 1999. Automatic
