Relaxed Pairwise Learned Metric for Person Re-Identification
|
|
|
- Shon Turner
- 10 years ago
- Views:
Transcription
1 Relaxed Pairwise Learned Metric for Person Re-Identification Martin Hirzer, Peter M. Roth, Martin Köstinger, and Horst Bischof Institute for Computer Graphics and Vision Graz University of Technology Abstract. Matching persons across non-overlapping cameras is a rather challenging task. Thus, successful methods often build on complex feature representations or sophisticated learners. A recent trend to tackle this problem is to use metric learning to find a suitable space for matching samples from different cameras. However, most of these approaches ignore the transition from one camera to the other. In this paper, we propose to learn a metric from pairs of samples from different cameras. In this way, even less sophisticated features describing color and texture information are sufficient for finally getting state-of-the-art classification results. Moreover, once the metric has been learned, only linear projections are necessary at search time, where a simple nearest neighbor classification is performed. The approach is demonstrated on three publicly available datasets of different complexity, where it can be seen that state-of-the-art results can be obtained at much lower computational costs. 1 Introduction Person re-identification, i.e., recognizing an individual across spatially disjoint cameras, is becoming one of the major challenges in visual surveillance. Typical applications include but are not limited to tracking criminals, analyzing crowd movements in public places, and finding children who lost their parents. Since the number of public areas that become subject to video surveillance is ever growing, efficient, automatic systems are required to reduce the load on human operators. In general, person re-identification is very challenging for several reasons. First, the appearance of an individual can vary extremely across a network of cameras due to changing view points, illumination, different poses, etc. Second, there is a potentially high number of similar persons (e.g., people wear rather dark clothes in winter). Third, in contrast to similar large scale search problems typically no accurate temporal and spatial constraints can be exploited to ease the task. Thus, motivated by the high number of practical applications and still unresolved difficulties there has been an increased scientific interest (e.g., [1 10]) in recent years, and also various benchmark datasets [11, 8, 6] have been published.
2 2 Hirzer et al. Due to the complexity of the task different strategies have been proposed to solve it. The methods can be roughly divided into the following groups: (a) descriptive methods (e.g., [2, 1, 12, 13]), (b) discriminative methods (e.g., [3, 5 7, 14, 8, 4]), and (c) metric learning methods (e.g., [10, 15]). The main idea of descriptive methods is to extract visual features that are both, distinctive and stable under changing conditions between different cameras. After feature extraction, a standard distance measure is applied to compare different person representations. However, large intra-class appearance variations often prevent the computation of distinctive and stable features under realistic conditions. To overcome these limitations, discriminative methods additionally take advantage of class information to exploit the discriminative information given by the data. But, as a drawback, such methods tend to overfit to the training data. Moreover, they are often based on local image descriptors, which might be a severe disadvantage. For instance, a red bag visible in on view would be very discriminative, however, if it is not visible in the other view it becomes impossible to find the specific person again. A midway between both approaches is to use metric learning [10, 15]. Such methods are similar to descriptive methods the data is modeled by some kind of descriptive feature. However, they differ as the descriptors are not directly compared in the feature space, but instead a non-euclidean distance is used. Since this has to be estimated beforehand, in contrast to descriptive models a training stage is necessary. In fact, such metrics describe the transition in feature space between two camera views, which makes these approaches much more suitable for real world scenarios. Moreover, during evaluation metric learning approaches are very efficient since additionally to the feature extraction and the matching only a linear projection has to be computed. However, there are two main drawbacks considering the training stage. First, existing metric learners such as Large Margin Nearest Neighbor (LMNN) [16], Information Theoretic Metric Learning (ITML) [17], and Logistic Discriminant Metric Learning (LDML) [18] build on complex optimization schemes resulting in high computational costs. Second, these methods typically assume a multi-class classification problem, which is not the case for person re-identification. In fact, we are only given image pairs, so existing methods have to be adapted. There are only a few methods such as [19, 20] which directly indent learning a metric from data pairs. But again, these methods build on complex numerical methods, making them infeasible in practice. The goal of this work is to benefit from the advantages of metric learning, however, reducing the computational effort. In fact, we introduce a pairwise metric learning approach taking advantage of the structure of the data, however, aim to reduce the computational effort. This is realized by relaxing the original hard constraints, getting a simpler problem and thus avoiding iterative procedures. In this way, we finally obtain state-of-the-art or even significantly better results on three different datasets of varying size and complexity. This is in particular of interest, since, compared to existing methods, only rather simple image descriptors extracting color (HSV and Lab) and texture information (LBP) are
3 Relaxed Pairwise Learned Metric for Person Re-Identification 3 used. Moreover, we give a detailed comparison to other methods, an analysis of runtime, and show the influence of the number of training samples. 2 Related Work Many of the proposed approaches try to tackle the person re-identification problem by seeking a very distinctive and at the same time stable feature representation for describing a person s appearance. For instance, Gheissari et al. [1] try to fit a triangulated graph to each person to cope with pose variations. However, their approach works only if people are seen from similar viewpoints, which is not the case in most practical setups. The same restriction applies for the work of Wang et al. [2]. The image of a person is divided into regions and their color spatial structure is captured by a co-occurrence matrix. In [12], Farenzena et al. combine multiple features to describe the appearance of a person by exploiting perceptual principles. After obtaining a person s silhouette through a segmentation step, symmetry and asymmetry axes are found and used for accumulating color and texture feature responses. Cheng et al. [21] apply Pictorial Structures to person re-identification. They fit a body configuration composed of chest, head, thighs, and legs on pedestrian images and extract per-part color information as well as color displacement within the whole body. The extracted descriptors are then used in a matching step. Additionally, the authors introduce a method to customize the fit of Pictorial Structures on a specific person in cases when more images are available. In contrast to such descriptive approaches relying on hand crafted features, other methods aim at learning a more discriminative feature model. For instance, Bak et al. [4] first apply a person detector, and then use AdaBoost to generate a visual signature consisting of Haar-like features. Gray and Tao [3] use AdaBoost to select the most relevant out of a set of color and texture features. To compare corresponding features they additionally estimate a likelihood ratio test providing a similarity function. Lin and Davis [5] propose to learn pairwise dissimilarities that can be applied for nearest neighbor classification. Schwartz and Davis [6] use Partial Least Squares reduction to project high dimensional signatures onto a low dimensional discriminant space. Another method is presented by Prosser et al. [7]. Here, the person re-identification problem is formulated as a ranking problem. The authors introduce Ensemble RankSVM, a method that learns a subspace where the potential true match gets the highest rank. Hirzer et al. [8] combine both of the aforementioned strategies, i.e., they apply a descriptive and a discriminative model in parallel, showing that using the complementary information captured by both models leads to improved performance. Exploiting geometry was proposed by Baltieri et al. [22]. They generate a highly sophisticated 3D human body model from foreground segmented person images, which can then be matched using a histogram based distance. The authors put strong assumptions on the input data and require a sufficiently accurate tracker, capable of extracting foreground segmented images as well as position
4 4 Hirzer et al. and orientation data of persons. To improve classification results some methods also exploit additional cues besides visual information. Makris et al. [23] and Rahimi et al. [13] simplify the problem by applying temporal constraints based on the spatial layout of the observed scene. Javed et al. [24] try to learn transitions between cameras to cope with illumination changes, and Zheng et al. [9] use contextual, visual information that comes from surrounding people. A new direction that has recently been pursued is metric learning. Dikmen et al. [10] learn a Mahalanobis distance that is optimal for k-nearest neighbor classification using a maximum margin formulation. In fact, they extend the work of Weinberger and Saul [16] by introducing a rejection option into the LMNN framework. This option enables the LMNN classifier to return no matches if all nearest neighbors are beyond a certain distance, thereby telling the user that a searched person does not occur in a certain scene. Similarly, Zheng et al. [15] also use metric learning, but formulate it in a probabilistic manner. They seek a distance that maximizes the probability of a matching pair having a smaller distance than a non-matching pair. 3 Relaxed Pairwise Metric Learning Metric learning for person re-identification has been previously addressed [10, 15]. However, as a main drawback, these methods require computationally complex optimization schemes, which hamper a practical application in large scale scenarios. Thus, the goal of this paper is to introduce a more efficient, still effective metric learning approach, which will be derived in the following. One prominent approach for metric learning is Mahalanobis distance learning. Given n data points x i R m, the goal is to estimate a matrix M such that d M (x i, x j ) = (x i x j ) M(x i x j ) (1) describes a pseudo-metric. In fact, this is assured if M is positive semi-definite, i.e., M 0. If M = Σ 1, i.e., the inverse of the covariance matrix Σ, the distance defined by Eq. (1) is referred to as the Mahalanobis distance. An alternative formulation for Eq. (1), which is more intuitive, is given via the squared distance which is easily obtained from d L (x i, x j ) = L(x i x j ) 2, (2) (x i x j ) M(x i x j ) = (x i x j ) L } {{ L } (x i x j ) = L(x i x j ) 2. (3) M If additionally class labels are given, not only the generative structure of the data but also the discriminative information can be exploited. However, the person re-identification problem is lacking class labels, but we can exploit
5 Relaxed Pairwise Learned Metric for Person Re-Identification 5 the information that the data is given in form of pairs. Thus, we break down the original multi-class problem into a two-class problem in two steps. First, we transform the samples from the data space to the label agnostic difference space, which is inherently given by the metric definition in Eqs. (1) and (2). Second, the original class labels are discarded and the samples are arranged using pairwise equality and inequality constraints, where we obtain the class same S if two data points x i and x j share the same label and the class different D otherwise. In our case sharing a label means that the samples x i and x j describe views of the same person in different cameras. In the following, we introduce an efficient but still effective solution for this task. We build on the simple observation that distances d L (x i, x j ) for (x i, x j ) S should be small whereas they are expected to be large for (x i, x j ) D. This can be realized by using the following objective function: L(L) = 1 S (i,j) S L(x i x j ) 2 1 D (i,j) D L(x i x j ) 2. (4) Let be the Frobenius norm, then we can re-write the squared distances: L(x i x j ) 2 = L(x i x j ), L(x i x j ) = tr ( (x i x j ) L L(x i x j ) ) = tr ( M(x i x j )(x i x j ) ), (5) where, indicates the inner product and tr( ) the matrix trace. In particular, we are exploiting the fact that the inner product can be transferred to a trace formulation and that the trace is invariant under cyclic permutations. Thus, we can re-write the objective function Eq. (4) to Finally, let L(M) = 1 S 1 D (i,j) S Σ S = 1 S (i,j) D Σ D = 1 D (i,j) S tr ( M(x i x j )(x i x j ) ) (i,j) D tr ( M(x i x j )(x i x j ) ). (6) (x i x j )(x i x j ) (7) (x i x j )(x i x j ) (8) be the sample covariance matrices of S and D, respectively, and taking into account the linear properties of the trace, we get the objective function L(M) = tr(mσ S ) tr(mσ D ) = tr(m(σ S Σ D )). (9) To avoid trivial solutions and an overfitting to the training data an additional regularization is required. In our case, we would like to enforce the minimization
6 6 Hirzer et al. of the distances within S but also to avoid unbounded distances for D, which would degenerate the metric. Thus, we introduce a scaling on both, S and D, obtaining the optimization problem min L(M) s.t. M 0, LΣ S L = I, LΣ D L = I, (10) which is hard to solve (i.e., requires a complex iteration scheme). Hence, we relax the positivity constraint M 0. Further taking into account that tr(mσ S ) = tr(lσ S L ) = tr(lσ D L ) = tr(mσ D ) = m (11) the optimization problem Eq. (10) can be relaxed to solve tr(m(σ S Σ D )) = 0. (12) Technically, due to the relaxation the finally obtained matrix M does not describe a pseudo-metric. Nevertheless, the experimental results show that the estimated solution provides a sufficient approximation for the given task and that competitive results can be obtained; however, on a much lower computational effort. 4 Person Re-ID System In the following, we introduce our proposed person re-identification system consisting of three stages: (1) feature extraction, (2) metric learning, and (3) classification. The overall system is illustrated in Figure 1. During training the metric between two cameras is estimated, which is then used for calculating the distances between an unknown sample and the samples given in the database. The three steps are discussed in more detail in the next sections. Fig. 1. Person re-identification system consisting of three stages: (1) feature extraction dense sampling of color and texture features, (2) metric learning exploiting the structure of similar and dissimilar pairs, (3) classification nearest neighbor search under the learned metric.
7 Relaxed Pairwise Learned Metric for Person Re-Identification Representation Color and texture features have proven to be successful for the task of person re-identification. We use HSV and Lab color channels as well as Local Binary Patterns to create a person image representation. The features are extracted from 8x16 rectangular regions sampled from the image with a grid of 4x8 pixels, i.e., 50% overlap in both directions, which is illustrated in Figure 2. Fig. 2. Global image descriptor: different local features (HSV, Lab, LBP) are extracted from overlapping regions and are then concatenated to a single feature vector. In each rectangular patch we calculate the mean values per color channel, which are then discretized to the range 0 to 40. Additionally, a histogram of LBP codes is generated from a gray value representation of the patch. These values are then put together to form a feature vector. Finally, the vectors from all regions are concatenated to generate a representation for the whole image. 4.2 Metric Learning First of all, we run a PCA step, to reduce the dimensionality and for noise removal. In general, this step is not critical (the particular settings are given in Sec. 5), however, we recognized that for smaller datasets also a lower dimensional representation is sufficient. During training we learn a Mahalanobis metric M according to Eq. (12). Once M has been estimated, during evaluation the distance between two samples x i and x j is calculated via Eq. (1). Hence, additionally to the actual classification effort only linear projections are required. 4.3 Classification In person re-identification we want to recognize a certain person across different, non-overlapping camera views. In the work at hand, we assume that we have already detected the persons in all camera views, i.e., we do not tackle the detection problem. The goal of person re-identification now is to find a person image that has been selected in one view (probe image) in all the images from another view (gallery images). This is achieved by calculating the distances between the probe image and all gallery images using the learned metric, and returning those gallery images with the smallest distances as potential matches.
8 8 Hirzer et al. 5 Experimental Results We evaluated our approach on three publicly available datasets, the VIPeR dataset [11], the PRID 2011 dataset (single shot version) [8], and the ETHZ dataset [6]. Examples of all three are shown in Figure 3. We chose these datasets because they provide many challenges faced in real world person re-identification applications, e.g., viewpoint, pose and illumination changes, different backgrounds, image resolutions, occlusions, etc. Since the VIPeR dataset is widely used for evaluating person re-identification methods, we performed a more detailed analysis on this dataset. This includes an analysis of different training set sizes, comparisons to the state-of-the-art for person re-identification as well as for metric learning, and an analysis of computation times. As mentioned before, the PCA dimensions are not critical, but in particular we used 75 dimensions for VIPeR, 40 for PRID 2011 and ETHZ SEQ. #1, 20 for ETHZ SEQ. #2, and 15 for ETHZ SEQ. #3. (a) (b) (c) Fig. 3. Example image pairs from the VIPeR (a), the PRID 2011 (b), and the ETHZ dataset (c). Upper and lower row correspond to different appearances of the same person. 5.1 VIPeR Dataset The VIPeR dataset contains 632 person image pairs taken from two different camera views. Changes of viewpoint, illumination and pose are the most prominent sources of appearance variation between the two images of a person. For evaluation we followed the procedure described in [3]. The set of 632 image pairs is randomly split into two sets of 316 image pairs each, one for training and one for testing. In the test case, the two images of an image pair are randomly assigned to a probe and a gallery set. A single image from the probe set is then selected and matched with all images from the gallery set. This process is repeated for all images in the probe set. The whole evaluation procedure is carried out 10 times, and the average result is reported in form of a Cumulative Matching Characteristic (CMC) curve [2], which represents the expectation of finding the true match within the first r ranks.
9 Relaxed Pairwise Learned Metric for Person Re-Identification 9 The thus obtained results are shown in Figure 4. In addition, we give a comparison to simple feature matching using Euclidean distance and to baseline approaches, i.e., Linear Discriminant Analysis (LDA) and standard Mahalanobis metric. It is obvious that using the proposed metric leads to a huge performance gain over simple feature matching and that also the baselines can be outperformed. Moreover, in Table 1 we show a comparison of our approach to the state-of-the-art in both, person re-identification and metric learning. In the latter case, exactly the same features and training and test set splits have been used. As can be seen, our method outperforms all other methods over the whole range of ranks, however, at much reduced computational costs. In Table 1 we also analyze the computation time of our method using a Matlab implementation on a 2.83 GHz quad core CPU. The big advantage of our approach compared to others is its training time efficiency, since it does not rely on computationally complex optimization schemes. Please note, the standard Mahalanobis metric is still more efficient, however, the performance gain justifies the slightly higher computational effort. Using the learned metric during the evaluation step is efficient either, making it suitable for even large scale problems. Moreover, we investigated the influence of the training set size on the performance. Following [15], we reduced the number of training samples from 316 to 200 and 100 respectively. A comparison of our method using the original and the reduced training sets is depicted in Figure 5. As can be seen, reducing the number of training samples to 200 has only little influence on our method. Further reducing it to 100 decreases the performance notably. With decreasing number of samples estimating a reliable metric becomes more and more difficult. Still, our method outperforms the state-of-the-art on both reduced training sets, as shown in Table CMC 100 CMC Matching Rate (%) Matching Rate (%) Proposed Mahalanobis LDA Euclidean N train = 316 N train = 200 N train = Rank Rank Fig. 4. Average CMC curve of our approach, Mahalanobis, LDA, and feature matching using Euclidean distance on the VIPeR dataset. Fig. 5. Average CMC curve of our approach for different training set sizes on the VIPeR dataset.
10 10 Hirzer et al. Method r = Timings Proposed sec LMNN [16] sec ITML [17] sec LDML [18] sec Mahalanobis sec LDA sec ELF [3] hrs SDALF [12] ERSVM [7] min DDC [8] PS [21] PRDC [15] min LMNN-R* [10] Table 1. Comparison of matching rates in [%] at different ranks r and, if available, average training times per trial on the VIPeR dataset. (* indicates that the best run was reported, which cannot be directly compared to the other results!) Method N train = 200 N train = 100 r = r = Proposed Best reported in [15] Table 2. Matching rates of our approach and the best methods reported in [15] in [%] at different ranks r on reduced training sets. 5.2 PRID 2011 Dataset The PRID 2011 dataset consists of person images recorded from two different static surveillance cameras, where we used the single shot scenario (one image per person in each camera view) for our evaluation. Typical challenges on this dataset are viewpoint and pose changes as well as significant differences in illumination, background and camera characteristics. Camera view A contains 385 persons, camera view B contains 749 persons, with 200 of them appearing in both views. Hence, there are 200 person image pairs in the dataset. These image pairs are randomly split into a training and a test set of equal size. For evaluation on the test set, we followed the procedure described in [8], i.e., camera A is used for the probe set and camera B is used for the gallery set. Thus, each of the 100 persons in the probe set is searched in a gallery set of 649 persons (all images of camera view B except the 100 training samples). Again, the whole procedure is repeated 10 times and the result is reported in form of an average CMC curve in Figure 6. As can be seen, applying the proposed metric leads to superior performance compared to using LDA or Euclidean distance. Since the diversity in this dataset is smaller, i.e., it contains a lot of similar persons, the benefits of discriminative learning cannot be fully exploited.
11 Relaxed Pairwise Learned Metric for Person Re-Identification CMC Matching Rate (%) Proposed Mahalanobis 10 LDA Euclidean Rank Fig. 6. Average CMC curve of our approach, Mahalanobis, LDA, and feature matching using Euclidean distance on the PRID 2011 dataset. Table 3 compares our approach to the work of Hirzer et al. [8] for different ranks. Their approach uses a descriptive and a discriminative model to rank the images in the gallery set. The highest matching rates are achieved by a combination of both models. However, their system is designed for human-inthe-loop interaction, so a human operator actually makes the decision which of both models to use. In particular, we compare our method to their descriptive model, which also uses a single shot setup, i.e., person image pairs. In contrast, their discriminative model uses a multi shot setup, so that a fair comparison is not possible. As can be seen by the numbers, our method clearly outperforms their descriptive model at all ranks, especially in the middle range with nearly 20% performance gain. Also note that the approach of Hirzer et al. does not need a training phase, so the authors do not split the image pairs into a training and a test set. Instead, they use all 200 image pairs and all 749 gallery images of camera view B. This is slightly different from our setup, where we exclude the 100 training image pairs from the test phase. Method r = Proposed Descr. Model [8] Table 3. Matching rates of our approach and the descriptive model of [8] in [%] at different ranks r on the PRID 2011 dataset. 5.3 ETHZ Dataset The ETHZ dataset [6] contains video sequences of urban scenes captured from moving cameras. Originally proposed for pedestrian detection [25] it was later
12 12 Hirzer et al. modified for benchmarking person re-identification approaches. The dataset consists of person images extracted from three video sequences structured as follows: SEQ. #1 contains 83 persons (4.857 images), SEQ. #2 contains 35 persons (1.961 images), and SEQ. #3 contains 28 persons (1.762 images). All images have been resized to 64x32 pixels. The most challenging aspects of the ETHZ dataset are illumination changes and occlusions. However, since person images are captured from a single moving camera, the dataset does not provide a realistic scenario for person re-identification with multiple, disjoint cameras, different viewpoints, different camera characteristics, etc. Despite this limitation it is commonly used for person re-identification, so we also evaluated our approach on this dataset. Similar to [12] and [6] we use a single shot evaluation strategy, i.e., we randomly sample two images per person to build a training pair, and another two images to build a test pair. The images of the test pairs are then assigned to the probe and the gallery set. After learning a metric from the training pairs, each image in the probe set is matched with all images in the gallery set. The whole procedure is repeated 10 times to generate an average CMC curve, as shown in Figure CMC 100 CMC 100 CMC Matching Rate (%) Matching Rate (%) Matching Rate (%) Proposed Mahalanobis LDA Euclidean Proposed Mahalanobis LDA Euclidean Proposed Mahalanobis LDA Euclidean Rank Rank Rank (a) SEQ. #1 (b) SEQ. #2 (c) SEQ. #3 Fig. 7. Average CMC curve of our approach, Mahalanobis, LDA, and feature matching using Euclidean distance on the ETHZ dataset. According to [6], only the first 7 ranks are shown. As already stated above, here only tracks of persons are considered but not different camera views. Hence, as expected, metric learning which mainly captures the transitions in inter-camera feature space has only little influence. Moreover, especially for SEQ. #2 and SEQ. #3 the number of training samples is very small, making metric learning quite difficult. Nevertheless, for the larger sequence, SEQ. #1, a clear performance gain can be achieved. Moreover, in Table 4 we compare our matching rates to two other methods that also use a single shot evaluation, namely SDALF [12] and PLS [6]. On SEQ. #1, we achieve state-ofthe-art results comparable to [6], and outperform [12]. Interestingly, even using the Euclidean distance slightly outperforms [12]. Although on the other two sequences the power of metric learning is not fully exploited, results in the range of the state-of-the-art (SEQ. #2) or better (SEQ. #3) can be obtained.
13 Relaxed Pairwise Learned Metric for Person Re-Identification 13 SEQ. #1 SEQ. #2 SEQ. #3 Method Proposed SDALF [12] PLS [6] Table 4. Matching rates of our approach, SDALF (single shot), and PLS in [%] on the ETHZ dataset at the first 7 ranks. 6 Conclusion Recently, metric learning was introduced for the task of person re-identification, which is a considerable tradeoff between descriptive and discriminative modeling. In fact, good results can be obtained, however, at high computational costs. Since for the given task even the training should be efficient, in this work we targeted a more efficient metric learning approach. In particular, we proposed to use a discriminative Mahalanobis metric learning, which can be efficiently solved after some relaxations. The benefits of the proposed method are clearly demonstrated in the experimental results, where we show state-of-the-art or even better results on three standard benchmark datasets, i.e., VIPeR, PRID 2011, and ETHZ. This is in particular of interest, since we build on a quite simple representation. In fact, compared to the usage of Euclidean distance metric learning drastically boosts the performance. Moreover, the results reveal that the benefits of discriminative metric learning can be fully exploited for highly diverse views. Furthermore, we gave evaluations on timings and the importance of number of training samples. Future work would include the application of more sophisticated features and a more detailed study of the relaxed optimization problem. Acknowledgments The work was supported by the Austrian Science Foundation (FWF) project Advanced Learning for Tracking and Detection in Medical Workflow Analysis (I535-N23) and by the Austrian Research Promotion Agency (FFG) project SHARE in the IV2Splus program. References 1. Gheissari, N., Sebastian, T.B., Hartley, R.: Person reidentification using spatiotemporal appearance. In: Proc. CVPR. (2006) 2. Wang, X., Doretto, G., Sebastian, T.B., Rittscher, J., Tu, P.H.: Shape and appearance context modeling. In: Proc. ICCV. (2007) 3. Gray, D., Tao, H.: Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Proc. ECCV. (2008) 4. Bak, S., Corvee, E., Brémond, F., Thonnat, M.: Person re-idendification using Haar-based and DCD-based signature. In: Proc. Workshop on Activity Monitoring by Multi-Camera Surveillance Systems. (2010)
14 14 Hirzer et al. 5. Lin, Z., Davis, L.S.: Learning pairwise dissimilarity profiles for appearance recognition in visual surveillance. In: Advances Int l Visual Computing Symposium. (2008) 6. Schwartz, W.R., Davis, L.S.: Learning discriminative appearance-based models using partial least squares. In: Proc. Brazilian Symposium on Computer Graphics and Image Processing. (2009) 7. Prosser, B., Zheng, W.S., Gong, S., Xiang, T.: Person re-identification by support vector ranking. In: Proc. BMVC. (2010) 8. Hirzer, M., Beleznai, C., Roth, P.M., Bischof, H.: Person re-identification by descriptive and discriminative classification. In: Proc. SCIA. (2011) 9. Zheng, W.S., Gong, S., Xiang, T.: Associating groups of people. In: Proc. BMVC. (2009) 10. Dikmen, M., Akbas, E., Huang, T.S., Ahuja, N.: Pedestrian recognition with a learned metric. In: Proc. ACCV. (2010) 11. Gray, D., Brennan, S., Tao, H.: Evaluating appearance models for recognition, reacquisition, and tracking. In: Proc. PETS. (2007) 12. Farenzena, M., Bazzani, L., Perina, A., Murino, V., Cristani, M.: Person reidentification by symmetry-driven accumulation of local features. In: Proc. CVPR. (2010) 13. Rahimi, A., Dunagan, B., Darrell, T.: Simultaneous calibration and tracking with a network of non-overlapping sensors. In: Proc. CVPR. (2004) 14. Chapelle, O., Keerthi, S.S.: Efficient algorithms for ranking with SVMs. Information Retrieval 13 (2010) Zheng, W.S., Gong, S., Xiang, T.: Person re-identification by probabilistic relative distance comparison. In: Proc. CVPR. (2011) 16. Weinberger, K.Q., Saul, L.K.: Fast solvers and efficient implementations for distance metric learning. In: Proc. ICML. (2008) 17. Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: Proc. ICML. (2007) 18. Guillaumin, M., Verbeek, J., Schmid, C.: Is that you? Metric learning approaches for face identification. In: Proc. ICCV. (2009) 19. Ghodsi, A., Wilkinson, D.F., Southey, F.: Improving embeddings by flexible exploitation of side information. In: Proc. Int l Joint Conf. on Artificial Intelligence. (2007) 20. Alipanahi, B., Biggs, M., Ghodsi, A.: Distance metric learning vs. fisher discriminant analysis. In: Proc. AAAI Conf. on Artificial Intelligence. (2008) 21. Cheng, D.S., Cristani, M., Stoppa, M., Bazzani, L., Murino, V.: Custom pictorial structures for re-identification. In: Proc. BMVC. (2011) 22. Baltieri, D., Vezzani, R., Cucchiara, R.: SARC3D: A new 3d body model for people tracking and re-identification. In: Proc. Int l Conf. on Image Analysis and Processing. (2011) 23. Makris, D., Ellis, T., Black, J.: Bridging the gaps between cameras. In: Proc. CVPR. (2004) 24. Javed, O., Shafique, K., Shah, M.: Appearance modeling for tracking in multiple non-overlapping cameras. In: Proc. CVPR. (2005) 25. Ess, A., Leibe, B., Gool, L.v.: Depth and appearance for mobile scene analysis. In: Proc. ICCV. (2007)
Learning Locally-Adaptive Decision Functions for Person Verification
Learning Locally-Adaptive Decision Functions for Person Verification Zhen Li UIUC [email protected] Thomas S. Huang UIUC [email protected] Shiyu Chang UIUC [email protected] Liangliang Cao IBM Research
Local features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
Interactive person re-identification in TV series
Interactive person re-identification in TV series Mika Fischer Hazım Kemal Ekenel Rainer Stiefelhagen CV:HCI lab, Karlsruhe Institute of Technology Adenauerring 2, 76131 Karlsruhe, Germany E-mail: {mika.fischer,ekenel,rainer.stiefelhagen}@kit.edu
Probabilistic Latent Semantic Analysis (plsa)
Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg [email protected] www.multimedia-computing.{de,org} References
How does Person Identity Recognition Help Multi-Person Tracking?
How does Person Identity Recognition Help Multi-Person Tracking? Cheng-Hao Kuo and Ram Nevatia University of Southern California, Institute for Robotics and Intelligent Systems Los Angeles, CA 90089, USA
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS Norbert Buch 1, Mark Cracknell 2, James Orwell 1 and Sergio A. Velastin 1 1. Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE,
Journal of Industrial Engineering Research. Adaptive sequence of Key Pose Detection for Human Action Recognition
IWNEST PUBLISHER Journal of Industrial Engineering Research (ISSN: 2077-4559) Journal home page: http://www.iwnest.com/aace/ Adaptive sequence of Key Pose Detection for Human Action Recognition 1 T. Sindhu
Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang
Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform
Classifying Manipulation Primitives from Visual Data
Classifying Manipulation Primitives from Visual Data Sandy Huang and Dylan Hadfield-Menell Abstract One approach to learning from demonstrations in robotics is to make use of a classifier to predict if
Is that you? Metric Learning Approaches for Face Identification
Is that you? Metric Learning Approaches for Face Identification Matthieu Guillaumin, Jakob Verbeek and Cordelia Schmid LEAR, INRIA Grenoble Laboratoire Jean Kuntzmann [email protected] Abstract
Tensor Methods for Machine Learning, Computer Vision, and Computer Graphics
Tensor Methods for Machine Learning, Computer Vision, and Computer Graphics Part I: Factorizations and Statistical Modeling/Inference Amnon Shashua School of Computer Science & Eng. The Hebrew University
3D Model based Object Class Detection in An Arbitrary View
3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
View-Invariant Dynamic Texture Recognition using a Bag of Dynamical Systems
View-Invariant Dynamic Texture Recognition using a Bag of Dynamical Systems Avinash Ravichandran, Rizwan Chaudhry and René Vidal Center for Imaging Science, Johns Hopkins University, Baltimore, MD 21218,
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
Mean-Shift Tracking with Random Sampling
1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of
Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall
Automatic Photo Quality Assessment Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall Estimating i the photorealism of images: Distinguishing i i paintings from photographs h Florin
Determining optimal window size for texture feature extraction methods
IX Spanish Symposium on Pattern Recognition and Image Analysis, Castellon, Spain, May 2001, vol.2, 237-242, ISBN: 84-8021-351-5. Determining optimal window size for texture feature extraction methods Domènec
Tracking Groups of Pedestrians in Video Sequences
Tracking Groups of Pedestrians in Video Sequences Jorge S. Marques Pedro M. Jorge Arnaldo J. Abrantes J. M. Lemos IST / ISR ISEL / IST ISEL INESC-ID / IST Lisbon, Portugal Lisbon, Portugal Lisbon, Portugal
Object Recognition. Selim Aksoy. Bilkent University [email protected]
Image Classification and Object Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] Image classification Image (scene) classification is a fundamental
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. [email protected]
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE 1 S.Manikandan, 2 S.Abirami, 2 R.Indumathi, 2 R.Nandhini, 2 T.Nanthini 1 Assistant Professor, VSA group of institution, Salem. 2 BE(ECE), VSA
Subspace Analysis and Optimization for AAM Based Face Alignment
Subspace Analysis and Optimization for AAM Based Face Alignment Ming Zhao Chun Chen College of Computer Science Zhejiang University Hangzhou, 310027, P.R.China [email protected] Stan Z. Li Microsoft
A secure face tracking system
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 10 (2014), pp. 959-964 International Research Publications House http://www. irphouse.com A secure face tracking
Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006
Practical Tour of Visual tracking David Fleet and Allan Jepson January, 2006 Designing a Visual Tracker: What is the state? pose and motion (position, velocity, acceleration, ) shape (size, deformation,
Image Normalization for Illumination Compensation in Facial Images
Image Normalization for Illumination Compensation in Facial Images by Martin D. Levine, Maulin R. Gandhi, Jisnu Bhattacharyya Department of Electrical & Computer Engineering & Center for Intelligent Machines
Tracking And Object Classification For Automated Surveillance
Tracking And Object Classification For Automated Surveillance Omar Javed and Mubarak Shah Computer Vision ab, University of Central Florida, 4000 Central Florida Blvd, Orlando, Florida 32816, USA {ojaved,shah}@cs.ucf.edu
Statistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
The Visual Internet of Things System Based on Depth Camera
The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed
Tracking in flussi video 3D. Ing. Samuele Salti
Seminari XXIII ciclo Tracking in flussi video 3D Ing. Tutors: Prof. Tullio Salmon Cinotti Prof. Luigi Di Stefano The Tracking problem Detection Object model, Track initiation, Track termination, Tracking
Biometric Authentication using Online Signatures
Biometric Authentication using Online Signatures Alisher Kholmatov and Berrin Yanikoglu [email protected], [email protected] http://fens.sabanciuniv.edu Sabanci University, Tuzla, Istanbul,
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
Building an Advanced Invariant Real-Time Human Tracking System
UDC 004.41 Building an Advanced Invariant Real-Time Human Tracking System Fayez Idris 1, Mazen Abu_Zaher 2, Rashad J. Rasras 3, and Ibrahiem M. M. El Emary 4 1 School of Informatics and Computing, German-Jordanian
Learning Motion Categories using both Semantic and Structural Information
Learning Motion Categories using both Semantic and Structural Information Shu-Fai Wong, Tae-Kyun Kim and Roberto Cipolla Department of Engineering, University of Cambridge, Cambridge, CB2 1PZ, UK {sfw26,
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Automated Image Forgery Detection through Classification of JPEG Ghosts
Automated Image Forgery Detection through Classification of JPEG Ghosts Fabian Zach, Christian Riess and Elli Angelopoulou Pattern Recognition Lab University of Erlangen-Nuremberg {riess,elli}@i5.cs.fau.de
MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos [email protected]
Machine Learning for Computer Vision 1 MVA ENS Cachan Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos [email protected] Department of Applied Mathematics Ecole Centrale Paris Galen
Simple and efficient online algorithms for real world applications
Simple and efficient online algorithms for real world applications Università degli Studi di Milano Milano, Italy Talk @ Centro de Visión por Computador Something about me PhD in Robotics at LIRA-Lab,
Face Model Fitting on Low Resolution Images
Face Model Fitting on Low Resolution Images Xiaoming Liu Peter H. Tu Frederick W. Wheeler Visualization and Computer Vision Lab General Electric Global Research Center Niskayuna, NY, 1239, USA {liux,tu,wheeler}@research.ge.com
SCALABILITY OF CONTEXTUAL GENERALIZATION PROCESSING USING PARTITIONING AND PARALLELIZATION. Marc-Olivier Briat, Jean-Luc Monnot, Edith M.
SCALABILITY OF CONTEXTUAL GENERALIZATION PROCESSING USING PARTITIONING AND PARALLELIZATION Abstract Marc-Olivier Briat, Jean-Luc Monnot, Edith M. Punt Esri, Redlands, California, USA [email protected], [email protected],
Finding people in repeated shots of the same scene
Finding people in repeated shots of the same scene Josef Sivic 1 C. Lawrence Zitnick Richard Szeliski 1 University of Oxford Microsoft Research Abstract The goal of this work is to find all occurrences
Terrain Traversability Analysis using Organized Point Cloud, Superpixel Surface Normals-based segmentation and PCA-based Classification
Terrain Traversability Analysis using Organized Point Cloud, Superpixel Surface Normals-based segmentation and PCA-based Classification Aras Dargazany 1 and Karsten Berns 2 Abstract In this paper, an stereo-based
Character Image Patterns as Big Data
22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
Automatic parameter regulation for a tracking system with an auto-critical function
Automatic parameter regulation for a tracking system with an auto-critical function Daniela Hall INRIA Rhône-Alpes, St. Ismier, France Email: [email protected] Abstract In this article we propose
AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION
AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION Saurabh Asija 1, Rakesh Singh 2 1 Research Scholar (Computer Engineering Department), Punjabi University, Patiala. 2 Asst.
Real-Time Tracking of Pedestrians and Vehicles
Real-Time Tracking of Pedestrians and Vehicles N.T. Siebel and S.J. Maybank. Computational Vision Group Department of Computer Science The University of Reading Reading RG6 6AY, England Abstract We present
Multi-class Classification: A Coding Based Space Partitioning
Multi-class Classification: A Coding Based Space Partitioning Sohrab Ferdowsi, Svyatoslav Voloshynovskiy, Marcin Gabryel, and Marcin Korytkowski University of Geneva, Centre Universitaire d Informatique,
Distance Metric Learning in Data Mining (Part I) Fei Wang and Jimeng Sun IBM TJ Watson Research Center
Distance Metric Learning in Data Mining (Part I) Fei Wang and Jimeng Sun IBM TJ Watson Research Center 1 Outline Part I - Applications Motivation and Introduction Patient similarity application Part II
DYNAMIC RANGE IMPROVEMENT THROUGH MULTIPLE EXPOSURES. Mark A. Robertson, Sean Borman, and Robert L. Stevenson
c 1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or
Distributed forests for MapReduce-based machine learning
Distributed forests for MapReduce-based machine learning Ryoji Wakayama, Ryuei Murata, Akisato Kimura, Takayoshi Yamashita, Yuji Yamauchi, Hironobu Fujiyoshi Chubu University, Japan. NTT Communication
Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance
2012 IEEE International Conference on Multimedia and Expo Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance Rogerio Feris, Sharath Pankanti IBM T. J. Watson Research Center
Tracking performance evaluation on PETS 2015 Challenge datasets
Tracking performance evaluation on PETS 2015 Challenge datasets Tahir Nawaz, Jonathan Boyle, Longzhen Li and James Ferryman Computational Vision Group, School of Systems Engineering University of Reading,
CS231M Project Report - Automated Real-Time Face Tracking and Blending
CS231M Project Report - Automated Real-Time Face Tracking and Blending Steven Lee, [email protected] June 6, 2015 1 Introduction Summary statement: The goal of this project is to create an Android
Multiple Kernel Learning on the Limit Order Book
JMLR: Workshop and Conference Proceedings 11 (2010) 167 174 Workshop on Applications of Pattern Analysis Multiple Kernel Learning on the Limit Order Book Tristan Fletcher Zakria Hussain John Shawe-Taylor
Visibility optimization for data visualization: A Survey of Issues and Techniques
Visibility optimization for data visualization: A Survey of Issues and Techniques Ch Harika, Dr.Supreethi K.P Student, M.Tech, Assistant Professor College of Engineering, Jawaharlal Nehru Technological
A Prototype For Eye-Gaze Corrected
A Prototype For Eye-Gaze Corrected Video Chat on Graphics Hardware Maarten Dumont, Steven Maesen, Sammy Rogmans and Philippe Bekaert Introduction Traditional webcam video chat: No eye contact. No extensive
Feature Tracking and Optical Flow
02/09/12 Feature Tracking and Optical Flow Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Many slides adapted from Lana Lazebnik, Silvio Saverse, who in turn adapted slides from Steve
LOGISTIC SIMILARITY METRIC LEARNING FOR FACE VERIFICATION
LOGISIC SIMILARIY MERIC LEARNING FOR FACE VERIFICAION Lilei Zheng, Khalid Idrissi, Christophe Garcia, Stefan Duffner and Atilla Baskurt Université de Lyon, CNRS, INSA-Lyon, LIRIS, UMR5205, F-69621, France
Open-Set Face Recognition-based Visitor Interface System
Open-Set Face Recognition-based Visitor Interface System Hazım K. Ekenel, Lorant Szasz-Toth, and Rainer Stiefelhagen Computer Science Department, Universität Karlsruhe (TH) Am Fasanengarten 5, Karlsruhe
Fast Matching of Binary Features
Fast Matching of Binary Features Marius Muja and David G. Lowe Laboratory for Computational Intelligence University of British Columbia, Vancouver, Canada {mariusm,lowe}@cs.ubc.ca Abstract There has been
Forschungskolleg Data Analytics Methods and Techniques
Forschungskolleg Data Analytics Methods and Techniques Martin Hahmann, Gunnar Schröder, Phillip Grosse Prof. Dr.-Ing. Wolfgang Lehner Why do we need it? We are drowning in data, but starving for knowledge!
Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05
Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 2015-03-05 Roman Kern (KTI, TU Graz) Ensemble Methods 2015-03-05 1 / 38 Outline 1 Introduction 2 Classification
How To Identify A Churner
2012 45th Hawaii International Conference on System Sciences A New Ensemble Model for Efficient Churn Prediction in Mobile Telecommunication Namhyoung Kim, Jaewook Lee Department of Industrial and Management
The Delicate Art of Flower Classification
The Delicate Art of Flower Classification Paul Vicol Simon Fraser University University Burnaby, BC [email protected] Note: The following is my contribution to a group project for a graduate machine learning
Methodology for Emulating Self Organizing Maps for Visualization of Large Datasets
Methodology for Emulating Self Organizing Maps for Visualization of Large Datasets Macario O. Cordel II and Arnulfo P. Azcarraga College of Computer Studies *Corresponding Author: [email protected]
Linear Threshold Units
Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear
Big Data: Image & Video Analytics
Big Data: Image & Video Analytics How it could support Archiving & Indexing & Searching Dieter Haas, IBM Deutschland GmbH The Big Data Wave 60% of internet traffic is multimedia content (images and videos)
Relative Permeability Measurement in Rock Fractures
Relative Permeability Measurement in Rock Fractures Siqi Cheng, Han Wang, Da Huo Abstract The petroleum industry always requires precise measurement of relative permeability. When it comes to the fractures,
Transform-based Domain Adaptation for Big Data
Transform-based Domain Adaptation for Big Data Erik Rodner University of Jena Judy Hoffman Jeff Donahue Trevor Darrell Kate Saenko UMass Lowell Abstract Images seen during test time are often not from
Data Mining Yelp Data - Predicting rating stars from review text
Data Mining Yelp Data - Predicting rating stars from review text Rakesh Chada Stony Brook University [email protected] Chetan Naik Stony Brook University [email protected] ABSTRACT The majority
Model-Based 3D Human Motion Capture Using Global-Local Particle Swarm Optimizations
Model-Based 3D Human Motion Capture Using Global-Local Particle Swarm Optimizations Tomasz Krzeszowski, Bogdan Kwolek, and Konrad Wojciechowski Abstract. We present an approach for tracking the articulated
High-accuracy ultrasound target localization for hand-eye calibration between optical tracking systems and three-dimensional ultrasound
High-accuracy ultrasound target localization for hand-eye calibration between optical tracking systems and three-dimensional ultrasound Ralf Bruder 1, Florian Griese 2, Floris Ernst 1, Achim Schweikard
A Genetic Algorithm-Evolved 3D Point Cloud Descriptor
A Genetic Algorithm-Evolved 3D Point Cloud Descriptor Dominik Wȩgrzyn and Luís A. Alexandre IT - Instituto de Telecomunicações Dept. of Computer Science, Univ. Beira Interior, 6200-001 Covilhã, Portugal
A Short Introduction to Computer Graphics
A Short Introduction to Computer Graphics Frédo Durand MIT Laboratory for Computer Science 1 Introduction Chapter I: Basics Although computer graphics is a vast field that encompasses almost any graphical
Tracking and Recognition in Sports Videos
Tracking and Recognition in Sports Videos Mustafa Teke a, Masoud Sattari b a Graduate School of Informatics, Middle East Technical University, Ankara, Turkey [email protected] b Department of Computer
Interactive Offline Tracking for Color Objects
Interactive Offline Tracking for Color Objects Yichen Wei Jian Sun Xiaoou Tang Heung-Yeung Shum Microsoft Research Asia, Beijing, China {yichenw,jiansun,xitang,hshum}@microsoft.com Abstract In this paper,
REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING
REAL TIME TRAFFIC LIGHT CONTROL USING IMAGE PROCESSING Ms.PALLAVI CHOUDEKAR Ajay Kumar Garg Engineering College, Department of electrical and electronics Ms.SAYANTI BANERJEE Ajay Kumar Garg Engineering
Introduction to Support Vector Machines. Colin Campbell, Bristol University
Introduction to Support Vector Machines Colin Campbell, Bristol University 1 Outline of talk. Part 1. An Introduction to SVMs 1.1. SVMs for binary classification. 1.2. Soft margins and multi-class classification.
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
A Tool for Generating Partition Schedules of Multiprocessor Systems
A Tool for Generating Partition Schedules of Multiprocessor Systems Hans-Joachim Goltz and Norbert Pieth Fraunhofer FIRST, Berlin, Germany {hans-joachim.goltz,nobert.pieth}@first.fraunhofer.de Abstract.
Vision based Vehicle Tracking using a high angle camera
Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu [email protected] [email protected] Abstract A vehicle tracking and grouping algorithm is presented in this work
Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.
Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table
Segmentation of building models from dense 3D point-clouds
Segmentation of building models from dense 3D point-clouds Joachim Bauer, Konrad Karner, Konrad Schindler, Andreas Klaus, Christopher Zach VRVis Research Center for Virtual Reality and Visualization, Institute
January 2008 PhD Degree (Dr.techn.) 2004 2007 PhD student at Graz University of Technology (computer science)
Curriculum Vitae Peter Michael Roth Graz University of Technology Institute for Computer Graphics and Vision Inffeldgasse 16/II, 8010 Graz, Austria [email protected] http://lrs.icg.tugraz.at Graz, Oct
RUN-LENGTH ENCODING FOR VOLUMETRIC TEXTURE
RUN-LENGTH ENCODING FOR VOLUMETRIC TEXTURE Dong-Hui Xu, Arati S. Kurani, Jacob D. Furst, Daniela S. Raicu Intelligent Multimedia Processing Laboratory, School of Computer Science, Telecommunications, and
15.062 Data Mining: Algorithms and Applications Matrix Math Review
.6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop
2.11 CMC curves showing the performance of sketch to digital face image matching. algorithms on the CUHK database... 40
List of Figures 1.1 Illustrating different stages in a face recognition system i.e. image acquisition, face detection, face normalization, feature extraction, and matching.. 10 1.2 Illustrating the concepts
Machine Learning Logistic Regression
Machine Learning Logistic Regression Jeff Howbert Introduction to Machine Learning Winter 2012 1 Logistic regression Name is somewhat misleading. Really a technique for classification, not regression.
SYMMETRIC EIGENFACES MILI I. SHAH
SYMMETRIC EIGENFACES MILI I. SHAH Abstract. Over the years, mathematicians and computer scientists have produced an extensive body of work in the area of facial analysis. Several facial analysis algorithms
