Feature description based on Center-Symmetric Local Mapped Patterns

Size: px
Start display at page:

Download "Feature description based on Center-Symmetric Local Mapped Patterns"

Transcription

1 Feature description based on Center-Symmetric Local Mapped Patterns Carolina Toledo Ferraz, Osmando Pereira Jr., Adilson Gonzaga Department of Electrical Engineering, University of São Paulo Av. Trabalhador São Carlense, São Carlos, SP - Brazil caferraz8@usp.br, osmandoj@gmail.com, agonzaga@sc.usp.br ABSTRACT Local feature description has gained a lot of interest in many applications, such as texture recognition, image retrieval and face recognition. This paper presents a novel method for local feature description based on gray-level difference mapping, called Center-Symmetric Local Mapped Pattern (CS- LMP). The proposed descriptor is invariant to image scale, rotation, illumination and partial viewpoint changes. Furthermore, this descriptor more effectively captures the nuances of the image pixels. The training set is composed of rotated and scaled images, with changes in illumination and view points. The test set is composed of rotated and scaled images. In our experiments, the descriptor is compared to the Center-Symmetric Local Binary Pattern (CS-LBP). The results show that our descriptor performs favorably compared to the CS-LBP. Categories and Subject Descriptors I.5.2 [Pattern Recognition]: Design Methodology feature evaluation and selection Keywords region description, image matching, region detection, feature description. INTRODUCTION In image processing, the local feature description plays an important role in applications such as texture recognition, content-based image retrieval and face recognition. The objective is to build a descriptor histogram vector providing a representation that allows efficient matching of local structures between images. A good descriptor has some important features including invariance to illumination, rotation and scale. Furthermore, it should be robust to errors in detection, geometric distortions from different points of view and photometric deformations caused by different lighting conditions [3]. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. SAC 4 March 24-28, 24, Gyeongju, Korea. Copyright 24 ACM /4/3...$ One of the most widely known key feature descriptors published in the literature is the Scale-Invariant Feature Transform (SIFT) [2]. It was introduced by Lowe in 999 []. It is characterized by a 3D histogram of gradient locations and orientations and stores the bins in a vector of 28 positions. Many variations of SIFT have been proposed, such as Principal Components Analysis SIFT (PCA-SIFT) [9], Gradient Location and Orientation Histogram (GLOH) [], Speeded Up Robust Features (SURF) [3], Colored SIFT (CSIFT) [], Center-Symmetric Local Binary Pattern (CS-LBP) [8] and Kernel Projection Based SIFT (KPB-SIFT) [7]. PCA-SIFT applies the standard Principal Components Analysis (PCA) technique to the normalized gradient patches extracted around local features. PCA is able to linearly project high-dimensional descriptors onto a low-dimensional feature space and reduce the high frequency noise in the descriptors. However, the PCA-based feature descriptors need an offline stage to train and estimate the covariance matrix used for PCA projection. Thus, the application of PCAbased features is limited. The GLOH descriptor also uses PCA to reduce the descriptor size. It is very similar to SIFT, however, it uses a Log-Polar location grid instead of a Cartesian one. SURF approximates SIFT using the Haar wavelet response and using integral images to compute the histograms bins. It uses a descriptor vector of 64 positions, providing a better processing speed. CSIFT tries to solve the problem of color image descriptors, improving on SIFT, which is designed mainly for gray images. It uses the color invariance model to build the descriptor. CS-LBP [8] combines the strengths of the well-known SIFT descriptor and the Local Binary Pattern (LBP) texture operator. In [8], it is shown that the construction of the CS-LBP descriptor is simpler than SIFT, but it generates a feature vector of 256 positions, which is twice the SIFT vector size. KBP-SIFT [7] reduced the dimension of the descriptor from 28 to only 36 bins by applying a kernel projection on the orientation gradient patches rather than using smoothed weighted histograms. The approach is tolerant to geometric distortions and does not require a training stage; however, the performance evaluation is not superior in all cases with respect to SIFT. In a recent work, Vieira, Chierici, Ferraz and Gonzaga [5] present a new descriptor based on fuzzy numbers that they called the Local Fuzzy Pattern (LFP). This method interprets the gray-level values of an image neighborhood as a fuzzy set and each pixel level as a fuzzy number. A membership function is used to describe the membership degree of the central pixel to a particular neighborhood. Moreover,

2 the LFP methodology is a generalization of previously published techniques such as the LBP, the Texture Unit [6], the Fuzzy Number Edge Detector (FUNED) [4] and the Census Transform [6]. One evolution of the LFP is the Local Mapped Pattern (LMP) [5], where the authors consider the sum of the differences of each gray-level of a given neighborhood to the central pixel as a local pattern that can be mapped to a histogram bin using a mapping function. In this paper, we propose a new interest region descriptors, denoted as the Center-Symmetric Local Mapped Pattern (CS-LMP). CS-LMP combines the desirable properties of the CS-LBP using the LMP methodology. This new descriptor captures the small transitions of pixels in the image more accurately, resulting in a greater number of correct matches than with CS-LBP. Furthermore, they do not use PCA to decrease the size of the feature vector, but instead quantize the histogram into a fixed number of bins. The rest of this paper is organized as follows. In Section 2, we briefly describe the methods of CS-LBP and LMP. Section 3 gives details on the proposed approach CS-LMP. The experimental evaluation and the results are presented in Section 4. Finally, we conclude the paper in Section LOCAL DESCRIPTORS In computer vision, local descriptors are based on image features extracted from compact image regions using histograms to generate a feature vector. In this section, two local descriptors are presented: the CS-LBP and the LMP. 2. Center-Symmetric Local Binary Pattern (CS-LBP) The CS-LBP is a modified version of the LBP texture feature and SIFT descriptor, inheriting the desirable properties of both texture features and gradient-based features. The features are extracted by an operator similar to the LBP and the constrution of the descriptor is similar to SIFT. This methodology has many properties such as tolerance to illumination changes, robustness on flat image areas, and computational simplicity [7]. In [8], the authors state that the LBP produces a rather long histogram and therefore is difficult to use in the context of an image descriptor. To solve this problem, they modified the scheme of how to compare the pixels in the neighborhood. Instead of comparing each pixel with the center one, the method compares centersymmetric pairs of pixels as illustrated in Figure. We can see that for 8 neighbors, LBP produces 256 different binary patterns (pattern LBP [, 255]), whereas for CS-LBP, this number is only 6 (pattern [, 5])). The interest region detectors, such as Hessian-Affine, Harris- Affine, Hessian-Laplace and Harris-Laplace [4, 3], provide the regions that are used to compute the descriptors. After computing the regions, they are mapped to a circular region of a constant radius to obtain the scale and affine invariance. These regions are fixed at 4 4 pixels and the pixel values are in the range [,]. Rotation invariance is obtained by rotating the normalized regions in the direction of the dominant gradient orientation. For the construction of the descriptor, the image regions are divided into cells with a location grid, and for each cell, a CS-LBP histogram is built. The CS-LBP feature extraction for each pixel of the region is accomplished according to Equation : 4 Figure : LBP and CS-LBP features for a neighborhood of 8 pixels CS LBP R,N,T (x, y) = (N/2) i= s(n i n i+(n/2) )2 i, () where n i and n i+(n/2) correspond to the gray values of center-symmetric pairs of pixels of N equally spaced pixels on a circle of radius R and {, if x>t, s(x) =, otherwise. To avoid boundary effects in which the descriptor abruptly changes as a feature shifts from one cell to another, bilinear interpolation over the x and y dimensions is used to share the weight of the feature between the four nearest cells. The share for a cell is determined by the bilinear interpolation weights [8]. Thus, the resulting descriptor is a 3D histogram of the CS-LBP feature locations and values. The final descriptor is built by concatenating the feature histograms computed for the cells and the last normalization to obtain the maximum unit for each bin. To eliminate the influence of very large descriptor elements, the bins of the histogram are limited to and the histogram is normalized again. For a 4 4 Cartesian grid, the final feature vector contains 256 positions. 2.2 Local Mapped Pattern (LMP) The LMP methodology assumes that each gray-level distribution within an image neighborhood is a local pattern. This pattern can be represented by the gray-level differences around the central pixel. Figure 2 shows an example of a 3 3 neighborhood and a 3D graphic of the respective local pattern. Figure 2: Gray-level differences as a local pattern Each pattern defined by a W W neighborhood will be mapped to a histogram bin h b using Equation 2, where f g(i,j) is the mapping function, P (k, l) is a weighting matrix of predefined values for each pixel position within the neighborhood, and B is the number of the histogram bins. This equation represents the weighted sum of each gray-level difference

3 between the neighboring pixels and the central one, mapped onto the [,] interval by a mapping function and rounded to B possible bins. ( W W k= l= h b = round (f ) g(i,j)p (k, l)) W W k= l= P (k, l) (B ), (2) Through Equation 2, it is possible to derive some previously published approaches for local pattern analysis. The Local Binary Pattern (LBP) can be derived from Equation 2 using the Heaviside step function shown in Equation 3: where H[A(k, l) g(i, j)] = f g(i,j) = H[A(k, l) g(i, j)] (3) { if [A(k, l) g(i, j)] < ; if [A(k, l) g(i, j)]. Taking into account the basic LBP with a 3 3neighborhood of pixels, the weighting matrix will be: P (k, l) = The LBP is thus a particular case of the LMP approach. Using different mapping functions, the LMP methodology could extract specific features from the image using local pattern processing. For texture analysis, the LMP approach uses a smooth approximation to the Heaviside step function by a logistic function, a logistic curve, or a common sigmoid curve as in Equation 4. f g(i,j) = +e [A(k,l) g(i,j)] β, (4) where β is the curve slope and [A(k, l) g(i, j)] are the graylevel differences within the neighborhood centered in g(i, j). The proposed weighting matrix is: P (k, l) = When using the sigmoid curve, the method is called LMPs. Sigmoid functions, whose graphs are S-shaped curves, appear in a great variety of contexts, such as the transfer functions of many neural networks. Their ubiquity is no accident; these curves are among the simplest nonlinear curves, striking a graceful balance between linear and nonlinear behavior. Figure 3 shows the mapping functions f g(i,j) used in Equation 2 for the LMP-s (sigmoid) and LBP (Heaviside step) approaches. 3. CENTER-SYMMETRIC LOCAL MAPPED PATTERN (CS-LMP) The CS-LBP descriptor described in the previous section has the following disadvantage: the difference between the symmetrical pairs and the center are thresholded to or, and do not capture the nuances that can occur in the image. 4 (a) Mapping function to the LMP-s method (b) Mapping function to the LBP method Figure 3: Gray-level differences mapped by Equation 2. a) Local Mapped Pattern. b) Local Binary Pattern Analyzing the advantage of flexibility of the LMP method to be malleable by changing the mapping function and the weight matrix in the Equation 2, and the disadvantage of the CS-LBP descriptor (stiffness in using the step function), we propose a new descriptor named CS-LMP based on the CS-LBP, but using a mapping function suitable for the calculation of characteristics in the LMP method. Equation 5 shows the feature extraction for each pixel of the interest region, defined in a neighborhood W W : ( N/2 ) i= f(i)p (i)) R,N,f (x, y) =round N/2 b (5) i= P (i) where b is the number of histogram bins minus one, P (i) is the ith element of the weight matrix P defined in the Equation 6, and f(i) is the sigmoid mapping function defined in Equation 7. where P =[p p p 2...p N/2 ] t (6) f(i) = p i =2 i [n i n i+(n/2) ] +e β where n i and n i+(n/2) correspond to the gray level values of the center-symmetric pairs of N equally spaced pixels on a circle of radius R and β is the sigmoid curve slope. If the coordinates of n i on the circle do not correspond to the image coordinates, the gray-value of this point is interpolated. Figure 4 illustrates the operation of the CS-LMP after setting b to 5 (6 bins minus one). We adopted the Hessian-Affine detector because it presents better results for detecting interest regions in images [8]. Afterwards, the regions are mapped to a circular region of constant radius and rotated in the direction of the dominant gradient, as described in [8] and [2]. Each region is divided into cells with a grid size of 4 4. The features are extracted for each pixel of the interest region using a CS-LMP descriptor and a histogram is built for each cell. The parameters R, N and β were empirically established, the best results were obtained and these will be discussed in Section 4. As it was presented in [7], a bilinear interpolation is made over the x and y dimensions to share the weight of the feature between the four nearest cells. Thus, a 3D histogram of (7)

4 (a) graf (b) graf4 (c) bark (d) bark4 leu- (f) ven4 leu- (e) ven (g) trees (h) trees4 Figure 4: CS-LMP features for a neighborhood of 8 pixels the CS-LMP feature locations and values is built as shown in Figure 5. The descriptor is made by concatenating the feature histograms computed for the cells, obtaining a vector of 256 positions(4 4 6). The descriptor is then normalized as in the CS-LBP presented in Section 2.. (a) Input (b) 4 4 (c) 4 4 6=256 region cells divisionated bins of concate- histogram Figure 5: CS-LMP descriptor. 4. EXPERIMENTAL EVALUATION In this section, we present the performance of the CS-LMP and CS-LBP descriptors applied to the matching of image pairs using nearest-neighbor similarity. The image database is available on the internet 2. It has 8 sets of images with 6 different types of transformations: viewpoint change, rotation change, scale, ilumination, JPEG compression and blur. The training images used in this paper are shown in Figure 6 and the test images are showninfigure7. Table presents the evaluation of the parameter β in the method CS-LMP for the image bark. We tested some values of the β parameter and we generated the curves, measuring its respective area under curve (AUC). For most of the test images, the best value of β was near for CS-LMP. The evaluation criteria is explained in the next subsection. 4. Evaluation criteria The performance of the CS-LMP descriptor was evaluated and compared with CS-LBP, using an evaluation criterion based on Heikkilä et al. [7, 8]. This criterion consists of the number of correct and false matches for a couple of images, knowing a priori the set of true matches (i.e., the ground truth). Heikkilä et. al. [8] determine the ground truth vgg/research/affine/ (i) ubc (j) ubc4 (k) boat (l) boat4 (m) bike (n) bike4 (o) wall (p) wall4 Figure 6: Training Images: graf (viewpoint change), bark (scale and rotation change), trees (blur image), leuven (ilumination change), ubc (JPEG compression), boat (scale and rotation change), bike (blur image), wall (viewpoint change) (a) resid (b) resid4 (c) east park (e) east (f) east (g) south south4 zmars (i) zmonet (j) zmonet4 (m) belledone (n) belledone4 (o) bip (d) east park4 (h) zmars4 ax- (l) terix4 (k) terix ax- (p) bip4 Figure 7: Test Images: resid, east park, east south (scale and rotation), zmarks, zmonet (rotation change), axterix, belledone, bip (scale change) by the overlap error of the two regions. The overlap error determines how a region A of image overlaps region B of image 2. Image 2 is projected on image by a homography matrix H. The overlap error is defined by the ratio of the intersection

5 Table : β evaluation example for the image bark β AUC (a) resid.. (b) east park and the union of the regions given by Equation 8: ɛ S = (A H T BH)/(A H T BH) (8) A match is assumed to be correct if ɛ S <. To evaluate the performance of our descriptors, we present the results with curves [2] and their respective AUC. Precision represents the number of correct matches over the total number of possible matches returned (falses+corrects) andisdefinedbyequation9. = nc (9) n where n is the number of possible matches returned and the Euclidean distance between their descriptors is below a threshold t d. The number of correct matches n c includes the matched pairs whose Euclidean distance between their descriptors is below a threshold t d and ɛ S < Recall represents the ratio between the correct matches and the total number of true matches (the ground truth) and is defined by Equation. = nc M, () where M is the number of elements in the set of true matches. To construct the curve, each point is defined by choosing n [,N], where N represents the total number of possible matches. Analyzing this curve, a perfect descriptor would give a equal to for any. 4.2 Matching results The interest region detector used to compute the descriptors was the Hessian-Affine detector. As discussed in the previous subsection, the parameters set for the CS-LMP were β =, b = 5, R =2,andN =8. FortheCS- LBP, we used the parameters R =2,N =8,andT =. [7]. We generated the curves for all images of the test set and calculated the AUC for each of them. However, for reasons of clarity, we present only four of them in Figure 8. Note that there is a good for low for the four descriptors, i.e., the first region pairs returned are correct matches; however, as the number of region pairs returned increases, the decreases. We can see that the reduction of is more accentuated by CS-LBP. As the number of returned matches increases (n), CS-LMP achieves a greater number of correct matches (n c) compared to CS-LBP. The AUC results are shown in Table 2 for the CS-LMP and CS-LBP descriptors. Note that for all images, CS-LMP has an AUC value greater than CS-LBP (c) east south.. (d) zmars Figure 8: Results of the curves for CS-LMP and CS-LBP descriptors. Table 2: Area under curve (AUC) Image CS-LMP CS-LBP resid east park east south zmars zmonet axterix belledonne bip Table 3 presents the values for, and the n best returned correspondences by the two descriptors. For example, for the image named resid, taking into account the first 27 possible matches returned, we have 58 correct matches for the CS-LMP and 545 for the CS-LBP descriptor. The image east south presents the difference of 6 correct matches favorable to the CS-LMP and for the image east park, our proposed approach outstands by 52 correct matches over the CS-LBP. Figure 9 presents the correct matches achieved between the image pair called resid. It was performed to check the images test results when applying the descriptors CS-LMP and CS-LBP. 5. CONCLUSIONS In this paper, we proposed a novel method for feature description based on local pattern analysis. The method combines the LMP operator and the CS-LBP descriptor. It uses the CS-LBP-like descriptor and replaces the step function by the sigmoid function. CS-LMP features has many properties that make it wellsuited for this task: a relatively short histogram and tolerance to rotation and scale change. Furthermore, it does not require setting many parameters. The performance of the CS-LMP descriptor was compared to the CS-LBP descriptor in terms of curves. As CS-LMP

6 Table 3: Correct Matches Image Recall Precision Correct Matches CS-LMP resid /27 CS-LBP /27 CS-LMP east park /37 CS-LBP /37 CS-LMP east south /55 CS-LBP /55 CS-LMP axterix /385 CS-LBP /385 CS-LMP belledonne. 93 5/6 CS-LBP 667 3/6 CS-LMP bip /728 CS-LBP /728 CS-LMP zmars /22 CS-LBP /22 CS-LMP zmonet /52 CS-LBP /52 (a) CS-LMP correct matches (b) CS-LBP - 35 correct matches Figure 9: Matches achieved between the image pair called resid. Total of 36 correspondences. captures the gray-level nuances of the images better, the curves show a better performance than CS-LBP methodology. We can conclude that CS-LMP is a promising descriptor, with better performance over the CS-LBP descriptor. Among the 8 tested images, the AUC of the curve was greater for all images for CS-LMP than CS-LBP. Moreover, the number of correct correspondences outperformed the CS-LBP. 6. ACKNOWLEDGMENTS The authors would like to thank the São Paulo Research Foundation (FAPESP), grant #2/8645-2, and National Council for Scientific and Technological Development (CNPQ) for their financial support of this research. 7. REFERENCES [] A. E. Abdel-Hakim and A. A. Farag. Csift: A sift descriptor with color invariant characteristics. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR6), 2: , [2] R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. New York, Addison-Wesley, 999. [3] H. Bay, T. Tuytelaars, and L. V. Gool. Surf: speeded up robust features. European Conference on Computer Vision, :44 47, 26. [4] I. A. G. Boaventura and A. Gonzaga. Edge detection in digital images using fuzzy numbers. International Journal of Innovative Computing and Applications, 2: 2, 29. [5] C. Chierici, R. T. Vieira, C. T. Ferraz, and et. al. A generalized method for texture classification by local pattern analysis. IEEE Transactions on Systems, Man, and Cybernectics, Part B: Cybernectics (submitted), 23. [6] D.-C. He and L. Wang. Texture unit, texture spectrum, and texture analysis. IEEE Transactions On Geoscience and Remote Sensing, 28:59 52, 99. [7] M. Heikkilä, M. Pietikäinen, and C. Schmid. Description of interest regions with center-symmetric local binary patterns. 5th Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP 26, 4338:58 69, december 26. [8] M. Heikkilä, M. Pietikäinen, and C. Schmid. Description of interest regions with local binary patterns. Pattern Recognition, 42: , 29. [9] Y. Ke and R. Sukthankar. Pca-sift: A more distinctive representation for local image descriptors. Computer Vision and Pattern Recognition, 24., 2:56 53, 24. [] K.Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach Intell., 27():65 63, 25. [] D. G. Lowe. Object recognition from local scale-invariant features. International Conference on Computer Vision, pages 5 57, 999. [2] D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 6:9, 24. [3] K. Mikolajczyk and C. Schmid. Scale & affine invariant interest point detectors. International Journal of Computer Vision. Kluwer Academic Publishers, Hingham, MA, USA, 6():63 86, 24. [4] K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, and L. V. Gool. A comparison of affine region detectors. Int. J. Comput. Vision, 65(/2):43 72, 25. [5] R. T. Vieira, C. E. de Oliveira Chierici, C. T. Ferraz, and A. Gonzaga. Local fuzzy pattern: A new way for micro-pattern analysis. Springer-Verlag Berlin Heidelberg. Ideal 22, 7435:62 6, 22. [6] R. Zabih and J. Woodfill. Non-parametric local transforms for computing visual correspondence. Third European Conference on Computer Vision. Springer-Verlag New York, Inc., New Jersey, pages 5 58, 994. [7] G. Zhao, L. Chen, G. Chen, and J. Yuan. Kpb-sift: a compact local feature descriptor. MM Proceedings of the international conference on Multimedia, pages 75 78, 2.

Probabilistic Latent Semantic Analysis (plsa)

Probabilistic Latent Semantic Analysis (plsa) Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg Rainer.Lienhart@informatik.uni-augsburg.de www.multimedia-computing.{de,org} References

More information

Local features and matching. Image classification & object localization

Local features and matching. Image classification & object localization Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to

More information

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report

Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles mjhustc@ucla.edu and lunbo

More information

3D Model based Object Class Detection in An Arbitrary View

3D Model based Object Class Detection in An Arbitrary View 3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/

More information

TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service

TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service Feng Tang, Daniel R. Tretter, Qian Lin HP Laboratories HPL-2012-131R1 Keyword(s): image recognition; cloud service;

More information

A Comparative Study of SIFT and its Variants

A Comparative Study of SIFT and its Variants 10.2478/msr-2013-0021 MEASUREMENT SCIENCE REVIEW, Volume 13, No. 3, 2013 A Comparative Study of SIFT and its Variants Jian Wu 1, Zhiming Cui 1, Victor S. Sheng 2, Pengpeng Zhao 1, Dongliang Su 1, Shengrong

More information

Image Segmentation and Registration

Image Segmentation and Registration Image Segmentation and Registration Dr. Christine Tanner (tanner@vision.ee.ethz.ch) Computer Vision Laboratory, ETH Zürich Dr. Verena Kaynig, Machine Learning Laboratory, ETH Zürich Outline Segmentation

More information

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow

A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow , pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices

More information

siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service

siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service Ahmad Pahlavan Tafti 1, Hamid Hassannia 2, and Zeyun Yu 1 1 Department of Computer Science, University of Wisconsin -Milwaukee,

More information

Build Panoramas on Android Phones

Build Panoramas on Android Phones Build Panoramas on Android Phones Tao Chu, Bowen Meng, Zixuan Wang Stanford University, Stanford CA Abstract The purpose of this work is to implement panorama stitching from a sequence of photos taken

More information

Randomized Trees for Real-Time Keypoint Recognition

Randomized Trees for Real-Time Keypoint Recognition Randomized Trees for Real-Time Keypoint Recognition Vincent Lepetit Pascal Lagger Pascal Fua Computer Vision Laboratory École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne, Switzerland Email:

More information

MIFT: A Mirror Reflection Invariant Feature Descriptor

MIFT: A Mirror Reflection Invariant Feature Descriptor MIFT: A Mirror Reflection Invariant Feature Descriptor Xiaojie Guo, Xiaochun Cao, Jiawan Zhang, and Xuewei Li School of Computer Science and Technology Tianjin University, China {xguo,xcao,jwzhang,lixuewei}@tju.edu.cn

More information

Classifying Manipulation Primitives from Visual Data

Classifying Manipulation Primitives from Visual Data Classifying Manipulation Primitives from Visual Data Sandy Huang and Dylan Hadfield-Menell Abstract One approach to learning from demonstrations in robotics is to make use of a classifier to predict if

More information

SURF: Speeded Up Robust Features

SURF: Speeded Up Robust Features SURF: Speeded Up Robust Features Herbert Bay 1, Tinne Tuytelaars 2, and Luc Van Gool 12 1 ETH Zurich {bay, vangool}@vision.ee.ethz.ch 2 Katholieke Universiteit Leuven {Tinne.Tuytelaars, Luc.Vangool}@esat.kuleuven.be

More information

Incremental PCA: An Alternative Approach for Novelty Detection

Incremental PCA: An Alternative Approach for Novelty Detection Incremental PCA: An Alternative Approach for Detection Hugo Vieira Neto and Ulrich Nehmzow Department of Computer Science University of Essex Wivenhoe Park Colchester CO4 3SQ {hvieir, udfn}@essex.ac.uk

More information

Distinctive Image Features from Scale-Invariant Keypoints

Distinctive Image Features from Scale-Invariant Keypoints Distinctive Image Features from Scale-Invariant Keypoints David G. Lowe Computer Science Department University of British Columbia Vancouver, B.C., Canada lowe@cs.ubc.ca January 5, 2004 Abstract This paper

More information

Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition

Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition Bildverarbeitung und Mustererkennung Image Processing and Pattern Recognition 1. Image Pre-Processing - Pixel Brightness Transformation - Geometric Transformation - Image Denoising 1 1. Image Pre-Processing

More information

CODING MODE DECISION ALGORITHM FOR BINARY DESCRIPTOR CODING

CODING MODE DECISION ALGORITHM FOR BINARY DESCRIPTOR CODING CODING MODE DECISION ALGORITHM FOR BINARY DESCRIPTOR CODING Pedro Monteiro and João Ascenso Instituto Superior Técnico - Instituto de Telecomunicações ABSTRACT In visual sensor networks, local feature

More information

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic

More information

More Local Structure Information for Make-Model Recognition

More Local Structure Information for Make-Model Recognition More Local Structure Information for Make-Model Recognition David Anthony Torres Dept. of Computer Science The University of California at San Diego La Jolla, CA 9093 Abstract An object classification

More information

Simultaneous Gamma Correction and Registration in the Frequency Domain

Simultaneous Gamma Correction and Registration in the Frequency Domain Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong a28wong@uwaterloo.ca William Bishop wdbishop@uwaterloo.ca Department of Electrical and Computer Engineering University

More information

FastKeypointRecognitioninTenLinesofCode

FastKeypointRecognitioninTenLinesofCode FastKeypointRecognitioninTenLinesofCode Mustafa Özuysal Pascal Fua Vincent Lepetit Computer Vision Laboratory École Polytechnique Fédérale de Lausanne(EPFL) 115 Lausanne, Switzerland Email: {Mustafa.Oezuysal,

More information

The use of computer vision technologies to augment human monitoring of secure computing facilities

The use of computer vision technologies to augment human monitoring of secure computing facilities The use of computer vision technologies to augment human monitoring of secure computing facilities Marius Potgieter School of Information and Communication Technology Nelson Mandela Metropolitan University

More information

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall

Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall Automatic Photo Quality Assessment Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall Estimating i the photorealism of images: Distinguishing i i paintings from photographs h Florin

More information

BRIEF: Binary Robust Independent Elementary Features

BRIEF: Binary Robust Independent Elementary Features BRIEF: Binary Robust Independent Elementary Features Michael Calonder, Vincent Lepetit, Christoph Strecha, and Pascal Fua CVLab, EPFL, Lausanne, Switzerland e-mail: firstname.lastname@epfl.ch Abstract.

More information

Face Recognition in Low-resolution Images by Using Local Zernike Moments

Face Recognition in Low-resolution Images by Using Local Zernike Moments Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie

More information

AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION

AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION Saurabh Asija 1, Rakesh Singh 2 1 Research Scholar (Computer Engineering Department), Punjabi University, Patiala. 2 Asst.

More information

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.

Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree. Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table

More information

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang

Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform

More information

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear

More information

Recognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28

Recognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28 Recognition Topics that we will try to cover: Indexing for fast retrieval (we still owe this one) History of recognition techniques Object classification Bag-of-words Spatial pyramids Neural Networks Object

More information

Fast Matching of Binary Features

Fast Matching of Binary Features Fast Matching of Binary Features Marius Muja and David G. Lowe Laboratory for Computational Intelligence University of British Columbia, Vancouver, Canada {mariusm,lowe}@cs.ubc.ca Abstract There has been

More information

A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms

A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms H. Kandil and A. Atwan Information Technology Department, Faculty of Computer and Information Sciences, Mansoura University,El-Gomhoria

More information

CATEGORIZATION OF SIMILAR OBJECTS USING BAG OF VISUAL WORDS AND k NEAREST NEIGHBOUR CLASSIFIER

CATEGORIZATION OF SIMILAR OBJECTS USING BAG OF VISUAL WORDS AND k NEAREST NEIGHBOUR CLASSIFIER TECHNICAL SCIENCES Abbrev.: Techn. Sc., No 15(2), Y 2012 CATEGORIZATION OF SIMILAR OBJECTS USING BAG OF VISUAL WORDS AND k NEAREST NEIGHBOUR CLASSIFIER Piotr Artiemjew, Przemysław Górecki, Krzysztof Sopyła

More information

FREAK: Fast Retina Keypoint

FREAK: Fast Retina Keypoint FREAK: Fast Retina Keypoint Alexandre Alahi, Raphael Ortiz, Pierre Vandergheynst Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland Abstract A large number of vision applications rely on matching

More information

Feature Point Selection using Structural Graph Matching for MLS based Image Registration

Feature Point Selection using Structural Graph Matching for MLS based Image Registration Feature Point Selection using Structural Graph Matching for MLS based Image Registration Hema P Menon Department of CSE Amrita Vishwa Vidyapeetham Coimbatore Tamil Nadu - 641 112, India K A Narayanankutty

More information

Automatic georeferencing of imagery from high-resolution, low-altitude, low-cost aerial platforms

Automatic georeferencing of imagery from high-resolution, low-altitude, low-cost aerial platforms Automatic georeferencing of imagery from high-resolution, low-altitude, low-cost aerial platforms Amanda Geniviva, Jason Faulring and Carl Salvaggio Rochester Institute of Technology, 54 Lomb Memorial

More information

Part-Based Recognition

Part-Based Recognition Part-Based Recognition Benedict Brown CS597D, Fall 2003 Princeton University CS 597D, Part-Based Recognition p. 1/32 Introduction Many objects are made up of parts It s presumably easier to identify simple

More information

Point Matching as a Classification Problem for Fast and Robust Object Pose Estimation

Point Matching as a Classification Problem for Fast and Robust Object Pose Estimation Point Matching as a Classification Problem for Fast and Robust Object Pose Estimation Vincent Lepetit Julien Pilet Pascal Fua Computer Vision Laboratory Swiss Federal Institute of Technology (EPFL) 1015

More information

Face detection is a process of localizing and extracting the face region from the

Face detection is a process of localizing and extracting the face region from the Chapter 4 FACE NORMALIZATION 4.1 INTRODUCTION Face detection is a process of localizing and extracting the face region from the background. The detected face varies in rotation, brightness, size, etc.

More information

FAST APPROXIMATE NEAREST NEIGHBORS WITH AUTOMATIC ALGORITHM CONFIGURATION

FAST APPROXIMATE NEAREST NEIGHBORS WITH AUTOMATIC ALGORITHM CONFIGURATION FAST APPROXIMATE NEAREST NEIGHBORS WITH AUTOMATIC ALGORITHM CONFIGURATION Marius Muja, David G. Lowe Computer Science Department, University of British Columbia, Vancouver, B.C., Canada mariusm@cs.ubc.ca,

More information

Online Learning of Patch Perspective Rectification for Efficient Object Detection

Online Learning of Patch Perspective Rectification for Efficient Object Detection Online Learning of Patch Perspective Rectification for Efficient Object Detection Stefan Hinterstoisser 1, Selim Benhimane 1, Nassir Navab 1, Pascal Fua 2, Vincent Lepetit 2 1 Department of Computer Science,

More information

Image Classification for Dogs and Cats

Image Classification for Dogs and Cats Image Classification for Dogs and Cats Bang Liu, Yan Liu Department of Electrical and Computer Engineering {bang3,yan10}@ualberta.ca Kai Zhou Department of Computing Science kzhou3@ualberta.ca Abstract

More information

Group Sparse Coding. Fernando Pereira Google Mountain View, CA pereira@google.com. Dennis Strelow Google Mountain View, CA strelow@google.

Group Sparse Coding. Fernando Pereira Google Mountain View, CA pereira@google.com. Dennis Strelow Google Mountain View, CA strelow@google. Group Sparse Coding Samy Bengio Google Mountain View, CA bengio@google.com Fernando Pereira Google Mountain View, CA pereira@google.com Yoram Singer Google Mountain View, CA singer@google.com Dennis Strelow

More information

Android Ros Application

Android Ros Application Android Ros Application Advanced Practical course : Sensor-enabled Intelligent Environments 2011/2012 Presentation by: Rim Zahir Supervisor: Dejan Pangercic SIFT Matching Objects Android Camera Topic :

More information

Image Stitching based on Feature Extraction Techniques: A Survey

Image Stitching based on Feature Extraction Techniques: A Survey Image Stitching based on Feature Extraction Techniques: A Survey Ebtsam Adel Information System Dept. Faculty of Computers and Information, Mansoura University, Egypt Mohammed Elmogy Information Technology

More information

Classification of Fingerprints. Sarat C. Dass Department of Statistics & Probability

Classification of Fingerprints. Sarat C. Dass Department of Statistics & Probability Classification of Fingerprints Sarat C. Dass Department of Statistics & Probability Fingerprint Classification Fingerprint classification is a coarse level partitioning of a fingerprint database into smaller

More information

Visualization by Linear Projections as Information Retrieval

Visualization by Linear Projections as Information Retrieval Visualization by Linear Projections as Information Retrieval Jaakko Peltonen Helsinki University of Technology, Department of Information and Computer Science, P. O. Box 5400, FI-0015 TKK, Finland jaakko.peltonen@tkk.fi

More information

GPS-aided Recognition-based User Tracking System with Augmented Reality in Extreme Large-scale Areas

GPS-aided Recognition-based User Tracking System with Augmented Reality in Extreme Large-scale Areas GPS-aided Recognition-based User Tracking System with Augmented Reality in Extreme Large-scale Areas Wei Guan Computer Graphics and Immersive Technologies Computer Science, USC wguan@usc.edu Suya You Computer

More information

The Role of Size Normalization on the Recognition Rate of Handwritten Numerals

The Role of Size Normalization on the Recognition Rate of Handwritten Numerals The Role of Size Normalization on the Recognition Rate of Handwritten Numerals Chun Lei He, Ping Zhang, Jianxiong Dong, Ching Y. Suen, Tien D. Bui Centre for Pattern Recognition and Machine Intelligence,

More information

Adaptive Face Recognition System from Myanmar NRC Card

Adaptive Face Recognition System from Myanmar NRC Card Adaptive Face Recognition System from Myanmar NRC Card Ei Phyo Wai University of Computer Studies, Yangon, Myanmar Myint Myint Sein University of Computer Studies, Yangon, Myanmar ABSTRACT Biometrics is

More information

QUALITY TESTING OF WATER PUMP PULLEY USING IMAGE PROCESSING

QUALITY TESTING OF WATER PUMP PULLEY USING IMAGE PROCESSING QUALITY TESTING OF WATER PUMP PULLEY USING IMAGE PROCESSING MRS. A H. TIRMARE 1, MS.R.N.KULKARNI 2, MR. A R. BHOSALE 3 MR. C.S. MORE 4 MR.A.G.NIMBALKAR 5 1, 2 Assistant professor Bharati Vidyapeeth s college

More information

A Genetic Algorithm-Evolved 3D Point Cloud Descriptor

A Genetic Algorithm-Evolved 3D Point Cloud Descriptor A Genetic Algorithm-Evolved 3D Point Cloud Descriptor Dominik Wȩgrzyn and Luís A. Alexandre IT - Instituto de Telecomunicações Dept. of Computer Science, Univ. Beira Interior, 6200-001 Covilhã, Portugal

More information

REAL-TIME OBJECT DETECTION AND TRACKING FOR INDUSTRIAL APPLICATIONS

REAL-TIME OBJECT DETECTION AND TRACKING FOR INDUSTRIAL APPLICATIONS REAL-TIME OBJECT DETECTION AND TRACKING FOR INDUSTRIAL APPLICATIONS Selim Benhimane 1, Hesam Najafi 2, Matthias Grundmann 3, Ezio Malis 4, Yakup Genc 2, Nassir Navab 1 1 Department of Computer Science,

More information

Circle Object Recognition Based on Monocular Vision for Home Security Robot

Circle Object Recognition Based on Monocular Vision for Home Security Robot Journal of Applied Science and Engineering, Vol. 16, No. 3, pp. 261 268 (2013) DOI: 10.6180/jase.2013.16.3.05 Circle Object Recognition Based on Monocular Vision for Home Security Robot Shih-An Li, Ching-Chang

More information

Data, Measurements, Features

Data, Measurements, Features Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are

More information

Module II: Multimedia Data Mining

Module II: Multimedia Data Mining ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA Module II: Multimedia Data Mining Laurea Magistrale in Ingegneria Informatica University of Bologna Multimedia Data Retrieval Home page: http://www-db.disi.unibo.it/courses/dm/

More information

Cloud-Based Image Coding for Mobile Devices Toward Thousands to One Compression

Cloud-Based Image Coding for Mobile Devices Toward Thousands to One Compression IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 15, NO. 4, JUNE 2013 845 Cloud-Based Image Coding for Mobile Devices Toward Thousands to One Compression Huanjing Yue, Xiaoyan Sun, Jingyu Yang, and Feng Wu, Senior

More information

Robert Collins CSE598G. More on Mean-shift. R.Collins, CSE, PSU CSE598G Spring 2006

Robert Collins CSE598G. More on Mean-shift. R.Collins, CSE, PSU CSE598G Spring 2006 More on Mean-shift R.Collins, CSE, PSU Spring 2006 Recall: Kernel Density Estimation Given a set of data samples x i ; i=1...n Convolve with a kernel function H to generate a smooth function f(x) Equivalent

More information

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines

EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation

More information

Euler Vector: A Combinatorial Signature for Gray-Tone Images

Euler Vector: A Combinatorial Signature for Gray-Tone Images Euler Vector: A Combinatorial Signature for Gray-Tone Images Arijit Bishnu, Bhargab B. Bhattacharya y, Malay K. Kundu, C. A. Murthy fbishnu t, bhargab, malay, murthyg@isical.ac.in Indian Statistical Institute,

More information

Accurate and robust image superresolution by neural processing of local image representations

Accurate and robust image superresolution by neural processing of local image representations Accurate and robust image superresolution by neural processing of local image representations Carlos Miravet 1,2 and Francisco B. Rodríguez 1 1 Grupo de Neurocomputación Biológica (GNB), Escuela Politécnica

More information

Visualization of Large Font Databases

Visualization of Large Font Databases Visualization of Large Font Databases Martin Solli and Reiner Lenz Linköping University, Sweden ITN, Campus Norrköping, Linköping University, 60174 Norrköping, Sweden Martin.Solli@itn.liu.se, Reiner.Lenz@itn.liu.se

More information

A Dynamic Approach to Extract Texts and Captions from Videos

A Dynamic Approach to Extract Texts and Captions from Videos Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

3 An Illustrative Example

3 An Illustrative Example Objectives An Illustrative Example Objectives - Theory and Examples -2 Problem Statement -2 Perceptron - Two-Input Case -4 Pattern Recognition Example -5 Hamming Network -8 Feedforward Layer -8 Recurrent

More information

AN EFFICIENT HYBRID REAL TIME FACE RECOGNITION ALGORITHM IN JAVA ENVIRONMENT ABSTRACT

AN EFFICIENT HYBRID REAL TIME FACE RECOGNITION ALGORITHM IN JAVA ENVIRONMENT ABSTRACT AN EFFICIENT HYBRID REAL TIME FACE RECOGNITION ALGORITHM IN JAVA ENVIRONMENT M. A. Abdou*, M. H. Fathy** *Informatics Research Institute, City for Scientific Research and Technology Applications (SRTA-City),

More information

jorge s. marques image processing

jorge s. marques image processing image processing images images: what are they? what is shown in this image? What is this? what is an image images describe the evolution of physical variables (intensity, color, reflectance, condutivity)

More information

DYNAMIC DOMAIN CLASSIFICATION FOR FRACTAL IMAGE COMPRESSION

DYNAMIC DOMAIN CLASSIFICATION FOR FRACTAL IMAGE COMPRESSION DYNAMIC DOMAIN CLASSIFICATION FOR FRACTAL IMAGE COMPRESSION K. Revathy 1 & M. Jayamohan 2 Department of Computer Science, University of Kerala, Thiruvananthapuram, Kerala, India 1 revathysrp@gmail.com

More information

Surgical Tools Recognition and Pupil Segmentation for Cataract Surgical Process Modeling

Surgical Tools Recognition and Pupil Segmentation for Cataract Surgical Process Modeling Surgical Tools Recognition and Pupil Segmentation for Cataract Surgical Process Modeling David Bouget, Florent Lalys, Pierre Jannin To cite this version: David Bouget, Florent Lalys, Pierre Jannin. Surgical

More information

COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS

COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS B.K. Mohan and S. N. Ladha Centre for Studies in Resources Engineering IIT

More information

arxiv:1201.3118v1 [physics.data-an] 15 Jan 2012

arxiv:1201.3118v1 [physics.data-an] 15 Jan 2012 Shape analysis using fractal dimension: a curvature based approach André R. Backes, 1, a) João B. Florindo, 2, b) 3, c) and Odemir M. Bruno 1) Faculdade de Computação, Universidade Federal de Uberlândia

More information

3D Scanner using Line Laser. 1. Introduction. 2. Theory

3D Scanner using Line Laser. 1. Introduction. 2. Theory . Introduction 3D Scanner using Line Laser Di Lu Electrical, Computer, and Systems Engineering Rensselaer Polytechnic Institute The goal of 3D reconstruction is to recover the 3D properties of a geometric

More information

Signature Segmentation from Machine Printed Documents using Conditional Random Field

Signature Segmentation from Machine Printed Documents using Conditional Random Field 2011 International Conference on Document Analysis and Recognition Signature Segmentation from Machine Printed Documents using Conditional Random Field Ranju Mandal Computer Vision and Pattern Recognition

More information

Canny Edge Detection

Canny Edge Detection Canny Edge Detection 09gr820 March 23, 2009 1 Introduction The purpose of edge detection in general is to significantly reduce the amount of data in an image, while preserving the structural properties

More information

CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學. Self-Organizing Map: Visualization and Data Handling 自 組 織 神 經 網 絡 : 可 視 化 和 數 據 處 理

CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學. Self-Organizing Map: Visualization and Data Handling 自 組 織 神 經 網 絡 : 可 視 化 和 數 據 處 理 CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學 Self-Organizing Map: Visualization and Data Handling 自 組 織 神 經 網 絡 : 可 視 化 和 數 據 處 理 Submitted to Department of Electronic Engineering 電 子 工 程 學 系 in Partial Fulfillment

More information

Object class recognition using unsupervised scale-invariant learning

Object class recognition using unsupervised scale-invariant learning Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of Technology Goal Recognition of object categories

More information

Image Hallucination Using Neighbor Embedding over Visual Primitive Manifolds

Image Hallucination Using Neighbor Embedding over Visual Primitive Manifolds Image Hallucination Using Neighbor Embedding over Visual Primitive Manifolds Wei Fan & Dit-Yan Yeung Department of Computer Science and Engineering, Hong Kong University of Science and Technology {fwkevin,dyyeung}@cse.ust.hk

More information

Digital Imaging and Multimedia. Filters. Ahmed Elgammal Dept. of Computer Science Rutgers University

Digital Imaging and Multimedia. Filters. Ahmed Elgammal Dept. of Computer Science Rutgers University Digital Imaging and Multimedia Filters Ahmed Elgammal Dept. of Computer Science Rutgers University Outlines What are Filters Linear Filters Convolution operation Properties of Linear Filters Application

More information

THE development of methods for automatic detection

THE development of methods for automatic detection Learning to Detect Objects in Images via a Sparse, Part-Based Representation Shivani Agarwal, Aatif Awan and Dan Roth, Member, IEEE Computer Society 1 Abstract We study the problem of detecting objects

More information

Supervised and unsupervised learning - 1

Supervised and unsupervised learning - 1 Chapter 3 Supervised and unsupervised learning - 1 3.1 Introduction The science of learning plays a key role in the field of statistics, data mining, artificial intelligence, intersecting with areas in

More information

FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM

FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 2, February 2014,

More information

Edge-based Template Matching and Tracking for Perspectively Distorted Planar Objects

Edge-based Template Matching and Tracking for Perspectively Distorted Planar Objects Edge-based Template Matching and Tracking for Perspectively Distorted Planar Objects Andreas Hofhauser, Carsten Steger, and Nassir Navab TU München, Boltzmannstr. 3, 85748 Garching bei München, Germany

More information

SZTAKI @ ImageCLEF 2011

SZTAKI @ ImageCLEF 2011 SZTAKI @ ImageCLEF 2011 Bálint Daróczy Róbert Pethes András A. Benczúr Data Mining and Web search Research Group, Informatics Laboratory Computer and Automation Research Institute of the Hungarian Academy

More information

Data Storage 3.1. Foundations of Computer Science Cengage Learning

Data Storage 3.1. Foundations of Computer Science Cengage Learning 3 Data Storage 3.1 Foundations of Computer Science Cengage Learning Objectives After studying this chapter, the student should be able to: List five different data types used in a computer. Describe how

More information

Content-Based Image Retrieval

Content-Based Image Retrieval Content-Based Image Retrieval Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Image retrieval Searching a large database for images that match a query: What kind

More information

DIGITAL IMAGE PROCESSING AND ANALYSIS

DIGITAL IMAGE PROCESSING AND ANALYSIS DIGITAL IMAGE PROCESSING AND ANALYSIS Human and Computer Vision Applications with CVIPtools SECOND EDITION SCOTT E UMBAUGH Uffi\ CRC Press Taylor &. Francis Group Boca Raton London New York CRC Press is

More information

SWIGS: A Swift Guided Sampling Method

SWIGS: A Swift Guided Sampling Method IGS: A Swift Guided Sampling Method Victor Fragoso Matthew Turk University of California, Santa Barbara 1 Introduction This supplement provides more details about the homography experiments in Section.

More information

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition

Open Access A Facial Expression Recognition Algorithm Based on Local Binary Pattern and Empirical Mode Decomposition Send Orders for Reprints to reprints@benthamscience.ae The Open Electrical & Electronic Engineering Journal, 2014, 8, 599-604 599 Open Access A Facial Expression Recognition Algorithm Based on Local Binary

More information

MusicGuide: Album Reviews on the Go Serdar Sali

MusicGuide: Album Reviews on the Go Serdar Sali MusicGuide: Album Reviews on the Go Serdar Sali Abstract The cameras on mobile phones have untapped potential as input devices. In this paper, we present MusicGuide, an application that can be used to

More information

Colour Image Segmentation Technique for Screen Printing

Colour Image Segmentation Technique for Screen Printing 60 R.U. Hewage and D.U.J. Sonnadara Department of Physics, University of Colombo, Sri Lanka ABSTRACT Screen-printing is an industry with a large number of applications ranging from printing mobile phone

More information

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. indhubatchvsa@gmail.com

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. indhubatchvsa@gmail.com LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE 1 S.Manikandan, 2 S.Abirami, 2 R.Indumathi, 2 R.Nandhini, 2 T.Nanthini 1 Assistant Professor, VSA group of institution, Salem. 2 BE(ECE), VSA

More information

Object Recognition. Selim Aksoy. Bilkent University saksoy@cs.bilkent.edu.tr

Object Recognition. Selim Aksoy. Bilkent University saksoy@cs.bilkent.edu.tr Image Classification and Object Recognition Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Image classification Image (scene) classification is a fundamental

More information

PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO

PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO V. Conotter, E. Bodnari, G. Boato H. Farid Department of Information Engineering and Computer Science University of Trento, Trento (ITALY)

More information

A secure face tracking system

A secure face tracking system International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 10 (2014), pp. 959-964 International Research Publications House http://www. irphouse.com A secure face tracking

More information

Footwear Print Retrieval System for Real Crime Scene Marks

Footwear Print Retrieval System for Real Crime Scene Marks Footwear Print Retrieval System for Real Crime Scene Marks Yi Tang, Sargur N. Srihari, Harish Kasiviswanathan and Jason J. Corso Center of Excellence for Document Analysis and Recognition (CEDAR) University

More information

Jiří Matas. Hough Transform

Jiří Matas. Hough Transform Hough Transform Jiří Matas Center for Machine Perception Department of Cybernetics, Faculty of Electrical Engineering Czech Technical University, Prague Many slides thanks to Kristen Grauman and Bastian

More information

Semi-structured document image matching and recognition

Semi-structured document image matching and recognition Semi-structured document image matching and recognition Olivier Augereau a, Nicholas Journet a and Jean-Philippe Domenger a a Université de Bordeaux, 351 Cours de la Libération, Talence, France ABSTRACT

More information

Subspace Analysis and Optimization for AAM Based Face Alignment

Subspace Analysis and Optimization for AAM Based Face Alignment Subspace Analysis and Optimization for AAM Based Face Alignment Ming Zhao Chun Chen College of Computer Science Zhejiang University Hangzhou, 310027, P.R.China zhaoming1999@zju.edu.cn Stan Z. Li Microsoft

More information

A Method of Caption Detection in News Video

A Method of Caption Detection in News Video 3rd International Conference on Multimedia Technology(ICMT 3) A Method of Caption Detection in News Video He HUANG, Ping SHI Abstract. News video is one of the most important media for people to get information.

More information

Large Scale Image Copy Detection Evaluation

Large Scale Image Copy Detection Evaluation Large Scale Image Copy Detection Evaluation Bart Thomee Mark J. Huiskes Erwin Bakker Michael S. Lew LIACS Media Lab, Leiden University Niels Bohrweg, 2333 CA Leiden, The Netherlands {bthomee, markh, erwin,

More information