Improving Web-based Image Search via Content Based Clustering
|
|
|
- Thomasine French
- 9 years ago
- Views:
Transcription
1 Improving Web-based Image Search via Content Based Clustering Nadav Ben-Haim, Boris Babenko and Serge Belongie Department of Computer Science and Engineering University of California, San Diego Abstract Current image search engines on the web rely purely on the keywords around the images and the filenames, which produces a lot of garbage in the search results. Alternatively, there exist methods for content based image retrieval that require a user to submit a query image, and return images that are similar in content. We propose a novel approach named ReSPEC (Re-ranking Sets of Pictures by Exploiting Consistency), that is a hybrid of the two methods. Our algorithm first retrieves the results of a keyword query from an existing image search engine, clusters the results based on extracted image features, and returns the cluster that is inferred to be the most relevant to the search query. Furthermore, it ranks the remaining results in order of relevance. 1. Introduction Due to the fact that web-based image search engines are blind to the actual content of images, the result of querying for a specific object is often cluttered with irrelevant data. Alternatively, much research has been done on content based image retrieval (CBIR). Nevertheless, most CBIR systems require a user to provide one or more query images. In [3] the user provides an image, and selects the blob in that image that represents the object of interest. Although quite flexible, this system is burdensome on the user. Similarly, in [15] the system requires several images from a user, which it uses as a training set to build a classifier using AdaBoost. This, again, requires the user to to be in possession of example images. Other research has focused on the task of learning probability models linking text to image regions ([2], [1]). This task requires a large set of data labeled by a human. Also, it is not clear how these techniques would perform on noisy data sets, with mislabeling, as is the case with the Internet. Alternatively, in [14] and [4], various clustering algorithms are used to display results of a search engine or CBIR system in visually similar groups. These approaches, however, make no attempt in weeding out groups of images that are irrelevant to the search query, nor re-ranking the search results. Recently, Fergus et al. have proposed a method that uses the top results from a web-based image search engine to train a classifier, and then filter the search results in [8], or classify novel images into categories in [7]. To overcome the problem of garbage results from the image search, they use an ad hoc approach of getting the top 5 results for the same query in different languages (making the assumption that the top 5 results will most likely be correct), and developing a robust classifier that would overcome possible noise and variability of the training set. Furthermore, their feature selection is based on interest point detection and ignores color and texture, which represent powerful cues for object recognition. In this paper we present ReSPEC, a system in which a user need only provide a keyword query, as is the case with standard web-based image search engines. Our system, however, infers a representation of the object of interest, and re-ranks the images according to relevance based on their content. Lastly, we discuss how our approach can be applied to the more difficult task of transferring text labels from web based image databases to novel photograph collections for purposes of object recognition. 2. Approach When using a standard web-based image search engine, one is likely to find garbage output even in the first couple of pages of results. Nevertheless, for simple objects, it can be conjectured that most of the top results will contain the object of interest. Our system exploits this consistency, and attempts to find the object that appears in most of these images. ReSPEC has several integral components (see Fig. 1). First, each image is segmented into similar regions or blobs. Next, a set of features is extracted from each of these blobs. The set of feature vectors retrieved from the top image search results is then clustered using the mean shift algorithm. We posit the cluster that corresponds to the 1
2 User makes text query Top Yahoo Image Search results Segment images into blobs and build color histograms for each blob an orange Cluster blobs in feature space Find cluster of blobs that corresponds to largest number of parent images Compute the mean in feature space mean the significant cluster Re-rank all Yahoo Image Search results based on distance of each blob in each image to the mean of the significant cluster Figure 1. A flowchart depicting the ReSPEC system with the query an orange. largest number of parent images to be the object of interest, and we refer to this as the significant cluster (see Fig. 2). Lastly, a larger set of images from the image search is reranked based on similarity to this significant cluster Image Segmentation Each image most likely contains multiple objects, or an object and a background. Therefore, extracting features globally is not appropriate. For this reason we start by splitting each image into regions of similarity, using an image segmentation algorithm, with the intuition that each of these regions is a separate object in the image. Image segmentation is a well studied problem in Computer Vision, and there are many existing algorithms for this task. We chose to use an algorithm by Felzenszwalb and Huttenlocher, as described in [6], because of its ease of use and speed. This segmentation algorithm partitions an image into similar regions using a graph based approach. Each pixel is a node in the graph with undirected edges connecting its adjacent pixels in the image. Each edge has a weight encoding the similarity of the two connected pixels. The partitioning is done such that two segments are merged only if the dissimilarity between the segments is not as great as the dissimilarity inside either of the segments Feature Selection In order to obtain a measure of how similar image blobs are to one another, good features are needed to represent the blobs. We chose to use color histograms in HSV color space as our features. To form a feature vector for each blob, histograms are built for the H, S and V channels, with 15 bins each, and then concatenated together to form a 45 dimensional feature vector. Although histograms have clear advantage over taking the mean color of all the pixels in the blob, there is an inherent problem. For example, consider the following three histograms for hue: X = (1, 0, 0, 0, 0, 0, 0), Y = (0, 1, 0, 0, 0, 0, 0), Z = (0, 0, 0, 1, 0, 0, 0). It is clear that X and Y are more similar in hue. Nevertheless, the distance between X and Y, and X and Z are equal. To overcome this problem, Hafner et al. introduced a method in [9], to measure distance between two histograms I and J with inter-bin similarities encoded with a certain A matrix as follows: D 2 hist(i, J) = (I J) A(I J) (1)
3 point in the feature space. On each iteration, each window moves in the direction of the mean shift vector which is computed as follows: y t+1 = 1 Θ λ x Θ λ (y t x) (2) where y t is the window center at iteration t, and Θ λ is the set of points in the hypersphere window of radius λ. It is also possible to use a kernel function to weight points according to how far they are from the window center. The windows eventually converge on the points of local density maxima, and these points are returned as the cluster centroids. Thus, the mean shift clustering algorithm eliminates the parameter k (number of clusters) at the price of introducing another parameter λ. This parameter, however, is easier to tune to all possible inputs than k. Figure 2. Examples of significant clusters for two queries: strawberries (top), and us flag (bottom). We take a similar approach, where we choose a matrix A such that A = B B and B is a matrix where each row is a 1 dimensional Gaussian, shifted by 1 from row to row. We achieve the same effect by convolving each channel with a Gaussian filter (σ = 0.7). The circularity effect of the Hue channel is taken into account Mean Shift Clustering in Feature Space The next step in the system is to cluster the blobs, according to their extracted features with the hope that the object of interest will form the largest cluster. Since some of the blobs will represent garbage, it is difficult to predict the number of clusters that are present. Hence, a standard clustering approach such as k-means is not appropriate. The mean shift clustering algorithm, which is an iterative gradient ascent method for finding local density maxima, was used instead. As described in [5] 1, the algorithm begins by placing a window (usually a hypersphere) around each 1 We used Piotr Dollár s implementation of this algorithm, which is available at pdollar/toolbox/ doc/index.html 2.4. Re-ranking the Images After obtaining the significant cluster in feature space, the mean is computed. The rest of the images are then resorted based on the distance of their blobs to this mean. Since each image could potentially contain more than one blob, the closest blob in each image is used. Chi-squared distance comparisons are used in the re-sorting because it is known that for histograms, a chi-squared distance measure yields better results than L 2 distance [12]. The distance D χ 2(I, J) between two histograms I and J is computed as follows: D χ 2(I, J) = 1 K (I k J k ) 2 (3) 2 I k + J k k=1 The result is a re-ranking of the images from the original search engine. 3. Implementation For a given search query, we downloaded the first 500 images from Yahoo s search engine using the Yahoo Image API 2. We chose to work with the Yahoo Image Search engine because, to our knowledge, this is the only engine that provides a free image search API. Felzenszwalb and Huttenlocher s segmentation algorithm was then run on each image. For the segmentation, we used the default parameters of k = 500, A min = 20, and σ =.5, where k is how dissimilar the blobs can be, A min is the minimum blob size, and σ is the parameter of the Gaussian which is used to smooth the image before segmentation. After this preprocessing completed, the clustering was run on the first 15 images with the mean shift window size set to 55. Blobs that were too small (took up less than 5% of the image area) 2 The API is available at search/image/v1/imagesearch.html
4 BEFORE RE-RANKING (Yahoo Image Search results) AFTER RE-RANKING Figure 3. The top results from Yahoo Image Search (left) and ReSPEC (right). Results for the queries basketballs (top), Lakers jersey (middle), sprite can (bottom). We circled results which we felt were completely irrelevant to the query, although this is very subjective depending on the user. Nevertheless, it is clear that the top results from ReSPEC are much more visually consistent than Yahoo s top results. were filtered out of the clustering. The rest of the 500 images were re-ranked as described above. 4. Results Quantifying the results of an image retrieval system is not a trivial task. While most CBIR papers use a hand labeled corpus of images to compute precision and recall on a set of queries, this is not appropriate for the task of refining the results of a web-based image search engine. To competently perform evaluation of such a system, a study must be made using human subjects who would evaluate the qualitative performance. In this study we concentrated on the
5 Figure 4. The significant cluster detected for the query car. Since the object is highly variable in terms of color, our system fails on this query. algorithmic aspects of discovering a visual representation of a given query, and we leave such evaluation for future work. Instead we report example results of our algorithm, and discuss them. For queries that describe simple objects with a single, visually consistent semantic meaning, our method seems to work quite well. It is important to note that if a query is vague, or has multiple semantic meanings, our method will favor the more prominent meaning. The detection of multiple meanings per query seems plausible, but in this paper we chose to concentrate on single meanings, with the argument that a query could be made more specific if an ambiguity exists. For instance, consider the Lakers jersey example in Fig. 3. Although the results from the Yahoo Image Search engine demonstrate more variety, they also contain irrelevant images. The results from our system exploit the visual consistencies of Lakers jerseys, and return only the images with yellow regions. If a user is indeed interested in a different sort of Lakers jersey, for example, a purple jersey, he/she could narrow down the search query to purple Lakers jersey. Another important note about any system similar to what we have described in this paper, is that it must be efficient in order to be used in a real application. In our implementation, the segmentation and feature extraction could be done offline on an entire database (to further speed things up, only the thumbnail images are used). The rest of the processing takes a few seconds, which is fast enough for a real-time application. 5. Conclusions & Future Work In this paper we presented ReSPEC, a system that uses the results of a standard web-based image search engine, and re-ranks these results in order of relevance. The novelty of our system is that it requires only a keyword query from Figure 5. The visual representation for the query oranges was extracted from Yahoo Image Search, and then used to re-rank a collection of 10 unlabeled personal images. The two top results are shown. The left column shows the photos, while the right column displays the blobs that were closest to the significant cluster. the user, and infers a representation of this query in feature space, which is used to re-rank the results. In our implementation, the features consisted of a smoothed HSV histogram, but there are many other possible extensions (e.g. include textures and shape descriptors, use different colorspace, etc). For instance, in Fig. 4 we demonstrate an example where color information is not enough to capture the visual consistency for the query car. In particular, it would be interesting to see how our approach would perform if feature selection were changed into interest point detection, or maximally stable extremal region (MSER)[11] detection in combination with SIFT region descriptors [10]. A similar approach was successfully used in [13] to search for visually similar regions in a video. An exciting application of our approach is robust image annotation transfer from a noisily labeled set of images on the web to an unlabeled set of images in a personal collection. Most photograph collections gathered by people with digital cameras contain completely irrelevant metadata. For example, an image file might be called DSC00010.JPG. Therefore, a standard keyword search for an object in such a collection would yield no results. In this paper, however, we have described a way to find a visual representation of a text query using the noisily labeled sets of images returned by image search engines. It seems plausible to use such a representation to then search through a collection of personal photographs on a user s machine and sort this collection by relevance to the query. The user would again only need to provide a text query, and in this sense such a system would be largely unsupervised. We attempted an experiment with a very small set of personal photographs, and saw some promising results (See Fig. 5 & Fig. 6). Due to the fact that our feature extraction is limited to color only,
6 however, we found this to not be discriminative enough to sort large collections of highly variable photos. The limited set of features works for the application that we described in this paper, because the distribution of image features in personal photo collections is in general different than the distribution of image features conditioned on a text query. Specifically, the probability that an object of interest will appear in the image search results is higher than the probability of finding that object in an unconstrained set of photos. Additional discriminative features would be necessary to attack the more challenging problem of generic object recognition in an unconstrained set of images. We leave such ideas to future work. Figure 6. The visual representation for the query grass was extracted from Yahoo Image Search, and then used to re-rank a collection of 10 unlabeled personal images. These pictures are shown in their new order. Notice that the two pictures of grass are at the top. Acknowledgements We would like to thank the students of CSE 190A for helpful discussions. S.B. acknowledges support from NSF CAREER Grant # and the Alfred P. Sloan Research Fellowship. References [1] K. Barnard, P. Duygulu, N. de Freitas, D. Forsyth, D. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3, pages , [2] G. Carneiro and N. Vasconcelos. Formulating semantic image annotation as a supervised learning problem. In CVPR, pages II: , [3] C. Carson, M. Thomas, S. Belongie, J. Hellerstein, and J. Malik. Blobworld: A system for region-based image indexing and retrieval. In VISUAL, pages , [4] Y. Chen, J. Wang, and R. Krovetz. CLUE: cluster-based retrieval of images by unsupervised learning. IP, 14(8): , August [5] Y. Cheng. Mean shift, mode seeking, and clustering. PAMI, 17(8): , August [6] P. Felzenszwalb and D. Huttenlocher. Efficient graph-based image segmentation. IJCV, 59(2): , September [7] R. Fergus, L. Fei-Fei, P. Perona, and A. Zisserman. Learning object categories from Google s image search. In ICCV, pages II: , [8] R. Fergus, P. Perona, and A. Zisserman. A visual category filter for Google images. In ECCV, pages Vol I: , [9] J. Hafner, H. Sawhney, W. Equitz, M. Flickner, and W. Niblack. Efficient color histogram indexing for quadratic form distance functions. PAMI, 17(7): , July [10] D. Lowe. Object recognition from local scale-invariant features. In ICCV, pages , [11] J. Matas, O. Chum, M. Urban, and T. Pajdla. Robust wide baseline stereo from maximally stable extremal regions. In BMVC, page 3D and Video, [12] J. Puzicha, T. Hofmann, and J. Buhmann. Non-parametric similarity measures for unsupervised texture segmentation and image retrieval. In CVPR, pages , [13] J. Sivic and A. Zisserman. Video google: A text retrieval approach to object matching in videos. In ICCV, pages , [14] D. K. T. Deselaers and H. Ney. Clustering visually similar images to improve image search engines. In Informatiktage der Gesellschaft für Informatik, Bad Schussenried, Germany, [15] K. Tieu and P. Viola. Boosting image retrieval. In CVPR, pages I: ,
Segmentation & Clustering
EECS 442 Computer vision Segmentation & Clustering Segmentation in human vision K-mean clustering Mean-shift Graph-cut Reading: Chapters 14 [FP] Some slides of this lectures are courtesy of prof F. Li,
MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos [email protected]
Machine Learning for Computer Vision 1 MVA ENS Cachan Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos [email protected] Department of Applied Mathematics Ecole Centrale Paris Galen
Automatic Attribute Discovery and Characterization from Noisy Web Data
Automatic Attribute Discovery and Characterization from Noisy Web Data Tamara L. Berg 1, Alexander C. Berg 2, and Jonathan Shih 3 1 Stony Brook University, Stony Brook NY 11794, USA, [email protected],
Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang
Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform
Vision based Vehicle Tracking using a high angle camera
Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu [email protected] [email protected] Abstract A vehicle tracking and grouping algorithm is presented in this work
Probabilistic Latent Semantic Analysis (plsa)
Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg [email protected] www.multimedia-computing.{de,org} References
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles [email protected] and lunbo
Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012
Clustering Big Data Anil K. Jain (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012 Outline Big Data How to extract information? Data clustering
A Genetic Algorithm-Evolved 3D Point Cloud Descriptor
A Genetic Algorithm-Evolved 3D Point Cloud Descriptor Dominik Wȩgrzyn and Luís A. Alexandre IT - Instituto de Telecomunicações Dept. of Computer Science, Univ. Beira Interior, 6200-001 Covilhã, Portugal
Local features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
The Visual Internet of Things System Based on Depth Camera
The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed
Content-Based Image Retrieval
Content-Based Image Retrieval Selim Aksoy Department of Computer Engineering Bilkent University [email protected] Image retrieval Searching a large database for images that match a query: What kind
Discovering objects and their location in images
Discovering objects and their location in images Josef Sivic Bryan C. Russell Alexei A. Efros Andrew Zisserman William T. Freeman Dept. of Engineering Science CS and AI Laboratory School of Computer Science
Recognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28
Recognition Topics that we will try to cover: Indexing for fast retrieval (we still owe this one) History of recognition techniques Object classification Bag-of-words Spatial pyramids Neural Networks Object
Mean-Shift Tracking with Random Sampling
1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of
Tracking in flussi video 3D. Ing. Samuele Salti
Seminari XXIII ciclo Tracking in flussi video 3D Ing. Tutors: Prof. Tullio Salmon Cinotti Prof. Luigi Di Stefano The Tracking problem Detection Object model, Track initiation, Track termination, Tracking
SIGNATURE VERIFICATION
SIGNATURE VERIFICATION Dr. H.B.Kekre, Dr. Dhirendra Mishra, Ms. Shilpa Buddhadev, Ms. Bhagyashree Mall, Mr. Gaurav Jangid, Ms. Nikita Lakhotia Computer engineering Department, MPSTME, NMIMS University
The Delicate Art of Flower Classification
The Delicate Art of Flower Classification Paul Vicol Simon Fraser University University Burnaby, BC [email protected] Note: The following is my contribution to a group project for a graduate machine learning
Learning Motion Categories using both Semantic and Structural Information
Learning Motion Categories using both Semantic and Structural Information Shu-Fai Wong, Tae-Kyun Kim and Roberto Cipolla Department of Engineering, University of Cambridge, Cambridge, CB2 1PZ, UK {sfw26,
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
Object Recognition. Selim Aksoy. Bilkent University [email protected]
Image Classification and Object Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] Image classification Image (scene) classification is a fundamental
Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006
Practical Tour of Visual tracking David Fleet and Allan Jepson January, 2006 Designing a Visual Tracker: What is the state? pose and motion (position, velocity, acceleration, ) shape (size, deformation,
Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011
Behavior Analysis in Crowded Environments XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011 Behavior Analysis in Sparse Scenes Zelnik-Manor & Irani CVPR
Semantic Image Segmentation and Web-Supervised Visual Learning
Semantic Image Segmentation and Web-Supervised Visual Learning Florian Schroff Andrew Zisserman University of Oxford, UK Antonio Criminisi Microsoft Research Ltd, Cambridge, UK Outline Part I: Semantic
Canny Edge Detection
Canny Edge Detection 09gr820 March 23, 2009 1 Introduction The purpose of edge detection in general is to significantly reduce the amount of data in an image, while preserving the structural properties
Finding people in repeated shots of the same scene
Finding people in repeated shots of the same scene Josef Sivic 1 C. Lawrence Zitnick Richard Szeliski 1 University of Oxford Microsoft Research Abstract The goal of this work is to find all occurrences
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
Image Segmentation and Registration
Image Segmentation and Registration Dr. Christine Tanner ([email protected]) Computer Vision Laboratory, ETH Zürich Dr. Verena Kaynig, Machine Learning Laboratory, ETH Zürich Outline Segmentation
MusicGuide: Album Reviews on the Go Serdar Sali
MusicGuide: Album Reviews on the Go Serdar Sali Abstract The cameras on mobile phones have untapped potential as input devices. In this paper, we present MusicGuide, an application that can be used to
Clustering Connectionist and Statistical Language Processing
Clustering Connectionist and Statistical Language Processing Frank Keller [email protected] Computerlinguistik Universität des Saarlandes Clustering p.1/21 Overview clustering vs. classification supervised
Object class recognition using unsupervised scale-invariant learning
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of Technology Goal Recognition of object categories
High Level Describable Attributes for Predicting Aesthetics and Interestingness
High Level Describable Attributes for Predicting Aesthetics and Interestingness Sagnik Dhar Vicente Ordonez Tamara L Berg Stony Brook University Stony Brook, NY 11794, USA [email protected] Abstract
Dynamical Clustering of Personalized Web Search Results
Dynamical Clustering of Personalized Web Search Results Xuehua Shen CS Dept, UIUC [email protected] Hong Cheng CS Dept, UIUC [email protected] Abstract Most current search engines present the user a ranked
Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.
Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table
A World Wide Web Based Image Search Engine Using Text and Image Content Features
A World Wide Web Based Image Search Engine Using Text and Image Content Features Bo Luo, Xiaogang Wang, and Xiaoou Tang Department of Information Engineering The Chinese University of Hong Kong ABSTRACT
Assessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall
Automatic Photo Quality Assessment Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall Estimating i the photorealism of images: Distinguishing i i paintings from photographs h Florin
Convolutional Feature Maps
Convolutional Feature Maps Elements of efficient (and accurate) CNN-based object detection Kaiming He Microsoft Research Asia (MSRA) ICCV 2015 Tutorial on Tools for Efficient Object Detection Overview
3D Model based Object Class Detection in An Arbitrary View
3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/
Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus
Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus 1. Introduction Facebook is a social networking website with an open platform that enables developers to extract and utilize user information
Discovering objects and their location in images
Discovering objects and their location in images Josef Sivic Bryan C. Russell Alexei A. Efros Andrew Zisserman William T. Freeman Dept. of Engineering Science CS and AI Laboratory School of Computer Science
Analecta Vol. 8, No. 2 ISSN 2064-7964
EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,
Machine Learning for Data Science (CS4786) Lecture 1
Machine Learning for Data Science (CS4786) Lecture 1 Tu-Th 10:10 to 11:25 AM Hollister B14 Instructors : Lillian Lee and Karthik Sridharan ROUGH DETAILS ABOUT THE COURSE Diagnostic assignment 0 is out:
Determining optimal window size for texture feature extraction methods
IX Spanish Symposium on Pattern Recognition and Image Analysis, Castellon, Spain, May 2001, vol.2, 237-242, ISBN: 84-8021-351-5. Determining optimal window size for texture feature extraction methods Domènec
TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service
TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service Feng Tang, Daniel R. Tretter, Qian Lin HP Laboratories HPL-2012-131R1 Keyword(s): image recognition; cloud service;
The use of computer vision technologies to augment human monitoring of secure computing facilities
The use of computer vision technologies to augment human monitoring of secure computing facilities Marius Potgieter School of Information and Communication Technology Nelson Mandela Metropolitan University
Introduction to Machine Learning Using Python. Vikram Kamath
Introduction to Machine Learning Using Python Vikram Kamath Contents: 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. Introduction/Definition Where and Why ML is used Types of Learning Supervised Learning Linear Regression
LIBSVX and Video Segmentation Evaluation
CVPR 14 Tutorial! 1! LIBSVX and Video Segmentation Evaluation Chenliang Xu and Jason J. Corso!! Computer Science and Engineering! SUNY at Buffalo!! Electrical Engineering and Computer Science! University
ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING)
ARTIFICIAL INTELLIGENCE (CSCU9YE) LECTURE 6: MACHINE LEARNING 2: UNSUPERVISED LEARNING (CLUSTERING) Gabriela Ochoa http://www.cs.stir.ac.uk/~goc/ OUTLINE Preliminaries Classification and Clustering Applications
. Learn the number of classes and the structure of each class using similarity between unlabeled training patterns
Outline Part 1: of data clustering Non-Supervised Learning and Clustering : Problem formulation cluster analysis : Taxonomies of Clustering Techniques : Data types and Proximity Measures : Difficulties
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
Object Categorization using Co-Occurrence, Location and Appearance
Object Categorization using Co-Occurrence, Location and Appearance Carolina Galleguillos Andrew Rabinovich Serge Belongie Department of Computer Science and Engineering University of California, San Diego
Edge tracking for motion segmentation and depth ordering
Edge tracking for motion segmentation and depth ordering P. Smith, T. Drummond and R. Cipolla Department of Engineering University of Cambridge Cambridge CB2 1PZ,UK {pas1001 twd20 cipolla}@eng.cam.ac.uk
Robert Collins CSE598G. More on Mean-shift. R.Collins, CSE, PSU CSE598G Spring 2006
More on Mean-shift R.Collins, CSE, PSU Spring 2006 Recall: Kernel Density Estimation Given a set of data samples x i ; i=1...n Convolve with a kernel function H to generate a smooth function f(x) Equivalent
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
Visual Contextual Advertising: Bringing Textual Advertisements to Images
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI-10) Visual Contextual Advertising: Bringing Textual Advertisements to Images Yuqiang Chen Ou Jin Gui-Rong Xue Jia Chen
Open issues and research trends in Content-based Image Retrieval
Open issues and research trends in Content-based Image Retrieval Raimondo Schettini DISCo Universita di Milano Bicocca [email protected] www.disco.unimib.it/schettini/ IEEE Signal Processing Society
Colour Image Segmentation Technique for Screen Printing
60 R.U. Hewage and D.U.J. Sonnadara Department of Physics, University of Colombo, Sri Lanka ABSTRACT Screen-printing is an industry with a large number of applications ranging from printing mobile phone
CLUSTER ANALYSIS FOR SEGMENTATION
CLUSTER ANALYSIS FOR SEGMENTATION Introduction We all understand that consumers are not all alike. This provides a challenge for the development and marketing of profitable products and services. Not every
Search Result Optimization using Annotators
Search Result Optimization using Annotators Vishal A. Kamble 1, Amit B. Chougule 2 1 Department of Computer Science and Engineering, D Y Patil College of engineering, Kolhapur, Maharashtra, India 2 Professor,
Image Classification for Dogs and Cats
Image Classification for Dogs and Cats Bang Liu, Yan Liu Department of Electrical and Computer Engineering {bang3,yan10}@ualberta.ca Kai Zhou Department of Computing Science [email protected] Abstract
Randomized Trees for Real-Time Keypoint Recognition
Randomized Trees for Real-Time Keypoint Recognition Vincent Lepetit Pascal Lagger Pascal Fua Computer Vision Laboratory École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne, Switzerland Email:
Who are you? Learning person specific classifiers from video
Who are you? Learning person specific classifiers from video Josef Sivic, Mark Everingham 2 and Andrew Zisserman 3 INRIA, WILLOW Project, Laboratoire d Informatique de l Ecole Normale Superieure, Paris,
UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003. Yun Zhai, Zeeshan Rasheed, Mubarak Shah
UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003 Yun Zhai, Zeeshan Rasheed, Mubarak Shah Computer Vision Laboratory School of Computer Science University of Central Florida, Orlando, Florida ABSTRACT In this
An Automatic and Accurate Segmentation for High Resolution Satellite Image S.Saumya 1, D.V.Jiji Thanka Ligoshia 2
An Automatic and Accurate Segmentation for High Resolution Satellite Image S.Saumya 1, D.V.Jiji Thanka Ligoshia 2 Assistant Professor, Dept of ECE, Bethlahem Institute of Engineering, Karungal, Tamilnadu,
Interactive person re-identification in TV series
Interactive person re-identification in TV series Mika Fischer Hazım Kemal Ekenel Rainer Stiefelhagen CV:HCI lab, Karlsruhe Institute of Technology Adenauerring 2, 76131 Karlsruhe, Germany E-mail: {mika.fischer,ekenel,rainer.stiefelhagen}@kit.edu
J. P. Oakley and R. T. Shann. Department of Electrical Engineering University of Manchester Manchester M13 9PL U.K. Abstract
A CURVATURE SENSITIVE FILTER AND ITS APPLICATION IN MICROFOSSIL IMAGE CHARACTERISATION J. P. Oakley and R. T. Shann Department of Electrical Engineering University of Manchester Manchester M13 9PL U.K.
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Lecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION
AN IMPROVED DOUBLE CODING LOCAL BINARY PATTERN ALGORITHM FOR FACE RECOGNITION Saurabh Asija 1, Rakesh Singh 2 1 Research Scholar (Computer Engineering Department), Punjabi University, Patiala. 2 Asst.
Making Sense of the Mayhem: Machine Learning and March Madness
Making Sense of the Mayhem: Machine Learning and March Madness Alex Tran and Adam Ginzberg Stanford University [email protected] [email protected] I. Introduction III. Model The goal of our research
521466S Machine Vision Assignment #7 Hough transform
521466S Machine Vision Assignment #7 Hough transform Spring 2014 In this assignment we use the hough transform to extract lines from images. We use the standard (r, θ) parametrization of lines, lter the
STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table
Pedestrian Detection with RCNN
Pedestrian Detection with RCNN Matthew Chen Department of Computer Science Stanford University [email protected] Abstract In this paper we evaluate the effectiveness of using a Region-based Convolutional
Human behavior analysis from videos using optical flow
L a b o r a t o i r e I n f o r m a t i q u e F o n d a m e n t a l e d e L i l l e Human behavior analysis from videos using optical flow Yassine Benabbas Directeur de thèse : Chabane Djeraba Multitel
Specific Usage of Visual Data Analysis Techniques
Specific Usage of Visual Data Analysis Techniques Snezana Savoska 1 and Suzana Loskovska 2 1 Faculty of Administration and Management of Information systems, Partizanska bb, 7000, Bitola, Republic of Macedonia
Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
Colorado School of Mines Computer Vision Professor William Hoff
Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Introduction to 2 What is? A process that produces from images of the external world a description
Dissertation TOPIC MODELS FOR IMAGE RETRIEVAL ON LARGE-SCALE DATABASES. Eva Hörster
Dissertation TOPIC MODELS FOR IMAGE RETRIEVAL ON LARGE-SCALE DATABASES Eva Hörster Department of Computer Science University of Augsburg Adviser: Readers: Prof. Dr. Rainer Lienhart Prof. Dr. Rainer Lienhart
Combining Local Recognition Methods for Better Image Recognition
Combining Local Recognition Methods for Better Image Recognition Bart Lamiroy Patrick Gros y Sylvaine Picard INRIA CNRS DGA MOVI GRAVIR z IMAG INRIA RHÔNE-ALPES ZIRST 655, Av. de l Europe Montbonnot FRANCE
QUALITY TESTING OF WATER PUMP PULLEY USING IMAGE PROCESSING
QUALITY TESTING OF WATER PUMP PULLEY USING IMAGE PROCESSING MRS. A H. TIRMARE 1, MS.R.N.KULKARNI 2, MR. A R. BHOSALE 3 MR. C.S. MORE 4 MR.A.G.NIMBALKAR 5 1, 2 Assistant professor Bharati Vidyapeeth s college
Semantic Recognition: Object Detection and Scene Segmentation
Semantic Recognition: Object Detection and Scene Segmentation Xuming He [email protected] Computer Vision Research Group NICTA Robotic Vision Summer School 2015 Acknowledgement: Slides from Fei-Fei
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS B.K. Mohan and S. N. Ladha Centre for Studies in Resources Engineering IIT
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
ICT Perspectives on Big Data: Well Sorted Materials
ICT Perspectives on Big Data: Well Sorted Materials 3 March 2015 Contents Introduction 1 Dendrogram 2 Tree Map 3 Heat Map 4 Raw Group Data 5 For an online, interactive version of the visualisations in
Interactive Offline Tracking for Color Objects
Interactive Offline Tracking for Color Objects Yichen Wei Jian Sun Xiaoou Tang Heung-Yeung Shum Microsoft Research Asia, Beijing, China {yichenw,jiansun,xitang,hshum}@microsoft.com Abstract In this paper,
Music Mood Classification
Music Mood Classification CS 229 Project Report Jose Padial Ashish Goel Introduction The aim of the project was to develop a music mood classifier. There are many categories of mood into which songs may
Image Analysis Using the Aperio ScanScope
Image Analysis Using the Aperio ScanScope Allen H. Olson, PhD Algorithm Development Engineer Aperio Technologies INTRODUCTION Why should I choose the Aperio ScanScope over competing systems for image analysis?
Blind Deconvolution of Barcodes via Dictionary Analysis and Wiener Filter of Barcode Subsections
Blind Deconvolution of Barcodes via Dictionary Analysis and Wiener Filter of Barcode Subsections Maximilian Hung, Bohyun B. Kim, Xiling Zhang August 17, 2013 Abstract While current systems already provide
Make and Model Recognition of Cars
Make and Model Recognition of Cars Sparta Cheung Department of Electrical and Computer Engineering University of California, San Diego 9500 Gilman Dr., La Jolla, CA 92093 [email protected] Alice Chu Department
Clustering Technique in Data Mining for Text Documents
Clustering Technique in Data Mining for Text Documents Ms.J.Sathya Priya Assistant Professor Dept Of Information Technology. Velammal Engineering College. Chennai. Ms.S.Priyadharshini Assistant Professor
