MusicGuide: Album Reviews on the Go Serdar Sali
|
|
|
- Madeleine Sharp
- 10 years ago
- Views:
Transcription
1 MusicGuide: Album Reviews on the Go Serdar Sali Abstract The cameras on mobile phones have untapped potential as input devices. In this paper, we present MusicGuide, an application that can be used to provide the user information on a music album on the go by simply using a photo of the album cover as the input. Relevant data on the album such as reviews and track samples are then collected from various sources on the web and presented to the user. Our experiments show that MusicGuide is robust against various transforms and artifacts in the input images, and it also has good performance in terms of accuracy, with recognition rates above 90%. 1. Introduction Mobile devices continue to grow enormously in popularity all around the world, and through convergence with PDAs, digital cameras and music players, they are set to become the mobile computing platform of choice for most people in the future. However, this computing potential is largely untapped right now, despite the fact that strong and continuously improving communication abilities, practicality due to small size and their ubiquitous nature make mobile devices an ideal platform for conveying custom-tailored, context-based information to the user. In this paper, we propose a system that can be used to provide a user with product information on the go by utilizing the camera on the phone as an input device. While various methods exist to provide users with product information, they usually rely on text based input by the user, or data obtained by other means such as barcodes or RFID. However, entering text-based input on these devices is not practical due to space limitations which make full size keyboards either impossible or cumbersome to use due to cramped, tiny keys. RFID readers are not widely available on phones and RFID tags are not widely available on products. While cameras on mobile phones can be used as barcode readers, due to their small size, barcodes are difficult to focus on; the resulting images are usually too blurry to have reliable recognition rates. An ideal method would be to use photos of product covers taken by the camera embedded on the phone as the input image for a search. In a wider sense, this problem is a subset of the content-based image retrieval (CBIR) problem and is called query by example. Various CBIR applications exist on mobile phones. The main differences lie on the type of input used, where the computation takes place and to which domain the architecture applies. The architecture we propose is aimed at searching for information on music albums from a mobile device. In contrast to applications where all computation is performed on the device, since the dataset for our domain is too large and computation power on the device is likely to be a bottleneck, our architecture employs a client-server architecture where the mobile device is the client, and all recognition and information aggregation is done on the remote server. The user can take a photo of an album cover and send this photo to the server via , a WiFi connection or MMS messaging. The server takes this photo as the input, performs the search, aggregates information from various resources and sends this information back to the user.
2 The contribution of this paper is a mobile album cover recognition system utilizing cameras on mobile phones. The motivation behind selecting this domain is the difficulty in obtaining information on an album based only on the album cover, which is the only information available to a user in a store. While a buyer can read sections of a book or a magazine, therefore sampling it, for albums sampling is limited to what is available in the listening kiosks. Our architecture provides users with the ability to obtain information on an album by taking just a photo of the album cover, which is intuitive and practical. 2. Related Work While CBIR is a large problem and encompasses a broad area of applications such as semantic retrieval of images, searching based on particular features of an image such as color distribution or frequency, or retrieving pictures with similar textual annotations, the relevant area to our research is retrieval of similar images to a sample input image, which is also called query by example. Since we use as input images taken by a user, the recognition algorithm employed must be robust against rotation, translation, scaling and partial occlusion. It is also important that the algorithm is invariant to some degree to noise in the image and differences in illumination. Furthermore, since we are comparing against actual digital versions of cover art, we need a reliable descriptor with a good accuracy Similarity based search Over the years, various approaches have been proposed to address the problem of matching images based on similarity. These approaches mainly differ in how they form the feature vector for an image. The most widely used sources for feature extraction are color, shape and texture. One of the first algorithms utilizing color was proposed by Swain et al. [1]. This algorithm computes color histograms for images, followed by an intersection of the histograms to perform the similarity search. Improvements that add spatial information and correlation to color histograms were proposed in [2] and [3]. Color histograms are very sensitive to noise and they work best when both the input image and the database images are taken by the same device and therefore have similar color representation. Unfortunately, this is not the case for our domain: cameras on phones tend to take relatively noisy pictures, and our database consists of digital versions of cover art, not photos of them. In addition to color histograms, various methods look for textures in an image and their spatial placement, and construct a feature vector based on this information. One important work utilizing textures is given by Tamura et al. [4]. In this paper, the authors propose approximations to the following texture features: Coarseness, contrast, roughness, regularity, directionality and linelikeness. These properties are based on how humans actually perceive textures. While these methods are usually good for detecting uniform regions such as sky and sea in an image, they are
3 not applicable to our domain with good results because such regions are not enough to correctly identify a single match for an input image. Shape can also be used to build a feature vector from an image. Methods relying on shape usually work on the similarity of edges, corners and shapes of the objects in the image. Feature extraction using shapes usually works in a local level: it is concerned with locating points of interest in the image, rather than considering the global distribution of a feature as in color-based methods. One of the most commonly used shape detectors is the corner detector by Harris [5]. Another descriptor that makes use of local points of interest is SIFT (Scale-invariant feature transform) [6]. SIFT is a popular method based on detection of key points in an image using the Difference of Gaussians method. The resulting features are invariant to scaling, differences in illumination and rotation, and works good even for 3D images and different points of view. These properties make SIFT ideal for our architecture Object Recognition Systems on Mobile Phones While many web-based systems exist for content-based image search, applications of these systems to mobile devices are relatively rare. Some research is being conducted on building assistive technologies for the blind and visually impaired using mobile devices and object recognition. One such system is GroZi [7]. GroZi aims to help the visually impaired by recognizing and locating the items on a customer s shopping list among the items on the shelf. Another set of applications aim to build on-demand mobile tour or museum guides [8]. Some of these systems try to perform location recognition by using photos from the location as input. Yeh et al. [9] propose a system that first queries an image database by using a photo taken by a user. After a match is found, the system then searches the web based on keywords associated with the determined location. PhoneGuide [10] is another application that makes use of the mobile phone as a museum guide. The user can take pictures of the various exhibits in the museum using the digital camera on her/his mobile phone, and then query a database for information on the exhibit using the image as the input. Seifert et al. [11] propose a system that utilizes object recognition and GPS on a mobile device for taking an inventory of traffic signs. The algorithm for sign recognition takes advantage of the fact that traffic signs have obvious shapes and patterns which are known in advance. Jia et al. [12] propose a generic architecture called Photo-to-Search that can be used to query the web directly from mobile devices using images taken by the digital camera on the device and minimal textual input. The underlying object recognition system uses SIFT to detect key points and perform the matching. There are also systems for outdoor hobbyists, such as systems for flower or fish recognition [13,14].
4 3. Our Method Image recognition systems require high performance, and an album cover database is very large to store on a phone. For these reasons, we chose to implement our system as a client-server architecture (Fig.1). Take photo of cover & send it to server MusicGuide Server User Match Image Collect Data Send Data to Phone Figure 1 - MusicGuide Architecture The user interface on the phone lets the user take a photo with a resolution of pixels using the camera on the device. This photo is then automatically uploaded to our server where we perform the matching and data aggregation. The server then pushes the results as an HTML file back to the phone. This HTML file contains product rating and track samples from Amazon, and average critic score and excerpt reviews from Metacritic with links to full reviews, and it is also saved on the phone for subsequent views if desired. Our server has two main tasks: image matching and data collection. The image matching component uses David Lowe s SIFT. SIFT has been shown to have good precision, and it is invariant to various image transforms. The implementation we use is a modified version of David Lowe s implementation. Once the matching is performed, the server gets user rating and track samples for the album from Amazon using the Amazon Web Services API, and parses the HTML data from the product page on Metacritic to get the review excerpts, links to full versions of these reviews and average critic score for the album. Since the input image is small, the total data communicated between the phone and the server is on average 100Kb, and most of this data is the input image and the matching cover image from our database that the server sends back so that the user can evaluate the correctness of the match. The user interface is implemented in Python. Data collection on the server was implemented on VB.Net, whereas the object recognition algorithm is written in C for efficiency purposes. A web interface to the server was also built using ASP.Net. While the user interface we implemented provides a practical mechanism to perform the search, through this web interface our server can be accessed by any web-capable phone with a browser.
5 4. Performance In order to evaluate the performance of our architecture, we test our system with 45 input images taken by the camera on a Nokia N95 phone against a database of up to 500 cover images. We measure the percentage of correct matches as well as the time it takes to execute a query and find a match. Fig. 2 shows the percentage of correct matches for different database sizes. 100 : 97.7% 300 : 93.3% 500 : 93.3% percentage of correct matches percentage of correct matches # of images in database Figure 2 - Percentage of correct matches for different database sizes. Our results show that our method has really good accuracy in finding the correct match. Among the 45 images, only one was identified incorrectly when we have 100 images in our database and only 3 when we have 500. Furthermore, the recognition rate falls rather slowly with increasing database size, which implies that our method can be relied on to work with good accuracy on larger databases. Some examples of correct matches given in Fig. 2 show how reliable our architecture is even under various transforms and artifacts in input images and background clutter. All the examples given were correctly matched in all our trials. (a) (b) (c) Figure 3 - Some examples of correct matches. Our method works reliably even with (a) heavy background clutter in the input image, (b) different viewpoints, and (c) blurry, noisy input images.
6 Since we perform an exhaustive search to find the matching image, the time it takes to execute a query increases linearly with the number of images in the database. The time performance measurements on a 1.6 GHz computer with 1GB of RAM indicate that it takes on average 200 seconds to search through 500 images in our database. The time performance can be improved by using approximate methods to find the correct match at the expense of some accuracy. 5. Conclusion and Future Work In this paper, we have presented a mobile CBIR architecture for album cover recognition, which can be used to provide users with reviews and sample tracks for an album on the go. The high accuracy of our system ensures a reliable and practical application. Furthermore, our system can be used readily with existing cover databases as we perform the matching against actual digital versions of cover art. As future work, we plan to implement a faster search by using approximate methods to find the correct match for a query image. We also plan to enhance the data collected on an album by including pricing data from various sources and recommending similar items, and extend our application to handle different media such as books and DVDs. References [1] S. Ballard, Color Indexing, International Journal of Computer Vision, Vol. 7, No. 1, pp , [2] G.Pass, and R. Zabith, "Comparing images using joint histograms," Multimedia Systems, Vol.7, pp , [3] G. Pass, and R. Zabith, "Histogram refinement for content-based image retrieval," IEEE Workshop on Applications of Computer Vision, pp , [4] H. Tamura, S. Mori, and T. Yamawaki, "Texture features corresponding to visual perception", IEEE Trans. On Systems, Man, and Cybernetics, Vol. Smc-8, No. 6, [5] C. Harris and M. Stephens, A combined corner and edge detector, ALVEY Vision Conference, pages , [6] D. G. Lowe, "Distinctive image features from scale-invariant keypoints", International Journal of Computer Vision, Vol. 60, No. 2, pp , [7] M. Merler, C. Galleguillos, and S. Belongie. Recognizing Groceries in situ Using in vitro Training Data, to appear, SLAM 2007, Minneapolis, MN, retrieved from [8] N. Davies, K. Cheverst, A. Dix, and A. Hesse, Understanding the role of image recognition in mobile tour guides, Proceedings of the 7th international Conference on Human Computer interaction with Mobile Devices & Services, pp , [9] T. Yeh, K. Tollmar, and T. Darrell, "Searching the Web with Mobile Images for Location Recognition", 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'04), Vol. 2, pp , [10] P. Föckler, T. Zeidler, B. Brombach, E. Bruns, and O. Bimber, PhoneGuide: museum guidance supported by on-device object recognition on mobile phones, Proceedings of the 4th international Conference on Mobile and Ubiquitous Multimedia, MUM '05, Vol. 154, pp. 3-10, 2005.
7 [11] C. Seifert, L. Paletta, A. Jeitler, E. Hödl, J.. Andreu, P. Luley and A. Almer, Visual Object Detection for Mobile Road Sign Inventory, Lecture Notes in Computer Science, MobileHCI 2004, pp , Springer, [12] M. Jia, X. Fan, X. Xie, M. Li, W. Ma, Photo-to-Search: Using Camera Phones to Inquire of the Surrounding World, The 7th International Conference on Mobile Data Management (MDM'06), p.46, [13] M. Noda, H. Sonobe, S. Takagi, et al., "Cosmos: Convenient Image Retrieval System of Flowers for Mobile Computing Situations", Proceedings of the IASTED Conf. on Information Systems and Databases 2002, pp , [14] H. Sonobe, S. Takagi, and F. Yoshimoto, "Image Retrieval System of Fishes Using a Mobile Device", Proceedings of International Workshop on Advanced Image Technology 2004, pp , 2004.
Geo-Services and Computer Vision for Object Awareness in Mobile System Applications
Geo-Services and Computer Vision for Object Awareness in Mobile System Applications Authors Patrick LULEY, Lucas PALETTA, Alexander ALMER JOANNEUM RESEARCH Forschungsgesellschaft mbh, Institute of Digital
Module II: Multimedia Data Mining
ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA Module II: Multimedia Data Mining Laurea Magistrale in Ingegneria Informatica University of Bologna Multimedia Data Retrieval Home page: http://www-db.disi.unibo.it/courses/dm/
ISSN: 2348 9510. A Review: Image Retrieval Using Web Multimedia Mining
A Review: Image Retrieval Using Web Multimedia Satish Bansal*, K K Yadav** *, **Assistant Professor Prestige Institute Of Management, Gwalior (MP), India Abstract Multimedia object include audio, video,
Natural Language Querying for Content Based Image Retrieval System
Natural Language Querying for Content Based Image Retrieval System Sreena P. H. 1, David Solomon George 2 M.Tech Student, Department of ECE, Rajiv Gandhi Institute of Technology, Kottayam, India 1, Asst.
Open issues and research trends in Content-based Image Retrieval
Open issues and research trends in Content-based Image Retrieval Raimondo Schettini DISCo Universita di Milano Bicocca [email protected] www.disco.unimib.it/schettini/ IEEE Signal Processing Society
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
Interactive Robust Multitudinous Visual Search on Mobile Devices
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 12, December 2014,
TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service
TouchPaper - An Augmented Reality Application with Cloud-Based Image Recognition Service Feng Tang, Daniel R. Tretter, Qian Lin HP Laboratories HPL-2012-131R1 Keyword(s): image recognition; cloud service;
Virtual Zoom: Augmented Reality on a Mobile Device
Virtual Zoom: Augmented Reality on a Mobile Device Sergey Karayev University of Washington Honors Thesis in Computer Science June 5, 2009 Abstract The world around us is photographed millions of times
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
Object Recognition. Selim Aksoy. Bilkent University [email protected]
Image Classification and Object Recognition Selim Aksoy Department of Computer Engineering Bilkent University [email protected] Image classification Image (scene) classification is a fundamental
Android Ros Application
Android Ros Application Advanced Practical course : Sensor-enabled Intelligent Environments 2011/2012 Presentation by: Rim Zahir Supervisor: Dejan Pangercic SIFT Matching Objects Android Camera Topic :
siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service
siftservice.com - Turning a Computer Vision algorithm into a World Wide Web Service Ahmad Pahlavan Tafti 1, Hamid Hassannia 2, and Zeyun Yu 1 1 Department of Computer Science, University of Wisconsin -Milwaukee,
The Visual Internet of Things System Based on Depth Camera
The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed
Fast Matching of Binary Features
Fast Matching of Binary Features Marius Muja and David G. Lowe Laboratory for Computational Intelligence University of British Columbia, Vancouver, Canada {mariusm,lowe}@cs.ubc.ca Abstract There has been
How To Fix Out Of Focus And Blur Images With A Dynamic Template Matching Algorithm
IJSTE - International Journal of Science Technology & Engineering Volume 1 Issue 10 April 2015 ISSN (online): 2349-784X Image Estimation Algorithm for Out of Focus and Blur Images to Retrieve the Barcode
Build Panoramas on Android Phones
Build Panoramas on Android Phones Tao Chu, Bowen Meng, Zixuan Wang Stanford University, Stanford CA Abstract The purpose of this work is to implement panorama stitching from a sequence of photos taken
Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011
Behavior Analysis in Crowded Environments XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011 Behavior Analysis in Sparse Scenes Zelnik-Manor & Irani CVPR
Local features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
Feature Tracking and Optical Flow
02/09/12 Feature Tracking and Optical Flow Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Many slides adapted from Lana Lazebnik, Silvio Saverse, who in turn adapted slides from Steve
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.
Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table
A Genetic Algorithm-Evolved 3D Point Cloud Descriptor
A Genetic Algorithm-Evolved 3D Point Cloud Descriptor Dominik Wȩgrzyn and Luís A. Alexandre IT - Instituto de Telecomunicações Dept. of Computer Science, Univ. Beira Interior, 6200-001 Covilhã, Portugal
Face Recognition For Remote Database Backup System
Face Recognition For Remote Database Backup System Aniza Mohamed Din, Faudziah Ahmad, Mohamad Farhan Mohamad Mohsin, Ku Ruhana Ku-Mahamud, Mustafa Mufawak Theab 2 Graduate Department of Computer Science,UUM
Automatic Grocery Shopping Assistant
Automatic Grocery Shopping Assistant Linda He Yi Department of Electrical Engineering Stanford University Stanford, CA [email protected] Feiqiao Brian Yu Department of Electrical Engineering Stanford University
Feasibility Study of Searchable Image Encryption System of Streaming Service based on Cloud Computing Environment
Feasibility Study of Searchable Image Encryption System of Streaming Service based on Cloud Computing Environment JongGeun Jeong, ByungRae Cha, and Jongwon Kim Abstract In this paper, we sketch the idea
MOBILITY DATA MODELING AND REPRESENTATION
PART I MOBILITY DATA MODELING AND REPRESENTATION 1 Trajectories and Their Representations Stefano Spaccapietra, Christine Parent, and Laura Spinsanti 1.1 Introduction For a long time, applications have
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES
ACCURACY ASSESSMENT OF BUILDING POINT CLOUDS AUTOMATICALLY GENERATED FROM IPHONE IMAGES B. Sirmacek, R. Lindenbergh Delft University of Technology, Department of Geoscience and Remote Sensing, Stevinweg
Character Image Patterns as Big Data
22 International Conference on Frontiers in Handwriting Recognition Character Image Patterns as Big Data Seiichi Uchida, Ryosuke Ishida, Akira Yoshida, Wenjie Cai, Yaokai Feng Kyushu University, Fukuoka,
A Method of Caption Detection in News Video
3rd International Conference on Multimedia Technology(ICMT 3) A Method of Caption Detection in News Video He HUANG, Ping SHI Abstract. News video is one of the most important media for people to get information.
3D Model based Object Class Detection in An Arbitrary View
3D Model based Object Class Detection in An Arbitrary View Pingkun Yan, Saad M. Khan, Mubarak Shah School of Electrical Engineering and Computer Science University of Central Florida http://www.eecs.ucf.edu/
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
Recognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang
Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform
DEVELOPMENT OF CAMPUS SPACE NAVIGATION AND GUIDE SYSTEM
DEVELOPMENT OF CAMPUS SPACE NAVIGATION AND GUIDE SYSTEM Yan-Chyuan Shiau Chung Hua University, Hsin-Chu, Taiwan [email protected] Tsung-Pin Tsai Chung Hua University, Hsin-Chu, Taiwan [email protected]
Euler Vector: A Combinatorial Signature for Gray-Tone Images
Euler Vector: A Combinatorial Signature for Gray-Tone Images Arijit Bishnu, Bhargab B. Bhattacharya y, Malay K. Kundu, C. A. Murthy fbishnu t, bhargab, malay, [email protected] Indian Statistical Institute,
Practical Tour of Visual tracking. David Fleet and Allan Jepson January, 2006
Practical Tour of Visual tracking David Fleet and Allan Jepson January, 2006 Designing a Visual Tracker: What is the state? pose and motion (position, velocity, acceleration, ) shape (size, deformation,
IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS
IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS Alexander Velizhev 1 (presenter) Roman Shapovalov 2 Konrad Schindler 3 1 Hexagon Technology Center, Heerbrugg, Switzerland 2 Graphics & Media
A World Wide Web Based Image Search Engine Using Text and Image Content Features
A World Wide Web Based Image Search Engine Using Text and Image Content Features Bo Luo, Xiaogang Wang, and Xiaoou Tang Department of Information Engineering The Chinese University of Hong Kong ABSTRACT
An Active Head Tracking System for Distance Education and Videoconferencing Applications
An Active Head Tracking System for Distance Education and Videoconferencing Applications Sami Huttunen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information
Asset Tracking System
Asset Tracking System System Description Asset & Person Tracking 1. General The Vizbee platform is a flexible rule based solution for RFID based applications that can adapt to the customer s needs and
The Delicate Art of Flower Classification
The Delicate Art of Flower Classification Paul Vicol Simon Fraser University University Burnaby, BC [email protected] Note: The following is my contribution to a group project for a graduate machine learning
Automatic georeferencing of imagery from high-resolution, low-altitude, low-cost aerial platforms
Automatic georeferencing of imagery from high-resolution, low-altitude, low-cost aerial platforms Amanda Geniviva, Jason Faulring and Carl Salvaggio Rochester Institute of Technology, 54 Lomb Memorial
Tracking in flussi video 3D. Ing. Samuele Salti
Seminari XXIII ciclo Tracking in flussi video 3D Ing. Tutors: Prof. Tullio Salmon Cinotti Prof. Luigi Di Stefano The Tracking problem Detection Object model, Track initiation, Track termination, Tracking
Object Recognition for the Internet of Things
Object Recognition for the Internet of Things Till Quack 1,HerbertBay 1, and Luc Van Gool 2 1 ETH Zurich, Switzerland {quack,bay}@vision.ee.ethz.ch 2 KU Leuven, Belgium [email protected] Abstract.
An Efficient Multi-Keyword Ranked Secure Search On Crypto Drive With Privacy Retaining
An Efficient Multi-Keyword Ranked Secure Search On Crypto Drive With Privacy Retaining 1 B.Sahaya Emelda and 2 Mrs. P. Maria Jesi M.E.,Ph.D., 1 PG Student and 2 Associate Professor, Department of Computer
Probabilistic Latent Semantic Analysis (plsa)
Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg [email protected] www.multimedia-computing.{de,org} References
Simultaneous Gamma Correction and Registration in the Frequency Domain
Simultaneous Gamma Correction and Registration in the Frequency Domain Alexander Wong [email protected] William Bishop [email protected] Department of Electrical and Computer Engineering University
A New Approach for Similar Images Using Game Theory
Applied Mathematical Sciences, Vol. 8, 2014, no. 163, 8099-8111 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.410790 A New Approach for Similar Images Using Game Theory O. Bencharef
Make and Model Recognition of Cars
Make and Model Recognition of Cars Sparta Cheung Department of Electrical and Computer Engineering University of California, San Diego 9500 Gilman Dr., La Jolla, CA 92093 [email protected] Alice Chu Department
MIFT: A Mirror Reflection Invariant Feature Descriptor
MIFT: A Mirror Reflection Invariant Feature Descriptor Xiaojie Guo, Xiaochun Cao, Jiawan Zhang, and Xuewei Li School of Computer Science and Technology Tianjin University, China {xguo,xcao,jwzhang,lixuewei}@tju.edu.cn
Introduction. Selim Aksoy. Bilkent University [email protected]
Introduction Selim Aksoy Department of Computer Engineering Bilkent University [email protected] What is computer vision? What does it mean, to see? The plain man's answer (and Aristotle's, too)
Mobile Image Offloading Using Cloud Computing
Mobile Image Offloading Using Cloud Computing Chintan Shah, Aruna Gawade Student, Dept. of Computer., D.J.Sanghvi College of Engineering, Mumbai University, Mumbai, India Assistant Professor, Dept. of
So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)
Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we
Toolkit for Bar Code Recognition and Resolving on Camera Phones - Jump-Starting the Internet of Things
Toolkit for Bar Code Recognition and Resolving on Camera Phones - Jump-Starting the Internet of Things Robert Adelmann Institute for Pervasive Comp. ETH Zurich, 8092 Zurich, Switzerland +41 44 632 20859
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE. [email protected]
LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE 1 S.Manikandan, 2 S.Abirami, 2 R.Indumathi, 2 R.Nandhini, 2 T.Nanthini 1 Assistant Professor, VSA group of institution, Salem. 2 BE(ECE), VSA
The STC for Event Analysis: Scalability Issues
The STC for Event Analysis: Scalability Issues Georg Fuchs Gennady Andrienko http://geoanalytics.net Events Something [significant] happened somewhere, sometime Analysis goal and domain dependent, e.g.
Low-resolution Image Processing based on FPGA
Abstract Research Journal of Recent Sciences ISSN 2277-2502. Low-resolution Image Processing based on FPGA Mahshid Aghania Kiau, Islamic Azad university of Karaj, IRAN Available online at: www.isca.in,
Implementation of Augmented Reality System for Smartphone Advertisements
, pp.385-392 http://dx.doi.org/10.14257/ijmue.2014.9.2.39 Implementation of Augmented Reality System for Smartphone Advertisements Young-geun Kim and Won-jung Kim Department of Computer Science Sunchon
Web 3.0 image search: a World First
Web 3.0 image search: a World First The digital age has provided a virtually free worldwide digital distribution infrastructure through the internet. Many areas of commerce, government and academia have
<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany
Information Systems University of Koblenz Landau, Germany Semantic Multimedia Management - Multimedia Annotation Tools http://isweb.uni-koblenz.de Multimedia Annotation Different levels of annotations
PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO
PHYSIOLOGICALLY-BASED DETECTION OF COMPUTER GENERATED FACES IN VIDEO V. Conotter, E. Bodnari, G. Boato H. Farid Department of Information Engineering and Computer Science University of Trento, Trento (ITALY)
HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER
HSI BASED COLOUR IMAGE EQUALIZATION USING ITERATIVE n th ROOT AND n th POWER Gholamreza Anbarjafari icv Group, IMS Lab, Institute of Technology, University of Tartu, Tartu 50411, Estonia [email protected]
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS Project Project Title Area of Abstract No Specialization 1. Software
FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 2, February 2014,
Bernice E. Rogowitz and Holly E. Rushmeier IBM TJ Watson Research Center, P.O. Box 704, Yorktown Heights, NY USA
Are Image Quality Metrics Adequate to Evaluate the Quality of Geometric Objects? Bernice E. Rogowitz and Holly E. Rushmeier IBM TJ Watson Research Center, P.O. Box 704, Yorktown Heights, NY USA ABSTRACT
Mean-Shift Tracking with Random Sampling
1 Mean-Shift Tracking with Random Sampling Alex Po Leung, Shaogang Gong Department of Computer Science Queen Mary, University of London, London, E1 4NS Abstract In this work, boosting the efficiency of
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 269 Class Project Report
Automatic 3D Reconstruction via Object Detection and 3D Transformable Model Matching CS 69 Class Project Report Junhua Mao and Lunbo Xu University of California, Los Angeles [email protected] and lunbo
Automatic Labeling of Lane Markings for Autonomous Vehicles
Automatic Labeling of Lane Markings for Autonomous Vehicles Jeffrey Kiske Stanford University 450 Serra Mall, Stanford, CA 94305 [email protected] 1. Introduction As autonomous vehicles become more popular,
Image Classification for Dogs and Cats
Image Classification for Dogs and Cats Bang Liu, Yan Liu Department of Electrical and Computer Engineering {bang3,yan10}@ualberta.ca Kai Zhou Department of Computing Science [email protected] Abstract
Introduction to Computer Graphics
Introduction to Computer Graphics Torsten Möller TASC 8021 778-782-2215 [email protected] www.cs.sfu.ca/~torsten Today What is computer graphics? Contents of this course Syllabus Overview of course topics
Evaluation of Open Source Data Cleaning Tools: Open Refine and Data Wrangler
Evaluation of Open Source Data Cleaning Tools: Open Refine and Data Wrangler Per Larsson [email protected] June 7, 2013 Abstract This project aims to compare several tools for cleaning and importing
PERSONAL MOBILE DEVICE FOR SITUATED INTERACTION
PERSONAL MOBILE DEVICE FOR SITUATED INTERACTION YANG-TING SHEN, TAY-SHENG TENG Information Architecture Lab, Department of Architecture, National Cheng Kung University, Taiwan. [email protected]
A Methodological Shift in Building Design through Development of Collaborative Design Platforms
ctbuh.org/papers Title: Authors: Subject: Keywords: A Methodological Shift in Building Design through Development of Collaborative Design Platforms Jonatan Schumacher, CORE Studio, Thornton Tomasetti Matthew
Randomized Trees for Real-Time Keypoint Recognition
Randomized Trees for Real-Time Keypoint Recognition Vincent Lepetit Pascal Lagger Pascal Fua Computer Vision Laboratory École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne, Switzerland Email:
OBJECT TRACKING USING LOG-POLAR TRANSFORMATION
OBJECT TRACKING USING LOG-POLAR TRANSFORMATION A Thesis Submitted to the Gradual Faculty of the Louisiana State University and Agricultural and Mechanical College in partial fulfillment of the requirements
A Cognitive Approach to Vision for a Mobile Robot
A Cognitive Approach to Vision for a Mobile Robot D. Paul Benjamin Christopher Funk Pace University, 1 Pace Plaza, New York, New York 10038, 212-346-1012 [email protected] Damian Lyons Fordham University,
International Journal of Advanced Information in Arts, Science & Management Vol.2, No.2, December 2014
Efficient Attendance Management System Using Face Detection and Recognition Arun.A.V, Bhatath.S, Chethan.N, Manmohan.C.M, Hamsaveni M Department of Computer Science and Engineering, Vidya Vardhaka College
Determining optimal window size for texture feature extraction methods
IX Spanish Symposium on Pattern Recognition and Image Analysis, Castellon, Spain, May 2001, vol.2, 237-242, ISBN: 84-8021-351-5. Determining optimal window size for texture feature extraction methods Domènec
Filling the Semantic Gap: A Genetic Programming Framework for Content-Based Image Retrieval
INSTITUTE OF COMPUTING University of Campinas Filling the Semantic Gap: A Genetic Programming Framework for Content-Based Image Retrieval Ricardo da Silva Torres [email protected] www.ic.unicamp.br/~rtorres
The use of computer vision technologies to augment human monitoring of secure computing facilities
The use of computer vision technologies to augment human monitoring of secure computing facilities Marius Potgieter School of Information and Communication Technology Nelson Mandela Metropolitan University
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS Norbert Buch 1, Mark Cracknell 2, James Orwell 1 and Sergio A. Velastin 1 1. Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE,
A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms
A Comparative Study between SIFT- Particle and SURF-Particle Video Tracking Algorithms H. Kandil and A. Atwan Information Technology Department, Faculty of Computer and Information Sciences, Mansoura University,El-Gomhoria
STORE VIEW: Pervasive RFID & Indoor Navigation based Retail Inventory Management
STORE VIEW: Pervasive RFID & Indoor Navigation based Retail Inventory Management Anna Carreras Tànger, 122-140. [email protected] Marc Morenza-Cinos Barcelona, SPAIN [email protected] Rafael Pous
Colorado School of Mines Computer Vision Professor William Hoff
Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Introduction to 2 What is? A process that produces from images of the external world a description
Augmented Reality Tic-Tac-Toe
Augmented Reality Tic-Tac-Toe Joe Maguire, David Saltzman Department of Electrical Engineering [email protected], [email protected] Abstract: This project implements an augmented reality version
