OVERVIEW OF JPSEARCH: A STANDARD FOR IMAGE SEARCH AND RETRIEVAL



Similar documents
Semantic Search in Portals using Ontologies

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities

Service Oriented Architecture

In ediscovery and Litigation Support Repositories MPeterson, June 2009

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG1 CODING OF STILL PICTURES

Principal MDM Components and Capabilities

CitationBase: A social tagging management portal for references

CLARIN-NL Third Call: Closed Call

Secure Semantic Web Service Using SAML

Department of Defense Net-Centric Data Strategy

XML for Manufacturing Systems Integration

Standard-Compliant Streaming of Images in Electronic Health Records

SIMATIC IT Production Suite Answers for industry.

Functional Requirements for Digital Asset Management Project version /30/2006

Talend Metadata Manager. Reduce Risk and Friction in your Information Supply Chain

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Chapter. Solve Performance Problems with FastSOA Patterns. The previous chapters described the FastSOA patterns at an architectural

Scalable End-User Access to Big Data HELLENIC REPUBLIC National and Kapodistrian University of Athens

Security challenges for internet technologies on mobile devices

Rotorcraft Health Management System (RHMS)

Annotea and Semantic Web Supported Collaboration

A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems

SERVICE-ORIENTED MODELING FRAMEWORK (SOMF ) SERVICE-ORIENTED SOFTWARE ARCHITECTURE MODEL LANGUAGE SPECIFICATIONS

A Secure Autonomous Document Architecture for Enterprise Digital Right Management

TRANSFoRm: Vision of a learning healthcare system

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

The Value of Taxonomy Management Research Results

Published International Standards Developed by ISO/IEC JTC 1/SC 37 - Biometrics

WHITE PAPER MANAGING VALUABLE IMAGE ASSETS USING DIGITAL WATERMARKING FOR COPYRIGHT COMMUNICATION AND IMAGE SEARCH INTRODUCTION

Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness

Introduction to Service Oriented Architectures (SOA)

Streamlining the drug development lifecycle with Adobe LiveCycle enterprise solutions

Ontology for Home Energy Management Domain

Digital Asset Management System Quality Assurance Testing A Case Study

Tableau Metadata Model

ECM Governance Policies

Lightweight Data Integration using the WebComposition Data Grid Service

Adobe Certified Expert Program

Flattening Enterprise Knowledge

Security and privacy for multimedia database management systems

Azure Scalability Prescriptive Architecture using the Enzo Multitenant Framework

APPLICATION COMPLIANCE AUDIT & ENFORCEMENT

Digital Asset Management

Introduction to UDDI: Important Features and Functional Concepts

E-Business Technologies for the Future

Search Taxonomy. Web Search. Search Engine Optimization. Information Retrieval

M3039 MPEG 97/ January 1998

Appendix A. Functional Requirements: Document Management

FIPA agent based network distributed control system

IBM Enterprise Content Management Product Strategy

SHared Access Research Ecosystem (SHARE)

Interoperable Cloud Storage with the CDMI Standard

Extending SOA Infrastructure for Semantic Interoperability

A DICOM-based Software Infrastructure for Data Archiving

Turkey s National Health Information System (NHIS)

Ontology and automatic code generation on modeling and simulation

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy

Team Collaboration, Version Management, Audit Trails

Master s Program in Information Systems

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington

EXPLOITING FOLKSONOMIES AND ONTOLOGIES IN AN E-BUSINESS APPLICATION

From the Monolith to Microservices: Evolving Your Architecture to Scale. Randy linkedin.com/in/randyshoup

IBM Rational Asset Manager

The Recipe for Sarbanes-Oxley Compliance using Microsoft s SharePoint 2010 platform

Presente e futuro del Web Semantico

JOURNAL OF OBJECT TECHNOLOGY

Meta-Model specification V2 D

How To Manage Your Digital Assets On A Computer Or Tablet Device

MOBILE ARCHITECTURE FOR DYNAMIC GENERATION AND SCALABLE DISTRIBUTION OF SENSOR-BASED APPLICATIONS

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Enabling Storage Services in Virtualized Cloud Environments

IBM Lotus Domino Document Manager 6.5.1

Guideline for Implementing the Universal Data Element Framework (UDEF)

Managing explicit knowledge using SharePoint in a collaborative environment: ICIMOD s experience

Standard of the Camera & Imaging Products Association. White Paper. of CIPA DC Picture Transfer Protocol over TCP/IP networks

Self-Protecting Documents for Cloud Storage Security

Fogbeam Vision Series - The Modern Intranet

CLOUD BASED SEMANTIC EVENT PROCESSING FOR

IBM WebSphere ILOG Rules for.net

SOA REFERENCE ARCHITECTURE: WEB TIER

Service-Oriented Architecture and Software Engineering

MOOCviz 2.0: A Collaborative MOOC Analytics Visualization Platform

How To Make Sense Of Data With Altilia

Transcription:

OVERVIEW OF JPSEARCH: A STANDARD FOR IMAGE SEARCH AND RETRIEVAL Frédéric Dufaux, Michael Ansorge, and Touradj Ebrahimi Institut de Traitement des Signaux Ecole Polytechnique Fédérale de Lausanne (EPFL) CH-1015 Lausanne, Switzerland frederic.dufaux@epfl.ch, michael.ansorge@epfl.ch, touradj.ebrahimi@epfl.ch ABSTRACT In this paper, we review the on-going JPSearch standardization activity. Its goal is to provide a standard for interoperability for image search and retrieval systems. More specifically, JPSearch aims at defining the interfaces and protocols for data exchange between devices and systems. 1. INTRODUCTION Powerful image compression and coding standards, along with rapid evolution in computer technologies, image sensors, storage devices and networking, have made digital imaging a tremendous success with consumers and businesses. As a result, a phenomenal growth in personal, professional and shared digital image collections has been observed in recent years. However, searching and managing these large distributed collections represents a considerable challenge. With the evolution from film to digital camera, it has become easy, fast and free to take lots of photographs. Yet, to organize the resulting digital photo shoebox, a modern variation of the traditional photo shoebox, remains cumbersome. Photo organizer applications allow for a simple classification of images to generate albums or slideshows, but often require manually annotating the whole collection, a very tedious and time-consuming process. Furthermore, because the portability of data and metadata is not guaranteed, the consumer is de facto locked into one system. Stock photography companies, such as Getty Images [1] or Corbis [2], boast tens of millions of digital photographs. These photographs are labeled and organized based on professionally developed taxonomies. The tremendous cost of generating the annotation is justified and recouped by the high commercial value of the content. More recently, photo sharing websites, such as Flickr [3], have gained in popularity, fueled by innovative online community tools that allow users to categorize images by use of tags, in order to allow searchers to easily find images concerning a certain topic. The resulting folksonomy [4] consists of collaboratively generated, open-ended labels authored by the users, and sometimes submitters, of the content. However, again these systems are closed and lock up the user. The JPEG committee, formally known as ISO/IEC JTC1 SC29 WG 1, has throughout the years developed and brought to market successful standards for data representation such as JPEG [5] and JPEG 2000 [6]. Recognizing the need of a standard for interoperability for image search and retrieval systems, JPEG has recently launched a new activity, JPSearch [7][8], also known more formally as ISO/IEC 24800. The aim of this standard is to allow different image management systems to inter-operate. It is foreseen that JPSearch will enable more complete solutions and give consumers and businesses confidence in the longevity of their annotations and collection maintenance effort. In this paper, we review the on-going JPSearch standardization activity. As of the time of this writing, JPSearch is still at an early stage. Per consequent, the description below is still subject to possibly significant changes. This paper is structured as follow. We discuss the main features of JPSearch in Sec. 2. We then introduce the JPSearch framework and system architecture in Sec. 3. The structure of the JPSearch specification is outlined in Sec. 4, where the scope of each part is detailed. In Sec. 5, we present a number of relevant use cases. Finally, a summary is given in Sec. 6. 2. JPSEARCH OVERVIEW JPSearch aims at inter-operability between devices and systems by defining the interfaces and protocols for data

exchange between them, while restricting as little as possible how those devices, systems or components perform their task. Existing search systems are implemented in a way that tightly couples many elements of the search process. JPSearch provides an abstract framework as well as a modular and flexible search architecture that allows an alignment of system design to a standard framework. In this framework, inter-operability can be defined in many ways, for example between selfcontained vertical image search systems that interact to provide federated search, between the different layers of image search so that these components could be supplied by different best-of-breed vendors, or at the metadata level such that different systems may add, update or query metadata for images and image collections. In particular, JPSearch facilitates the use and reuse of metadata. A user makes a heavy investment when annotating a collection of images. With JPSearch, the portability of the metadata is guaranteed, hence allowing a user to subsequently migrate to applications or systems which best suit his needs. In community based image sharing systems, this portability enables the owner of an image collection to merge community metadata back into his own management system, hence helping to overcome the manual annotation bottleneck. Similarly, JPSearch makes possible the use and reuse of ontologies to provide a common language for contexts. Indeed, searching for images always takes place in a context, either implicit or explicit. A common format for handling context allows a user to carry his context with him to different search engines. It also allows the context to be owned by the user and not by the system, hence protecting the user s privacy. JPSearch also provides a common query language, giving search providers a reference standard to remove ambiguity in the formulation of a query, and to make searching over shared repositories consistent. The common query language also defines query management process such as relevance feedback. Finally, by providing a solution for the carriage of image collections and associated metadata between compliant devices and systems, JPSearch enables image search and retrieval functionality across multiple repositories. Therefore, it allows leveraging the generally high cost of creating metadata. 3. JPSEARCH FRAMEWORK AND SYSTEM ARCHITECTURE A JPSearch architecture has been defined, as illustrated in Figure 1. This architecture is generic in the sense that most existing image search systems can be straightforwardly mapped to it. The architecture is divided into four layers, namely: User layer: The user layer aims at the personalization of the search service. Query layer: The query layer is dealing with the formulation of the query and presentation of the results. Management layer: The management layer is handling the distribution of a search task over multiple image repositories. Content layer: The content layer includes the image repositories and the associated metadata, as well as the corresponding schema and ontology. Note that while the query, management and content layers are in the scope of JPSearch, the user layer is outside of the scope. In turn, four independent functional modules are involved in the architecture, as detailed below: Query process: The query process is part of the query layer. It aims at the efficient execution of search tasks. More specifically, it forms a machine understandable query from the user s search task which is then conveyed to the subsequent management and content layers. Conversely, it validates the search results and ranks them according to the user s criteria. Finally, the query process may also generate a new query based on relevance feedback mechanisms. Repository management process: The repository management process belongs to the management layer. Its purpose is to allow users to simultaneously search multiple distributed image repositories with the same query. Another purpose is to comprehensively aggregate the results returned from the image repositories. Image repository process: The image repository process is located in the content layer and supplies basic search functionalities. More specifically, this includes receiving a set of queries, executing the matching of these queries with the stored metadata, and forming a result. Metadata creation and update process: The metadata creation and update process is also part of the content layer. On the one hand, this process enables to build metadata using a proper schema and ontology definition. On the other hand, this process also provides with the functionality to update the metadata, e.g. by adding, replacing, removing all or part of the metadata. In particular, whenever the image content is modified, it verifies that the metadata is suitably brought up to date.

4. JPSEARCH SPECIFICATION AND SCOPE JPSearch is a multi-part specification. Five parts are currently envisioned, as detailed hereafter: Part-1: Framework and system components: This part is a type-3 technical report which provides a global view of JPSearch [8]. In particular, the technical report reviews the traditional approaches to image search and motivates the importance of the user in the search process. It describes a number of use cases in order to identify user needs and requirements. It then explains the overall search and management process. This leads to the introduction of a JPSearch architecture composed of 4 layers: user layer, query layer, management layer and content layer. Finally, the technical report outlines the organization of the JPSearch specification. Part-2: Schema and ontology registration and identification: This part standardizes a platformindependent format for the import, export and exchange of ontologies. It also defines a registry of ontologies which can be imported into a JPSearch compliant system. Finally, it standardizes basic functions to query and manipulate one or more ontologies in a repository. Part-3: JPSearch query format: This part provides three standardized functionalities between users and image repositories. Firstly, it allows users to express their search criteria. Secondly, it allows users to describe the aggregated return result sets for user presentation or machine consumption. Thirdly, it defines query management processes such as relevance feedback. Part-4: Metadata embedded in image data (JPEG-1 and JPEG-2000) file format: This part specifies image data exchange format with associated metadata to accelerate the re-use of metadata. It supports two functionalities, namely the mobility of metadata and the persistent association of metadata with image. Part-5: Data interchange format between image repositories: This part standardizes a format for the exchange of image collections and respective metadata between JPSearch compliant repositories. The data interchange format enables the synchronization of repositories in order to facilitate simple and fully interoperable exchange across different devices and platforms. The overall structure of the JPSearch standard is illustrated in Figure 2, where the role of each part of the specification is shown in the previously defined JPSearch architecture. 5. USE CASES To define the scope of JPSearch, use cases illustrating the current problems with image search and management have been identified. An extended and detailed list of these use cases can be found in [8], while some of them are mentioned hereafter: Searching images in stock photo collections for usage in magazines: The user wishes to buy a selection of images in order to illustrate a publication to be sold to consumers. Mobile tourist information: A tourist is in an unfamiliar place, sees an interesting landmark and wants to know what it is. He takes a picture of the landmark on his mobile phone and sends it to a tourist information server which calls him back and gives him the information. Surveillance search from desktop to mobile device with alerts: The user sets up a visual surveillance query on a desktop computer, saves the query for real-time monitoring with results saved periodically for retrieval from a mobile device. Finding illegal or unauthorized use of images: The user holds his original content. He wants to find unauthorized variations of his original content using search engines. Matching images between collections for synchronization: The user ends up with a large collection of images stored on multiple computers, laptops, external drives, portable USB key drives, portable photo players, photo sharing websites, photo printing websites, and mobile phones. He needs his collection to be synchronized across all platforms. Image search in the medical domain: Many pathologies have visual symptoms which are essential for doctors doing diagnosis. There are cases where the visual symptoms are not familiar and the doctor would like to consult a visual reference material (an atlas, or previously diagnosed reference cases). The user (a doctor in a hospital clinic) searches then for the best matches to the symptoms and retrieves case histories (including other images, metadata, text, etc.) to aid his/her diagnosis. Servants image searchers: The user accesses content by means of a standard web browser. The web site forwards the user s requests to the application using a standard communication protocol. For each provided query, the application creates a personalized intelligent search agent that will effectively take care of the search on behalf of the user.

Open federated repositories: Digital archives set up access to part of their databases for being consulted from cultural and institutional search interfaces. The data provider manages the database and provides a standardized protocol to consult the data. The service provider manages in turn the added value services (search and retrieval and presentation), harvests the collection and provides the user interface. 6. SUMMARY In this paper, we have reviewed the on-going JPSearch standardization activity. JPSearch aims at a standard for interoperability for image search and retrieval systems. We have first discussed the limitations of current systems and motivations for JPSearch. We have then introduced the JPSearch framework and system architecture. Next, we have discussed the scope of each part of the JPSearch specification. Finally, we have presented a number of relevant use cases. 7. ACKNOWLEDGEMENT This work was partially supported by the COST Action 292 on Semantic Multimodal Analysis of Digital Media, and by the European Commission under the IST research network of excellence K-SPACE of the 6th Framework program (contract FP6-027026). This paper expresses the view of the authors but not necessarily the view of K- SPACE. The authors would like to thank the editors of JPSearch, Mun-Kew Leong, Young Seop Kim, Wo Chang, and Akio Yamada, as well as all the other individuals who participate in the development of JPSearch. 8. REFERENCES [1] http://www.gettyimages.com [2] http://www.corbis.com [3] http://www.flickr.com [4] http://en.wikipedia.org/wiki/folksonomy [5] W.B. Pennebaker and J.L. Mitchell, JPEG: Still Image Data Compression Standard, Van Nostrand Reinhold, New York, 1993. [6] D. Taubman and M. Marcellin, JPEG 2000: Image Compression Fundamentals, Standards and Practice, Kluwer Academic Publishers, 2002. [7] ISO/IEC JTC1/SC29/WG1 WG1N4108, JPSearch Executive White Paper, December 2006. [8] ISO/IEC JTC1/SC29/WG1 WG1N4083, ISO/IEC PDTR 24800-1: Framework and System Components, November 2006.

Figure 1: JPSearch architecture (adapted from [8]).

Figure 2: Overall structure of JPSearch (adapted from [8]).