Domain Engineering in GIS: Metadata Standard Interoperability

Similar documents
Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web

Advanced Meta-search of News in the Web

Implementing Ontology-based Information Sharing in Product Lifecycle Management

Ontology and automatic code generation on modeling and simulation

Information and documentation The Dublin Core metadata element set

Databases in Organizations

OWL based XML Data Integration

data.bris: collecting and organising repository metadata, an institutional case study

Network Working Group

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

CONDIS. IT Service Management and CMDB

Designing a Semantic Repository

Performance Analysis, Data Sharing, Tools Integration: New Approach based on Ontology

Combining RDF and Agent-Based Architectures for Semantic Interoperability in Digital Libraries

An Approach to Eliminate Semantic Heterogenity Using Ontologies in Enterprise Data Integeration

Service Oriented Architecture

Mapping VRA Core 4.0 to the CIDOC/CRM ontology

From MARC21 and Dublin Core, through CIDOC CRM: First Tenuous Steps towards Representing Library Data in FRBRoo

Building Semantic Content Management Framework

1 Building a metadata schema where to start 1

Integrating XML Data Sources using RDF/S Schemas: The ICS-FORTH Semantic Web Integration Middleware (SWIM)

A Java Tool for Creating ISO/FGDC Geographic Metadata

Firewall Builder Architecture Overview

- a Humanities Asset Management System. Georg Vogeler & Martina Semlak

Application of ontologies for the integration of network monitoring platforms

A RDF Vocabulary for Spatiotemporal Observation Data Sources

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee Berlin, Germany.

ONTOLOGY-BASED MULTIMEDIA AUTHORING AND INTERFACING TOOLS 3 rd Hellenic Conference on Artificial Intelligence, Samos, Greece, 5-8 May 2004

Guideline for Implementing the Universal Data Element Framework (UDEF)

ECM Governance Policies

Service Road Map for ANDS Core Infrastructure and Applications Programs

XML. CIS-3152, Spring 2013 Peter C. Chapin

Templates and Archetypes: how do we know what we are talking about?

Semantics, SaaS and STASIS

Australian Recordkeeping Metadata Schema. Version 1.0

7. Classification. Business value. Structuring (repetition) Automation. Classification (after Leymann/Roller) Automation.

XML for Manufacturing Systems Integration

A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches

How To Write An Inspire Directive

SNOMED-CT

UNIMARC, RDA and the Semantic Web

FHIM Model Content Overview

Combining SAWSDL, OWL DL and UDDI for Semantically Enhanced Web Service Discovery

Semantic Interoperability

Notes about possible technical criteria for evaluating institutional repository (IR) software

CERN Document Server

XML- New meta language in e-business

Lightweight Data Integration using the WebComposition Data Grid Service

Secure Semantic Web Service Using SAML

Filtering the Web to Feed Data Warehouses

FIPA agent based network distributed control system

Integration of Polish National Bibliography within the repository platform for science and humanities

Reusable Knowledge-based Components for Building Software. Applications: A Knowledge Modelling Approach

Toward an Ontology Architecture for Cyber-Security Standards

A terminology model approach for defining and managing statistical metadata

1 Introduction XML IN THE BISIS LIBRARY MANAGEMENT SYSTEM 1. Novi Sad J. Math. Vol. 41, No. 2, 2011,

Using Dublin Core for DISCOVER: a New Zealand visual art and music resource for schools

AN AUTOMATIC AND METHODOLOGICAL APPROACH FOR ACCESSIBLE WEB APPLICATIONS

Metadata Hierarchy in Integrated Geoscientific Database for Regional Mineral Prospecting

Department of Defense Net-Centric Data Strategy

ISO and OGC Service Architecture

DDI Lifecycle: Moving Forward Status of the Development of DDI 4. Joachim Wackerow Technical Committee, DDI Alliance

Creating an EAD Finding Aid. Nicole Wilkins. SJSU School of Library and Information Science. Libr 281. Professor Mary Bolin.

Requirements. Approaches to user support. Command assistance. Chapter 11 User support

Software Engineering. Software Engineering. Component-Based. Based on Software Engineering, 7 th Edition by Ian Sommerville

Parsing Technology and its role in Legacy Modernization. A Metaware White Paper

Linked Medieval Data: Semantic Enrichment and Contextualisation to Enhance Understanding and Collaboration

METADATA STANDARDS AND METADATA REGISTRIES: AN OVERVIEW

Monitoring and Reporting Drafting Team Monitoring Indicators Justification Document

Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object

Extending SOA Infrastructure for Semantic Interoperability

Introduction to Service Oriented Architectures (SOA)

Semantic Method of Conflation. International Semantic Web Conference Terra Cognita Workshop Oct 26, Jim Ressler, Veleria Boaten, Eric Freese

An Implementation of a Semantic, Web-Based Virtual Machine Laboratory Prototyping Environment

BACKGROUND. Namespace Declaration and Qualification

METS and the CIDOC CRM a Comparison

Amit Sheth & Ajith Ranabahu, Presented by Mohammad Hossein Danesh

Common Warehouse Metamodel (CWM): Extending UML for Data Warehousing and Business Intelligence

Publishing Linked Data Requires More than Just Using a Tool

Semantic Modeling with RDF. DBTech ExtWorkshop on Database Modeling and Semantic Modeling Lili Aunimo

NSW Government Open Data Policy. September 2013 V1.0. Contact

STAR Semantic Technologies for Archaeological Resources.

Links, languages and semantics: linked data approaches in The European Library and Europeana.

Committee on WIPO Standards (CWS)

Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint

A Semantic web approach for e-learning platforms

Appendix B Data Quality Dimensions

Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution

Background: Business Value of Enterprise Architecture TOGAF Architectures and the Business Services Architecture

Information Model Architecture. Version 2.0

ISSUES ON FORMING METADATA OF EDITORIAL SYSTEM S DOCUMENT MANAGEMENT

Baba Piprani. Canada

technische universiteit eindhoven WIS & Engineering Geert-Jan Houben

Semantic Information on Electronic Medical Records (EMRs) through Ontologies

DFS C Open Data Policy

The Ontological Approach for SIEM Data Repository

CHAPTER 1 INTRODUCTION

European Forest Information and Communication Platform

BC Geographic Warehouse. A Guide for Data Custodians & Data Managers

Transcription:

Domain Engineering in GIS: Metadata Standard Interoperability Anne Elise Weiss April 18, 2012

Introduction Earlier in the course: Discussed a system storing information about different types of things (concerts, restaurants, train schedules, etc.) Would mean: Combining information from many different domains, utilizing it in the GIS domain Need: metadata interoperability

Metadata Different definition proposals (by ISO): Data describing and documenting data Data about datasets and usage aspects of it Data about the content, quality, condition, and other characteristics of data In a GIS context: Metadata defined for geographical information mainly has a documenting role. The information which is given to the user is related to the dataset structure and to its contents.

Interoperability Interoperability: The ability to develop conventions that enable data exchange and integration. Semantic interoperability: agreement about content description standards. Growing number and complexity of metadata standards more difficult and tedious to handle information in different standards.

Metadata standards Ideally: one unique metadata standard xkcd.com/927

Metadata standards One possible solution: Create a mechanism enabling translation in order to make it conform to different standards crosswalks.

Example of a cross-domain application We have three different databases store metadata of different types: Library items (books, reports, other documents) Events (movies, theatre, recitals, etc.) Geographic data (maps, satellite images, etc.) Together, they can be used for e.g. tourist information The standard used in each metadata database belongs to a distinct domain, so they are different standards.

Example of a cross-domain application The tourist information provider system should use a homogeneous mechanism for querying and accessing the databases. I.e: the metadata schema of the system should be independent of the metadata representation used by the different databases. The system uses one standard for querying and managing information, e.g. Dublin Core. The homogeneous mechanism acts as a crosswalk broker A repository of crosswalks The software for activating and processing the crosswalks when needed

Example of a cross-domain application A crosswalk specifies the mapping between two related standards Problems with crosswalks: the construction is difficult and error-prone the construction requires deep knowledge and lots of experience the standards have mostly been developed independently maintenance Therefore: harmonization is vital, makes it easier to match the metadata elements

Domain Engineering and metadata crosswalks The problems associated with developing metadata crosswalks are not constrained to a specific application domain. Geographic information metadata concepts: Geographic information: describes phenomena associated directly or indirectly with a location The geographic metadata: describes the content, quality, condition and other characteristics of the data that allow a person to locate data and to understand them There is a range of different existing standards

Different approaches Main approaches to the semantic interoperability problem: 1. Solutions using ontologies 2. Create specific crosswalks for one-to-one mapping

Ontology-based semantic interoperability There exist many initiatives that aim towards solving the semantic interoperability problem on the Web Most of these propose using ontologies and RDF (Resource Description Framework): simple model for describing the interrelationships among resources named properties and values RDFS (RDF Schema/RDF Vocabulary Language) for the declaration and interpretation of those properties Closely related to the Semantic Web

Ontology-based semantic interoperability There exist alternatives to RDF technologies These approaches offer flexible solutions for interoperability. Ambitious aim of flexibility lack of accuracy in the performed mappings? No local structural constraints considered

Ontology-based semantic interoperability the wider the targeted scope of interoperability, the more difficult it is to achieve accurate, precise mappings For a small set of metadata standards, hardwired crosswalks may result more adequate than ontology-based solutions In the geographic information context, the set of metadata standards is small and syntax and semantics are relatively fixed

Crosswalk-based semantic interoperability Many different mappings have been made: MARC 21 to Dublin Core Dublin Core to USMARC Dublin Core to EAD/GILS/USMARC... Mostly, the only result included is the mapping table, almost no one offers details about the process.

Construction of crosswalks between metadata standards 1. Harmonization obtain a formal and homogeneous specification of both standards 2. Semantic mapping mapping table to determine the semantic correspondence of elements between the standards 3. Additional rules for metadata conversion to solve problems like differing hierarchy levels, data type conversions, etc. 4. Mapping implementation obtain a completely automated crosswalk (application of some tool)

Harmonization Standards often have properties that are very similar If had a fixed way of describing these properties: every metadata standard could be described in a similar way similar processes could be applied to related metadata standards standards implementation would be simplified development of new crosswalks between them would be simplified

Harmonization Generalization and formalisation: by means of a canonical representation or a specification language Because most standards use XML as exchange and presentation format, they also provide a DTD or XML Schema formally describing the syntax But: a mere syntactic description is not enough to store the necessary information to automate development of crosswalks Therefore: propose to create a table describing the elements of each standard apart from the available DTD

Semantic mapping Specification of a mapping between each element in the origin standard and the semantically equivalent element in the target standard Need a clear and precise definition of the elements Many metadata standards already provide a semantic mapping with related standards A mapping table should be produced at the end of this phase

Additional rules for metadata conversion Crosswalk: set of transformations a completely specified crosswalk: a table of semantic mappings + a metadata conversion specification

Additional rules for metadata conversion Content conversion Elements are frequently restricted to contain a particular data type, range of values or controlled vocabulary Analogous elements in different standards may have different content restrictions specific rules are required to establish the correspondence It is necessary to establish the relationship between values on a one-to-one basis.

Additional rules for metadata conversion Element-to-element mapping Properties specified with each element: whether repeatable or not, mandatory or not. Possibly non-trivial cases for crosswalks: One-to-many: Trivial in most cases, but not all Many-to-one: Must specify what to do with the extra elements. E.g. explicit concatenation rules, or rules for which value to select. Extra/unresolved elements in source/target

Additional rules for metadata conversion Hierarchy Most metadata standards organize their metadata hierarchically Crosswalks must consider the possible differences between source and target The mapping table itself shows the elements organized hierarchically in every standard

Mapping implementation Automated implementation of crosswalks: the use of style sheets Most of the mentioned metadata standards use XML most suitable technology to carry out implementation of crosswalks: XSL (extensible Stylesheet Language). Purpose of XSL: manipulation and transformation of XML. Transformation method: constructing the style sheet applying to the original XML-document Each section is transformed by applying the previously defined mapping tables

Example transformation Dublin Core (ISO 15836): 15 basic descriptors (result of an international and interdisciplinary consensus) ISO 19115: defines the schema required for describing geographic information and services

Example transformation Main component of the crosswalk: the mapping between the standards Some DC elements cannot be mapped to ISO 19115 Core, but to ISO 19115 Comprehensive. Some elements from ISO 19115 Core that have no direct correspondence with elements from DC

Conclusions Organizations aim at migrating towards ISO Also asked to provide a generic description Would be more sensible to maintain metadata in accordance with a unique standard and produced by a stable cataloguing tool Crosswalks would be applied when other views are required Crosswalks must be constructed by means of formalized methods Next step: prove the utility in the construction of search applications

References Metadata standard interoperability: application in the geographic information domain (2004) Integrating geographic and non-geographic data search services using metadata crosswalks (2003) Metadata And GIS: A Classification of Metadata for GIS (1998)