Tutorial 1: Basic Semantics. Stuart A. Sutton University of Washington DC-2007, Singapore 27 August 2007

Similar documents
Using Dublin Core for DISCOVER: a New Zealand visual art and music resource for schools

Network Working Group

1 Building a metadata schema where to start 1

Integration of Heterogeneous Metadata in Europeana. Cesare Concordia Institute of Information Science and Technology-CNR

European Forest Information and Communication Platform

Information and documentation The Dublin Core metadata element set

Queensland recordkeeping metadata standard and guideline

Encoding Library of Congress Subject Headings in SKOS: Authority Control for the Semantic Web

REACCH PNA Data Management Plan

Working with the British Library and DataCite A guide for Higher Education Institutions in the UK

Introduction to Service Oriented Architectures (SOA)

UNIMARC, RDA and the Semantic Web

DC2AP: A Dublin Core Application Profile to Analysis Patterns

Government of Alberta Metadata Management Glossary

GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS

METADATA STANDARDS AND GUIDELINES RELEVANT TO DIGITAL AUDIO

Claremont Colleges Digital Library Dublin Core Metadata Elements Best Practices Version 2 January 2008

The Institutional Repository at West Virginia University Libraries: Resources for Effective Promotion

Metadata Quality Control for Content Migration: The Metadata Migration Project at the University of Houston Libraries

data.bris: collecting and organising repository metadata, an institutional case study

ISSUES ON FORMING METADATA OF EDITORIAL SYSTEM S DOCUMENT MANAGEMENT

Spatial Information Data Quality Guidelines

James Hardiman Library. Digital Scholarship Enablement Strategy

NSW Government Standard Approach to Information Architecture. December 2013 v.1.0

Joint Steering Committee for Development of RDA

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

BIBFRAME Pilot (Phase One Sept. 8, 2015 March 31, 2016): Report and Assessment

Bibliographic Standards

Data Management Considerations for the Data Life Cycle

Question/Answer Transaction Protocol Overview First Committee Working Draft NISO Committee AZ: September 22, 2003

Vilas Wuwongse, Thiti Vacharasintopchai, Neelawat Intaraksa Asian Institute of Technology

DDI Lifecycle: Moving Forward Status of the Development of DDI 4. Joachim Wackerow Technical Committee, DDI Alliance

CERN Document Server

DATA MODEL FOR STORAGE AND RETRIEVAL OF LEGISLATIVE DOCUMENTS IN DIGITAL LIBRARIES USING LINKED DATA

METS and the CIDOC CRM a Comparison

Semantic Interoperability

Call for experts for INSPIRE maintenance & implementation

ROLE OF METADATA IN DIGITAL RESOURCE MANAGEMENT

Australian Recordkeeping Metadata Schema. Version 1.0

The Rutgers Workflow Management System. Workflow Management System Defined. The New Jersey Digital Highway

Cloud Data Management Interface (CDMI) The Cloud Storage Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG

AGLS Victoria Metadata Implementation Manual. Information Victoria

Service Road Map for ANDS Core Infrastructure and Applications Programs

System Requirements for Archiving Electronic Records PROS 99/007 Specification 1. Public Record Office Victoria

Making Content Easy to Find. DC2010 Pittsburgh, PA Betsy Fanning AIIM

Notes about possible technical criteria for evaluating institutional repository (IR) software

CHAPTER 1 INTRODUCTION

Healthcare, transportation,

NSW Government Open Data Policy. September 2013 V1.0. Contact

From MARC21 and Dublin Core, through CIDOC CRM: First Tenuous Steps towards Representing Library Data in FRBRoo

Using Dataverse Virtual Archive Technology for Research Data Management. Jonathan Crabtree Thu-Mai Christian Amanda Gooch

A Java Tool for Creating ISO/FGDC Geographic Metadata

Evolving Infrastructure: Growth and Evolution of Spatial Portals

Report of the Ad Hoc Committee for Development of a Standardized Tool for Encoding Archival Finding Aids

Core Competencies for Visual Resources Management

Databases & Data Infrastructure. Kerstin Lehnert

Web 3.0 image search: a World First

Web NDL Authorities: Authority Data of the National Diet Library, Japan, as Linked Data

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories

DFS C Open Data Policy

Cataloguing is riding the waves of change Renate Beilharz Teacher Library and Information Studies Box Hill Institute

STANDARDS FOR AGENTS AND AGENT BASED SYSTEMS (FIPA)

Building Semantic Content Management Framework

Monitoring and Reporting Drafting Team Monitoring Indicators Justification Document

Object-Process Methodology as a basis for the Visual Semantic Web

Library metadata, whether in the form of MARC 21

COIN - Collaboration & Interoperability for Networked Enterprises Project N COIN Training Courses

Organisation data architecture with standards connections

SOA Success is Not a Matter of Luck

Integration of Polish National Bibliography within the repository platform for science and humanities

Comprehensive Exam Analysis: Student Performance

Building Your EDI Modernization Roadmap

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

EUR-Lex 2012 Data Extraction using Web Services

Transcription:

Tutorial 1: Basic Semantics Stuart A. Sutton University of Washington DC-2007, Singapore 27 August 2007

Tutorial schedule Tutorial 1: Basic Semantics (Stuart Sutton) Tutorial 2: Basic Syntaxes (Mikael Nilsson) Tutorial 3: Vocabularies (Alistair Miles) Tutorial 4: Application Profiles (Diane Hillmann) 2

What s covered in this tutorial An introduction to metadata Key features of the Dublin Core Dublin Core metadata in broader context Important aspects of the DCMI community 3

What do we mean by Dublin Core? 1. An international community interested in interoperable metadata (DCMI) 2. New ways of thinking about interoperable metadata DCMIAbstractModel Application Profiles 3. A core set of 15 metadata elements ISO Standard 15836-2003 (February 2003) NISO Standard Z39.85-2007 (May 2007) 4. A number of additional elements and element refinements beyond the core e.g., audience reflecting needs of different discourse and practice communities 5. All of the above! 4

Why did the Dublin Core come to be in 1995? Dramatic increase in the number of document-like resources on the net Little improvement in indexing services made resources hard to discover Belief that descriptive metadata would improve discovery Perceived need for a descriptive standard that was simple to apply (even by non-professionals) 5

An Introduction to Metadata Definition Types & Functions Metadata Building Blocks

What is metadata? Metadata consists of statements we make about resources to help us find, identify, use, manage, evaluate, and preserve them. Answers come from three traditions: Database Management Systems ( Schemas of relational databases ) Library Cataloging Traditions (MARC & AACR2) The World Wide Web (since the mid-1990 s) The context for Dublin Core 7

Types and functions of metadata Types of Metadata Administrative Descriptive Access/Use Preservation Technical/Structura l Relational Etc. Functions of Metadata Discover resources Mark content structure Identify versions Manage resources Certify authenticity Situate geospatially Control IP rights Indicate status Describe processes Etc.. 8

Metadata building blocks 1. The basic unit of metadata is a statement. 2. A statement consists of a property (element) and a value (i.e., a property/value pair) 3. Metadata statements describe resources. property resource describes statement creator DC2007 Tutorial creator = Stuart A. Sutton value Stuart A. Sutton 9

Properties and values in action? 245 00 $a Romeo and Juliet $h [videorecording] <title>gone with the wind</title> <type>movingimage</type> 10

Key Features of Dublin Core General Characteristics Dublin Core Principles

A note of caution before beginning Use of the word element e.g., Dublin Core Element Set An Element is a property of a resource. As intended here, "properties" are attributes of resources -- characteristics that a resource may "have", such as a Title, Publisher, or Subject. http://dublincore.org/usage/documents/principles/ (2003) It is not used in the XML sense of element as a unit of XML data, delimited by tags Element is frequently used interchangeably in the Dublin Core community with the word property 12

The DCMI Abstract Model DC community realized early that rational development and machine-processing requires a coherent data model DCMI Grammatical Principles http://dublincore.org/usage/documents/principles/ (2003) DCMI Abstract Model http://dublincore.org/documents/abstract-model/ (2005) Defines resources in terms of semantic relationships among classes, properties, and values Defines a model for DCMI descriptions, description sets, and records Serves as a foundation for future DCMI developments Serves as a conceptual model for metadata initiatives outside DCMI 13

Characteristics of (or abstractions from) Dublin Core metadata Syntax independent HTML/XHTML, XML, RDF/XML Flat, non-hierarchical element structure Extensible By refinement (sub-properties of existing properties) By combination (application profiles) Optional Repeatable Elements may occur in any order 14

Simple DC: Fifteen core elements (1996) Creator Title Subject Contributor Date Description Publisher Type Format Coverage Rights Relation Source Language Identifier ISO 15836-2003 NISO Z39.85-2007 15

Resources for which DC is often used DCMI Type Vocabulary http://dublincore.org/documents/2006/12/18/dcmi-terms/ Collection Dataset Event Image Interactive Resource Moving Image Physical Object Service Software Sound Still Image Test 16

DC elements and refinements since DC Simple (15 core elements) 2000 abstract coverage hasformat isversionof requires accessrights created haspart language rights accrualmethod creator hasversion license rightsholder accrualperiodicity date identifier mediator source accrualpolicy dateaccepted instructionalmethod medium spatial alternative datecopyrighted isformatof modified subject audience datesubmitted ispartof provenance tableofcontents available description isreferencedby publisher temporal bibliographiccitation educationlevel isreplacedby references title conformsto extent isrequiredby relation type contributor format issues replaces valid http://dublincore.org/documents/2006/12/18/dcmi-terms/ 17

Element refinements Element refinements narrow the meaning of DC elements medium refines format bibliographiccitation refines identifier tableofcontents refines description Refinements are properties just like the properties they refine they can stand alone <alternative>nine queens<alternative> <datecopyrighted>2000-07-11</datecopyrighted> NOT <title.alternative>nine queens</title.alternative> <date.datecopyrighted>2000-07-11</date.datecopyrighted> 18

Qualification of element values: encoding schemes Encoding schemes give context to element values Vocabulary encoding schemes Indicate that a value comes from a controlled vocabulary (e.g., that Spanish American literature is an LCSH term) Syntax encoding schemes Indicate that a string is formatted in a standard way (e.g., that 1956-11-12 follows ISO 8601) For the DC core elements, DCMI recommends using encoding schemes with coverage, date, format, language, subject, and type 19

Summary: Simple and qualified DC Simple DC: Varying definitions Only the original 15 elements, or All available elements, without encoding schemes or refinements In each case only making use of value strings Qualified DC Metadata that makes use of some or all the features of the DCMI Abstract Model Element Refinements Value Encoding Schemes 20

Dublin Core Principles Dumb-Down One-to-One Appropriate Values 21

Dumb-Down Simple DC does not use element refinements or encoding schemes and statements only contain value strings Qualified DC uses features of the DCMI Abstract Model, particularly element refinements and encoding schemes Dumbing-down is translating qualified DC to simple DC (property dumb-down and value dumb-down) 22

The One-to-One Principle Create one metadata description for one and only one resource Do not describe a digital image of the Mona Lisa as if it were the original painting Do not describe both a song and the song s composer in the same description Describe the composer and the work in two separate descriptions Group related descriptions into a description set (record) 23

Appropriate Values Use elements, element refinements and qualifiers to meet the needs of your local context, but... Remember that your metadata may be interpreted by machines and people, so... Consider whether the values you use will aid discovery outside your local context and... Make decisions about your local practices accordingly 24

DCMI Namespaces and Policies All DCMI metadata terms are given unique identity within three namespaces: http://purl.org/dc/elements/1.1/ - the legacy DC-15 http://purl.org/dc/terms/ - all other elements/qualifiers http://purl.org/dc/dcmitype/ - a Type vocabulary Example: http://purl.org/dc/elements/1.1/title Policies promote long-term stability of namespace URIs Changes not substantially semantic (i.e., corrections) will not result in change of namespace URIs 25

Dublin Core metadata in a broader context Metadata Creation and Distribution Application Profiles and Interoperability

Metadata creation and distribution models Federation Extensive specifications, standards, protocols, training Harvesting Basic agreements, reliance on best practices Gathering Automated indexing of content, algorithms yield results from search terms, less likely to use descriptive metadata per se 27

Harvesting model key features Integrating metadata from many sources calls for common element sets, record structures, and harvesting protocols Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) serves as a framework for sharing metadata and mandates simple DC as a common metadata format Harvesting promotes metadata reuse Best practices balance cost and interoperability Communities add value to basic infrastructure (more complex metadata, new uses for protocol) 28

Application profiles & interoperability Application profiles consist of data elements drawn from one or more namespace schemas combined together by implementers and optimised for a particular local application. http://www.ariadne.ac.uk/issue25/appprofiles/ Application profiles enable: Implementers to use DC metadata in conjunction with non-dc metadata Implementers to benefit from the experience of their peers Communities to harmonize metadata usage for greater interoperability 29

Important aspects of the DCMI community

Dublin Core grows and changes DCMI emphasizes open participation Conferences, Communities, Task Groups, and discussion lists DCMI element set evolves as implementers coin new terms and usage patterns emerge DCMI Usage Board reviews proposals for new metadata terms 31

Dublin Core Usage Board Considers proposals for new terms (elements, refinements, encoding schemes, DCMI Type Vocabulary terms) Evaluates proposals in light of the requirements of the DCMI Abstract Model Evaluates constructs that use DCMI terms, such as application profiles 32

Finding out more about DC DCMI Web Site http://dublincore.org Using Dublin Core http://dublincore.org/documents/usageguide/ Participating in a Community or Task Group http://dublincore.org/groups Ask a question! http://askdcmi.askvrd.org/ 33

Acknowledgement Many thanks to Diane Hillmann and Marty Kurth for sharing with me the slides from earlier versions of this tutorial. 34

Questions? Thank you for your attention! Stuart A. Sutton University of Washington sasutton@u.washington.edu 35