How To Improve Mets 2.0.1.1



Similar documents
CCS Content Conversion Specialists. METS / ALTO introduction

11 ways to migrate Lotus Notes applications to SharePoint and Office 365

Chapter 10 Supplement

The Rutgers Workflow Management System. Workflow Management System Defined. The New Jersey Digital Highway

XML- New meta language in e-business

Functional Requirements for Digital Asset Management Project version /30/2006

How To Manage Your Digital Assets On A Computer Or Tablet Device

IML 400 Creative Coding for the Web

Summary Report of the PREMIS Implementation Fair

Best Practices for Structural Metadata Version 1 Yale University Library June 1, 2008

MPD Technical Webinar Transcript

September 2009 Cloud Storage for Cloud Computing

Current Page Location. Tips for Authors and Creators of Digital Content: Using your Institution's Repository: Using Version Control Software:

Dreamweaver CS3 THE MISSING MANUAL. David Sawyer McFarland. POGUE PRESS" O'REILLY 8 Beijing Cambridge Farnham Koln Paris Sebastopol Taipei Tokyo

Introduction to Open Atrium s workflow

Lightweight Data Integration using the WebComposition Data Grid Service

IT Academy Lesson Plan

MultiMimsy database extractions and OAI repositories at the Museum of London

Guidelines. The following guidelines are for companies who develop HTML design/creative and copy.

What Am I Looking At? Andy Kass

Digital Asset Management in Museums

IBM Unica emessage Version 8 Release 6 February 13, User's Guide

Scurlock Photographs Cataloging Analysis

SavvyDox Publishing Augmenting SharePoint and Office 365 Document Content Management Systems

Appendix A. Functional Requirements: Document Management

Overview of NDNP Technical Specifications

The MANTIS Framework Cyber-Threat Intelligence Mgmt. for CERTs Siemens AG All rights reserved

Overview of sharing and collaborating on Excel data

Review Manager Guide

Presentation / Interface 1.3

Authoring Within a Content Management System. The Content Management Story

Enduring Understandings: Web Page Design is a skill that grows and develops throughout the careful planning and study of software and design.

WHY DIGITAL ASSET MANAGEMENT? WHY ISLANDORA?

HCC ONLINE COURSE REVIEW RUBRIC

Program Advisory Committee (PAC) Agenda. December 14, :00am 3:00pm PST. Agenda Items:

This book does not contain affiliate links. Copyright 2009 Jodi Kaplan

MBooks: Google Books Online at the University of Michigan Library

Authoring Guide for Perception Version 3

Introduction to XML Applications

Terms and Definitions for CMS Administrators, Architects, and Developers

The Value of Taxonomy Management Research Results

LabArchives Electronic Lab Notebook:

Structured Data Capture (SDC) Trial Implementation

XML for Manufacturing Systems Integration

Introduction to Public Safety Project Management. Table of Contents. Introduction to Public Safety Project Management

XML and the College Website A Practical Look at the Use of XML and XSL

MadCap Software. Templates Guide. Flare 11.1

Guideline for Implementing the Universal Data Element Framework (UDEF)

Chapter 19: XML. Working with XML. About XML

How will the transition to the new Adobe Forums be handled?

Creating an EAD Finding Aid. Nicole Wilkins. SJSU School of Library and Information Science. Libr 281. Professor Mary Bolin.

Appendix 1: Adaptable Templates

Working With Templates in Web Publisher. Contributed by Paul O Mahony Developer Program

Installation and Administration Guide

Table of Contents. Introduction: 2. Settings: 6. Archive 9. Search Browse Schedule Archiving: 18

User Testing for Pre-Phase 2, Search 360 Janifer Holt and Barbara DeFelice

GENERAL PLATFORM CRITERIA. General Platform Criterion Assessment Question

Digital Asset Management. Content Control for Valuable Media Assets

Information Management

2015, André Melancia (Andy.PT) 1

BBBT Podcast Transcript

Adding Robust Digital Asset Management to Oracle s Storage Archive Manager (SAM)

Apache Tika for Enabling Metadata Interoperability

CREATING OPEN ACCESS JOURNALS WORKSHOP AT THE SECOND INTERNATIONAL PKP SCHOLARLY PUBLISHING CONFERENCE. Workshop Agenda. Planning for a new journal

15 Organisation/ICT/02/01/15 Back- up

San Joaquin County Office of Education Career & Technical Education Web Design ~ Course Outline CBEDS#: 4601

Tracking Campaigns with G-Lock Analytics

ABOUT THIS COURSE... 3 ABOUT THIS MANUAL... 4 LESSON 1: PERSONALIZING YOUR

General Platform Criterion Assessment Question

METADATA STANDARDS AND GUIDELINES RELEVANT TO DIGITAL AUDIO

The Intelligent Content Framework

Introducing Apache Pivot. Greg Brown, Todd Volkert 6/10/2010

DEVELOPING A VISUAL DIGITAL IMAGE COLLECTION. Calgary Collections 2005: The Changing Collections Environment CLA Pre-Conference June 15, 2005

Proposal for Website Design and Development Services: Digital Library Federation

Building, testing and deploying mobile apps with Jenkins & friends

ACE 2011 International

IBM Endpoint Manager Version 9.1. Patch Management for Red Hat Enterprise Linux User's Guide

Follow up Information. Caring for Digital Materials: Metadata, finding aids, and digital asset management. Danielle Cunniff Plumer, instructor

UBC Mailing List Replacement Project. Feedback Summary Survey & Focus Groups

INTRODUCTION TO WEB DESIGN WITH DREAMWEAVER

Web Design for Programmers. Brian Hogan NAPCS Slides and content 2008 Brian P. Hogan Do not reproduce in any form without permission

The Hepldesk and the CLIQ staff can offer further specific advice regarding course design upon request.

Standard Recommended Practice extensible Markup Language (XML) for the Interchange of Document Images and Related Metadata

General principles and architecture of Adlib and Adlib API. Petra Otten Manager Customer Support

Transcription:

METS Editorial Board Meeting Thursday, November 7, 2013 DLF Fall Forum AT&T Executive Education and Conference Center Austin, Texas Contents METS Editorial Board Meeting... 1 Minutes... 2 Present:... 2 On Phone:... 2 Board's Mission for the Upcoming Year... 2 Board's Internal Procedures and Mechanisms... 2 Membership/recruitment... 2 Chairmanship... 2 Teleconferences frequency/schedule/infrastructure... 3 Backup/mirror site for schema... 3 Github as a tool for wiki/documentation... 4 Formal adoption of 1.10 Schema changes... 5 Summary of changes... 5 Summary/Actions:... 5 Open Floor Other presentations; updates on other initiatives; continued work on topic from earlier discussion... 6 Documentation/Wiki... 6 ALTO and Structural Info... 6 METS 2.0 Presentation and Discussion (Tom)... 6 Discussion:... 7 Communication and Documentation Issues... 8 Goal: new XSLT, XSL-FO for METS Profiles Review of progress, next steps... 8 Action item:... 9 Open and Wrap up... 9

Next Year's Proposed Meeting in Europe... 9 Minutes Present: Tom Habing, Jukka Kervinen, Jean-Philippe Moreux, Betsy Post (recording), Brian Tingle, Nate Trail, Frederick Zarndt (guest) On Phone: Nancy Hoebelheinrich, Tobias Steinke, Robin Wendler Board's Mission for the Upcoming Year 1. METS 1.0 maintenance 2. Focus on documentation for 1.10 including updating Primer 3. Continue METS 2.0 data modeling exploration and development Board's Internal Procedures and Mechanisms Membership/recruitment Review new member recruitment procedures Put out a call for new members fill seats vacated by Patrick Yott and Sébastien Peyard Applications due by end of January; subcommittee makes recommendations to Board by end of February. In recruiting, consider importance of cross representation with other groups (Jukka, Nate, Jean-Philippe members of ALTO Board; Marcus/PREMIS; Betsy/MODS) Recruit broadly, putting out a call to METS, PREMIS, MODS, lists. Send out individual recruitment e-mails. Talk with vendors and implementer; look for people from museums. Eg. Singapore National Library, Smithsonian, Archives of American Art, ExLibris, Duraspace, U Wisconsin. Chairmanship No formal mechanisms around co-chairs consider new members.

Betsy and Tom will each serve another year Teleconferences frequency/schedule/infrastructure Six weeks is a good schedule. Betsy will schedule for next year. Like Ready Talk. Recordings after a call, delete the last call. Backup/mirror site for schema Discussion triggered by recent federal government shut down. Concerns (1) Availability of documentation and specification. IA had complete archive of LOC pages. (2) With LC servers down, direct link for schema validation caused error messages on ingest. Schema validation is vital function. Discussion Best practice is to maintain a local catalog of local schemas. Modern xml allows you to refer from local catalog and this is a best practice. LC wants to put schemas into GITHub, but uris would vary. Would still need to tell people what to put to in their own instances. Github doesn't want people to validate schemas directly from GitHub. For the long term LC is considering moving documentation and schemas off LC site so work load can be distributed. Internet Archive mirror site was used during shut down. Brian created a local catalog, but hasn't tested it -- OASIS test for catalogs should work for testing. Action Item Add something in our documentation indicating it's a best practice to make catalogs and a reference to a resource that shows how to make/implement a catalog.

Github as a tool for wiki/documentation Discussion: Is it easy to get stuff out of github? Yes you can clone but you are locked into wiki language. Just by cloning you've exported. But, how would you migrate issues? We don't know how they work internally. We think they are in a proprietary format. It looks like there is an API to grab issues so we may not be totally locked in. (Widely used alternative: JIRA issue tracker.) Our issues are in the wiki repo not against the schema. We should put schema issues with the schema. Goals: Documentation complete migration from social text. Get handle on github. Be happy we're not totally locked in. How we're using it now: 4 projects METS 2.0 wiki o They also have Github web pages. o You can set up c-names get metsstandards.org o We aren't using separate wiki for this METS Schema History o Tom will merge 1.10 branch back in once it is approved/finalized. Wiki project couldn't figure out a way to have a wiki without repo wiki project is where issues are. Terrry's schematron stuff. Issues Handling We still want community to send issues/change requests to the mailing list. One of us (Tom) will add issues to GITHub. Tom and Betsy will review open issues prior to each meeting.

When you do a commit you can also indicate that a particular issue should be closed. Formal adoption of 1.10 Schema changes Tom walked group through changes for 1.10. For change details see: https://github.com/mets/mets-schema/blob/version-1.10/mets.xsd Summary of changes Dropped copyright replaced with Creative Commons Added 'LIDO' as potential metadata scheme to MDTYPE enumeration Added xsd:anyattribute to top level elements (avoiding elements where this would create an xlink conflict) this will accommodate a lot of potential change requests a lot of change requests are around local practices. o Our documentation should provide best practices for the use of anyattribute. Carefully describe costs and benefits, especially what might be lost in interoperability when anyattribute is used. o Encourage people to make profiles. With anyattribute, we're allowing local practice, but not encouraging people to go overboard with it. o If there's an existing attribute use it, rather any anyattribute. o Encourage people to register their local attributes by registering profiles thereby gives interoperability to people who care about. This will also help the development of METS. We can add them to schema or address the need in 2.0 as appropriate. Summary/Actions: Update METS profile description on LC site noting you should document your use of anyattribute. Agreed to move forward with1.10 schema changes. Tom will publish version to github and test validation once more. Brian will locate his collection of sample METS documents that are included with profiles and send them out to board. Brian goal stability of v. 1 schema.

Open Floor Other presentations; updates on other initiatives; continued work on topic from earlier discussion Documentation/Wiki o Change SocialText link on LC site to github wiki. Add link on github wiki to SocialText. Make pdfs of old stuff on SocialText wiki. Goal for next year is to complete migration. (Betsy will talk to Leah) o Use of which mark up on wiki? People using different things. Does this matter? ALTO and Structural Info Developments at ALTO Board meeting: Need to allow for adding structural components in ALTO documents. For example, there is a need to point out elements such as headlines, etc. In most implementations this is handled by using METS and structmap. However, some implementations use ALTO without METS which can be a problem because ALTO doesn't understand document except on page level. Tools like FineReader don't create METS. Some of the ALTO changes might impact METS 2.0 o ALTO Board is proposing new section (similar to the styles section/element) s it would be called tags and would have label, description, title. o Textblock element would reference the tags by ID. In example, tag has an extension element ; example contains a MODS record. Discussion: Tom saw synergies between ALTO need for structural metadata and "Open Annotation Collaboration" as described on openannotation.org. Betsy raised possibility of re-use of some basic TEI tags. METS 2.0 Presentation and Discussion (Tom) Reviewed history or our METS 2.0 work, noting the Reimagining METS white paper (April 2011) and the affinity diagram (November 2011) PowerPoint Presentation and Review of Illustrative Document (see appendix A for Powerpoint)

Discussion: First bullet or PowerPoint (maintain core aspects of METS1) point is a fair summary Second bullet align ourselves more closely with PREMIS 3.0 (esp concept of intellectual entities). There are pros and cons: o there is an official PREMIS vocabulary and it is all posted to loc.gov o significant new thought is being given to PREMIS so if you adopt some if it then you are getting some of that stuff for free o What is missing in PREMIS that we need METS at all? Should we just add to PREMIS. o In any alignment with PREMIS we must look at restrictions in PREMIS that are confined to PREMIS. For example, PREMIS saying non-preservation uses out of scope (Robin) o For METS 2.0, more consideration should be given to interoperability from the get-go. Profiles are not workable solution for interoperability. Other schemas/standards are things starting to appear in multi-media space, so maybe range of use cases for METS narrows. o IIIF work may help us gain traction presentation system must do something with file format o Perhaps lobby PREMIS to allow us to embed METS stuff into schema. o Tom's concern of his mock up it is too expressive there are too many implementation decisions. o Nate put up principles and see who responds. Do this so we know what to people who care; so we know what to support and what not to support. o Point of caution: avoid trap of optimizing schema for humans who are looking at the xml document vs. optimizing for implementation. For example why is filegrp so nested. Why not just a flat inventory of files. o XML is inherently hierarchical which is limiting. We have hierarchies rather than full graphs. Property graphs considered. See for example: https://github.com/tinkerpop/gremlin/wiki/defining-a-property-graph o Brian: to some extent we are completing model and serialization.; step back to model. o Have we done in current years a survey of what features people are using? There definitely should be some sort of survey and we should also try to get as many sample documents as we can. We need verification of the goals. We also need to ask what features implementers are using and not using in METS. We should also ask what needs are not being met in current METS. How do we focus between modeling and technology? How do we broaden our technology base to include vendors and other folks who are doing large scale work. Broaden beyond research library group. Get specific feedback from people who are using METS at scale. An advisory group to work on 2.0 could include people not necessarily on board.

o Is 2.0 effort internally driven? Communication and Documentation Issues Goal: New edition of METS Primer, incorporating latest version of schema METS Profile Add new elements and attributes and examples added since 1.6 Correct errors identified since last release. Nancy has a list of errors or that needs to be changed. Some are in social text, some have been sent to Leah. Update tables at back One group could do writing Another group editing graphics that appear at the beginning of each section. They were done on a MAC in graphics. Master source is a word document. Process includes checking to language to make sure Primer and documentation in the schema use similar language Divvy up tasks and see who has time to do contribute Nancy will contact Leah about forming a group. Goal: new XSLT, XSL-FO for METS Profiles Review of progress, next steps Tom started doing a version 2 xslt for METS profiles. The 2.0 profiles are simply xml. There are not html or newer versions. It would be nice if there were a human friendly formatted version of the profiles. This might be a good task for a student learning xml. Recruit library school intern. We could ask DLF to sponsor a trip to its forum for a student who created a stylesheet for the new version of the profile. Or it could be an assignment in a library school class. Tom will look to see how far he got and we will check to see how far Jenn Riley got when she worked on this. Question: do we want pdf or xhtml? Existing standard transform doesn't show examples well. Look at Illinios examples, look at minutes to see what else was missing.

Action item: Take existing xslt style sheet and adopt for new version. Post volunteer opportunity on list. Betsy ask digital library and xslt instructors at Simmons. Perhaps Jerry could suggest a volunteer. Open and Wrap up Next Year's Proposed Meeting in Europe Nancy contacted Jose. If we are going to do a program (as opposed to a simple open meeting) we will need to submit a proposal. Nancy will spend some time looking at the METS 2.0 schema proposed by Tom as well as Brian's proposal about a graphing model. She will come up with questions. Related Standards to think about: ISAD(G): General International Standard Archival Description ISAIDG hierarchical description in archives is codified. Archive exchange Format something they're starting to talk about in science community. Nancy will send on link (http://www.openaxf.org/) Small group to work on 2.0. Nancy will keep an eye for deadline for proposals for joint JCDL and Theory and Practice of Digital Libraries conference. We'll see where we are at that point. Last year proposals were do march 23 rd and were extended until April. Betsy check with Marcus re space for Board meeting at BL. Or we could put in a proposal to conference organizers saying we'd like to have an open editorial board meeting. At an old JCDL we did a METS implementation fair. People did lightning talks about how they implemented METS.

Appendix A. METS 2.0 Ideas and Principles

METS Affinity Diagram http://linoit.com/groups/mets%20board/can vases/mets%20board

Guiding Principles Maintain Core Aspects of METS 1 Structured Inventory of Files (filesec and filegroup) Hierarchical Structures (structmap) Accommodate non-hierarchical structures (structlink) Links to different kinds of metadata Reuse PREMIS 3.0 entities as possible PREMIS File METS File PREMIS Representation METS Div or structmap PREMIS Bitstream METS Area or nested METS Files PREMIS Intellectual Entity METS Div or structmap

More Principles More PREMIS-isms No attributes in XML Schema Two-part identifiers Reuse PREMIS events, agents, and rights Simplify schema One Container Element <objectgroup> Do not hard-code controlled vocabularies Extension points throughout - <xs:any>, <xs:anyattributes>

More Principles? id.loc.gov for vocabs Not backward compatible More interoperable More constrained Digitized, Cultural Heritage Materials Disallow embedded binary Maintained explicit containers

objectinventory objectrelationships