Digital Preservation Workshop
1 Sept 2010, Simon Fraser University, Burnaby, BC
Presenters:
Glenn Dingwall, City of Vancouver Archives
Peter Van Garderen, Artefactual Systems
Evelyn McLellan, Artefactual Systems
Archivematica Workshop
10:00 Introductions
10:15 City of Vancouver Digital Archives
10:45 The Archivematica Project
11:15 Archivematica Preservation Planning
11:45 Discussion
12:00 Lunch
13:30 Archivematica demo/tutorial
15:00 Coffee break
15:15 Discussion & Archivematica demo/tutorial cont'd
The content in this presentation may be freely re-used under the terms of the Creative Commons Attribution-Non-Commercial-Share Alike 3.0 license.
Attribution:
Title: Archivematica: Digital Preservation Workshop
Creator: Peter Van Garderen, Artefactual Systems
Date: September 1, 2010
Artefactual Systems Inc. 2010
Peter Van Garderen, President / Systems Archivist
Jack Bates, Software Engineer
David Juhasz, Software Engineer
Austin Trask, Systems Engineer
Evelyn McLellan, Systems Archivist
Jesús García Crespo, Software Engineer
Joseph Perry, Software Engineer
open-source software for archives and libraries
digital preservation consulting services
http://artefactual.com
An integrated suite of free and open-source tools that allows users to process digital objects from ingest to access, in accordance with the ISO-OAIS model, while applying format-specific preservation policies.
ISO-OAIS OAIS Use Cases UML Activity Diagrams requirements Digital Archives software system
The Archivematica collaboration Artefactual Systems City of Vancouver Archives UNESCO Memory of the World International Monetary Fund Archives???
Archivematica is:
A classic Unix pipeline of OAIS micro-services, provided by a series of open-source tools and integration code written in Python and Bash.
Packaged as a virtual appliance that bundles the Xubuntu operating system. It can be run within a virtual machine, from a bootable USB or Live DVD, or as a bare-metal install on dedicated machines.
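The pipeline idea above can be illustrated with a minimal Python sketch. It is not Archivematica's actual code: the service names (`add_checksums`, `list_objects`), the `logs/` file names, and the `run_pipeline` helper are all hypothetical, chosen only to show how small single-purpose services can be chained so that each one hands the SIP directory to the next.

```python
import hashlib
import tempfile
from pathlib import Path
from typing import Callable, Iterable, List

# A micro-service takes the SIP directory and returns it for the next stage,
# loosely mirroring how Unix tools pass a stream down a pipe.
MicroService = Callable[[Path], Path]


def _objects(sip: Path) -> List[Path]:
    """Return the SIP's files in sorted order, skipping the logs directory."""
    logs = sip / "logs"
    return sorted(p for p in sip.rglob("*") if p.is_file() and logs not in p.parents)


def add_checksums(sip: Path) -> Path:
    """Hypothetical service: record one MD5 digest per object in logs/checksums.md5."""
    lines = [
        f"{hashlib.md5(p.read_bytes()).hexdigest()}  {p.relative_to(sip).as_posix()}"
        for p in _objects(sip)
    ]
    (sip / "logs").mkdir(exist_ok=True)
    (sip / "logs" / "checksums.md5").write_text("\n".join(lines) + "\n")
    return sip


def list_objects(sip: Path) -> Path:
    """Hypothetical service: write a plain inventory of object paths."""
    names = [p.relative_to(sip).as_posix() for p in _objects(sip)]
    (sip / "logs").mkdir(exist_ok=True)
    (sip / "logs" / "filelist.txt").write_text("\n".join(names) + "\n")
    return sip


def run_pipeline(sip: Path, services: Iterable[MicroService]) -> Path:
    """Run each service in order, feeding the result of one into the next."""
    for service in services:
        sip = service(sip)
    return sip


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        demo_sip = Path(tmp) / "example-sip"
        demo_sip.mkdir()
        (demo_sip / "report.txt").write_text("hello")
        run_pipeline(demo_sip, [add_checksums, list_objects])
        print((demo_sip / "logs" / "checksums.md5").read_text())
```

Because every service shares the same signature, a stage can be added, removed, or replaced by another tool without touching the rest of the chain, which is the main appeal of the pipeline design.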
http://archivematica.org/software
http://archivematica.org/docs ISO-OAIS OAIS Use Cases UML Activity Diagrams System Workflow Instructions requirements documentation
Diagram legend: manual step, automated step, file directory.

Workflow:
1. Producer places SIP in shared folder on host machine: [host]/sendsip/
2. SIP appears in shared folder in Archivematica: /1-receiveSIP/
3. Archivist copies SIP to SIP review folder: /2-reviewSIP/
4. Archivist reviews SIP: /2-reviewSIP/
5. Archivist adds descriptive metadata: /2-reviewSIP/
6. Archivist moves SIP to quarantine: /3-quarantineSIP/

Notes:
The producer places a folder of objects in a designated folder on his or her computer. This designated folder has been set up so that it automatically sends its contents to a shared folder in Archivematica.

The shared folder in Archivematica is 1-receiveSIP. When you are processing the SIP, leave the original copy in this folder as a backup in case you need to go back and start again.

Check the SIP to make sure it conforms to the Submission Agreement. If an MD5checksum.txt file is included, right-click it and select Verify MD5 Checksum. Otherwise, Archivematica will add checksums to the SIP logs directory and verify them at various times throughout the ingest process.

Open the SIP. Right-click and select Add Dublin Core XML from the drop-down menu. Right-click the dublincore.xml file to open it with Mousepad. Add descriptive information to the appropriate Dublin Core elements and save the file.
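The two review actions described above, verifying an MD5 manifest and adding a Dublin Core file, can be sketched in Python. This is an illustration under stated assumptions, not the code behind the Archivematica right-click menu items: the manifest is assumed to use md5sum-style lines (`<hex digest>  <relative path>`), the `dublincore` wrapper element is hypothetical, and both function names are invented for this example. The 15 element names are those of the Dublin Core Metadata Element Set.

```python
import hashlib
import xml.etree.ElementTree as ET
from pathlib import Path
from typing import Dict

DC_NS = "http://purl.org/dc/elements/1.1/"
DC_ELEMENTS = [
    "title", "creator", "subject", "description", "publisher",
    "contributor", "date", "type", "format", "identifier",
    "source", "language", "relation", "coverage", "rights",
]


def verify_md5_manifest(sip: Path, manifest_name: str = "MD5checksum.txt") -> Dict[str, bool]:
    """Return {relative path: matches?} for every entry in the manifest.

    Assumption: each line looks like '<hex digest>  <relative path>'.
    A listed file that is missing from the SIP is reported as False.
    """
    results = {}
    for line in (sip / manifest_name).read_text().splitlines():
        line = line.strip()
        if not line:
            continue
        digest, rel_path = line.split(maxsplit=1)
        rel_path = rel_path.lstrip("*")  # md5sum prefixes '*' in binary mode
        target = sip / rel_path
        actual = hashlib.md5(target.read_bytes()).hexdigest() if target.is_file() else None
        results[rel_path] = actual == digest.lower()
    return results


def write_dublin_core_skeleton(sip: Path) -> Path:
    """Create dublincore.xml with one empty element per Dublin Core term."""
    ET.register_namespace("dc", DC_NS)
    root = ET.Element("dublincore")  # hypothetical wrapper element
    for name in DC_ELEMENTS:
        ET.SubElement(root, f"{{{DC_NS}}}{name}")
    out = sip / "dublincore.xml"
    ET.ElementTree(root).write(str(out), encoding="utf-8", xml_declaration=True)
    return out
```

Calling `verify_md5_manifest(Path("/2-reviewSIP/my-sip"))` (a hypothetical SIP name) would return a mapping the archivist can scan for any `False` entries before moving the SIP on to quarantine.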
Agile Software Development
Time-based system releases:
Feb 2009: Release 0.1-alpha
May 2010: Release 0.6-alpha
November 2010: Release 0.7-alpha
Each iteration leads to updated and improved:
Requirements
Software
Documentation
Development resources
Release 0.7-alpha
Repository Exchange Package (RXP) specification
PREMIS, METS, BagIt
Web Dashboard interface
Multi-threaded processing workflows
Ubuntu Launchpad
Debian packages
Free Beer!
They'll never take our freedom
Free Software
1. The freedom to run the program for any purpose.
2. The freedom to study how the program works, and adapt it to your own needs, meaning that easy access to the source code must be provided.
3. The freedom to redistribute copies to help friends, family, colleagues or society in general.
4. The freedom to improve the program, and release your own improvements to the public, so that the whole community benefits. Again, easy access to the source code is a precondition for this.
Free Puppy!
The open-source eco-system (diagram)
Open Source Software: Code, Knowledge, Community
Users:
Lead institutions: Funding, Development
All users: Bug reports, Enhancement requests, Code patches, Documentation, Promotion
Foundation or Steering Committee: Governance, Coordination, Funding, Promotion
Service Providers: Development, Technical Support, Hosting, Training, Promotion
Contributions flowing between the groups and the software: Code, Time, Money, Knowledge
http://archivematica.org peter@artefactual.com