Digital Preservation and Access - The 5 Opens Project

Less Code, More Product: Leveraging Open Source Technologies to Develop Digital Library Collections
Robert Cartolano, Stephen Davis, Carole Ann Fabian
Columbia University
April 13, 2015

Digital Preservation & Access
Digital preservation is the active management of digital content over time to ensure ongoing access. (Library of Congress)
Digital preservation is a challenge for all of society because we all benefit from reliable, authentic information now and into the future. Done well, all of society will reap the benefits of digital stewardship. (Blue Ribbon Task Force on Digital Preservation and Access, 2010)

Distributed Preservation & Access, Radical Collaboration
"Long-term preservation of digital information on a scale adequate for the demands of future research and scholarship will require a deep infrastructure capable of supporting a distributed system of digital archives." (Preserving Digital Information: Report of the Task Force on Archiving of Digital Information, 1996)
"Conditions in technology, economics, and expanding service requirements demand a more radical approach to collaboration." ("Advancing From Kumbaya to Radical Collaboration: Redefining the Future Research Library", Neal, J., Journal of Library Administration, 2011)

Strategic Challenges
Organization: local and distributed
Technical Infrastructure: durable, interoperable, flexible
Collections: collect, aggregate, manage, discover and deliver, over time

Approach
Close Partnerships: develop and sustain technology platforms over time
Strategic Investment: local and shared staff, infrastructure, services

The Five Opens
Open Code: sustainable platforms
Open Data Standards: manage over time
Open API/Protocols: foster interoperability
Open Communities: go further together
Open License: share knowledge broadly
Based on Mara Hancock's Three Opens

Open Fosters Preservation & Access
Digital preservation and access are greatly fostered by adopting the Five Opens and partnering to develop local and distributed systems.

Key Benefits of Open
Less Code: reduce local efforts, leverage community development
More Product: build better services for today
Contribute: local development, support entire community over time
Work together: solve hard problems (e.g. interoperability, migration)

Columbia Strategic Investments
Fedora: durable object repository
Blacklight: common discovery layer
Hydra: building, sharing apps & tools
Multiple data centers
Columbia private network
Risk-averse vendor technologies

Implementing. Less Code MORE PRODUCT!

Columbia Digital Collections Scope
Digitized special & archival collections
Born-digital content
Institutional repository content
Research data sets
Numeric data files
Learning objects (to come)

Some Pre-2014 Standalone Special Collections Projects

But under the surface

Challenges
Unique, custom code for each project
Deprecated and unsupported software versions
No longer best practices
Passé state of the art
Divergent metadata and content models
Maintenance and security problems

Consolidated Digital Collections Infrastructure

Benefits
Common code base, release versions
Easier software upgrades and bug fixes
Faster implementation of new projects
New features shared across projects
Better asset management
Easier metadata curation
Ability to benefit from, and contribute to, other institutions' work

Less Code
Result of moving Columbia DL applications from custom code to the Hydra framework:
ca. 10,000 lines of local code removed so far
And many tens of thousands more as we retire legacy sites

More Product
Sample DL project development time, 1.5 FTE developers: 5-6 weeks
New DL project development time for comparable project, 1.5 FTE developers: 2-3 days
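As a rough reading of the two development-time figures above, the sketch below converts both ranges to working days and computes the implied speed-up. The five-day working week and the helper name `speedup_range` are assumptions made for illustration; they are not figures or terms from the presentation.

```python
# Rough speed-up implied by "5-6 weeks" versus "2-3 days".
# ASSUMPTION: one week = 5 working days (not stated on the slide).
WORKING_DAYS_PER_WEEK = 5


def speedup_range(before_weeks, after_days, days_per_week=WORKING_DAYS_PER_WEEK):
    """Return (lowest, highest) speed-up factor for two (min, max) ranges."""
    before_lo = before_weeks[0] * days_per_week
    before_hi = before_weeks[1] * days_per_week
    after_lo, after_hi = after_days
    # Most conservative case: shortest "before" against longest "after".
    # Most favourable case: longest "before" against shortest "after".
    return before_lo / after_hi, before_hi / after_lo


low, high = speedup_range(before_weeks=(5, 6), after_days=(2, 3))
print(f"{low:.1f}x to {high:.1f}x")  # prints "8.3x to 15.0x"
```

Under that assumption, the same 1.5 FTE team delivers a comparable project roughly 8 to 15 times faster.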

Staff Collection Viewer, 300,000+ items

dlc.library.columbia.edu

dlc.library.columbia.edu/ifp

dlc.library.columbia.edu/durst

Less Code, More Graph

SEYMOUR B. DURST OLD YORK LIBRARY
"I don't collect books out of nostalgia, but out of a love of information and history" (Seymour B. Durst)

The Collection
Book-like materials
  Books & Pamphlets: 8,676
  Landbooks: 30
  Serials: 519
Visual materials
  Photographs: 3,000
  Thomas Nast Collection: 1,200
  Lantern & 35mm slides: 75
  Postcards: 22,594
Maps
  Oversize maps: 450
  Bound & folded maps, and guides: 150
Ephemeral materials
  Brochures, programs, menus, etc.: 1,091
TOTAL OBJECTS: 36,592

Durst project goals
Support planning
Add capacity
Implement projects to ensure broad access:
  Catalog & house the collection
  Digitize the public domain content
  Provide digital access

Support planning

Add capacity

Project: data development
9,839 bibliographic records
  3,151 copy
  6,688 original
23,780 metadata records
  23,780 geo-location
  15,153 subject headings
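The bibliographic record counts above can be cross-checked with a few lines of Python. This is only a sketch: the variable names are hypothetical, and the figures are the slide's own.

```python
# Cross-check: copy-cataloged + originally cataloged records should equal
# the reported number of bibliographic records (figures from the slide).
copy_records = 3_151
original_records = 6_688
reported_total = 9_839

assert copy_records + original_records == reported_total

# Share of each cataloging type in the bibliographic total.
for label, count in (("copy", copy_records), ("original", original_records)):
    print(f"{label}: {count / reported_total:.1%}")
```

The check passes, and it shows that about two thirds of the bibliographic records required original cataloging.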

Project: digitization
Acceptable copy: 1,559
Internet Archive: 2,997
Backstage: 843
CUL-PRD: 368
Avery contract photographer: 87
CUL Project digitized: 5,852
CUNY digitized: 14,559
TOTAL digitized objects: 20,421
[Chart legend: Acceptable Copy, Internet Archive, Avery, CUL Book-Like, CUL Flats, Backstage]

Project: ensure broad access

Discovery pathways

Search result

Single record display Avery Architectural & Fine Arts Library

Image palette: Pan/Zoom, Full Screen, Data Display, Citation

Access

Related content

Place-Based Discovery
Biggert Architectural Vignettes
New York Real Estate Brochure Collection
Seymour B. Durst Old York Library
Built Works Registry

locational data

Discovery: mapping interface

Discovery: neighborhood facet

Course integration
History of Real Estate Development in New York City, GSAPP RED program
Summer session: 2012, 2013, 2014
~100 grad students each year
2014 summer course interactions:
  basic instruction session (90 students)
  reference interaction (67 students)
  research consultation (35 students)

Symposia
GSAPP CURE: The Center for Urban Real Estate
Megaprojects: An Urban Planning and Development Conference (May 11, 2012)
Mind the Gap: Transit Lessons from New York and London (April 11, 2013)
From Port to People: Reinventing Urban Waterfronts (April 25, 2014)

Publications

Q & A

References
"Advancing from Kumbaya to Radical Collaboration: Redefining the Future Research Library", Neal, James, Journal of Library Administration, 1/2011, http://hdl.handle.net/10022/ac:p:20919
Blue Ribbon Task Force on Digital Preservation and Access, 2/2010, http://brtf.sdsc.edu/biblio/BRTF_Final_Report.pdf
"Challenges and Opportunities of Open Source in Higher Education", Fuchs, Ira, The Tower and the Cloud, Educause, 2008, pp. 150-157, http://net.educause.edu/ir/library/pdf/pub7202.pdf
"Definitions of Digital Preservation", ALA, 6/2007, http://www.ala.org/alcts/resources/preserv/defdigpres0408
"The Open-Source Movement", Warger, Thomas A., Educause Review, 1/1/2002, http://www.educause.edu/ero/article/open-source-movement