DRAFT FOR DISCUSSION WITH ADVISORY COUNCIL MEMBERS ONLY The challenges of digital preservation to support research in the digital age Lynne Brindley CEO, The British Library November 2005
Agenda UK developments British Library initiatives E-infrastructure development E-legal deposit E-journals pilot Web archiving Digital preservation projects Digital archiving services Conclusions
UK developments Science & Technology Select Committee Inquiry into Scientific Publications Research Councils UK (RCUK) policy on the publication of and access to research outputs Office of Science and Technology (OST) e-infrastructure group assessing current state / future requirements of the UK s e- infrastructure Joint Information Systems Committee (JISC) supporting projects and programmes to test open access models Debate of balance between public / social good and legitimate commercial interests
British Library initiatives Our priority is ensuring long-term access to research materials regardless of format or publishing model. We working with UK and international partners from the public and private sectors to address the challenges of perpetual preservation and access. Key initiatives: E-infrastructure development E-legal deposit E-journals pilot Web archiving Digital preservation projects Digital archiving services
E-Infrastructure development Digital Object Management Programme (DOM) Mission To enable the United Kingdom to preserve and use its digital intellectual property forever Vision To create a management system for digital objects that will: store and preserve any type of digital material in perpetuity provide access to this material to users with appropriate permissions ensure that the material is easy to find ensure that users can view the material with contemporary applications ensure that users can, where possible, experience material with the original look-and-feel
DOM Programme Scope We already have a wide range of materials to deal with Existing voluntary deposit scheme, operational since 2000 (1.5 TB) Digitised versions of BL material, from early 90s onwards (15 to 20 TB) Electronic journals (1 TB) New digitisation initiatives: newspapers, sound, etc Sound Archive material (150 TB, growing at 30 TB per year) Web archiving, Cartographic data, Picture library, Purchased and donated digital materials We need a system that can manage the materials we will receive through the extension of Legal deposit legislation for non-print material Our storage planning figure is 300 TB after 5 years.
The Digital Object Management System Resource Discovery ACCESS Delivery Preservation Digital Rights Management STORAGE Ingest Shared services Signing Authentication Metadata Persistent ID WEB ARCHIVING Publishers Archives Donations BORN DIGITAL Legal Deposit Purchases Grey Literature DIGITAL FROM ANALOGUE BL digitisation Sound Archive Scholarly projects, e.g IDP
E-legal deposit defining the legislation Legal Deposit Libraries Act 2003 Provides for deposit of electronic publications Secondary legislation in the form of Regulations is needed in order to implement the Act A period of around 12 months will be needed for a Regulation to proceed from exact formulation to Parliamentary approval Legal Deposit Advisory Panel Established to advise the Secretary of State on the timing and content of Regulations Panel consists of a Chair and 14 members, from legal deposit libraries, publishing industry and independent fields Initial focus: offline publications, e-journals, Web archiving, definition of UK territoriality, and overall workplan and procedures Alongside this, publishers and libraries will also continue to work together via the JCLD
E-legal deposit e-journals pilot Aims of the e- journals pilot To test the technical infrastructure, mechanisms and procedures relating to the deposit, storage and preservation of electronic journals To highlight any interface problems, facilitating their early resolution To produce a fully operational and scaleable means for the deposit of e-journals, which can then support access models as developed and agreed subsequently Pilot will run for up to 12 months from June 2005 No end-user access during the pilot Who is participating Pilot developed by a working group under Joint Committee on Legal Deposit Publishers have volunteered (via their trade associations) a sample of e-journals offering diversity of subjects and formats 23 publishers participating, offering over 200 journal titles
Web Archiving - structure of BL s programme The BL s Web Archiving Programme is a collaborative initiative, implemented across two consortia UK Web Archiving Consortium Developing a selective approach to web archiving, procuring a common web archiving infrastructure and software to begin archiving activities at the earliest - www.webarchive.org.uk/ International Internet Preservation Consortium Developing advanced web archiving technologies for the long terms, large scale, continuous crawling requirements enabled through legislation
Digital preservation research projects Lifecycle information for e- literature Project will explore and develop a lifecycle approach to costing digital archiving for e-journals Project outcomes are intended to answer key questions for HE & FE such as: What are the long-term costs of preserving digital material? Who will do it? What are the costs for a library in HE/FE to partner with another institution to carry out long-term archiving? What are the relative risks of digital vs. paper archiving? PLANETS EU FP6 Call 5 Proposal on Digital Preservation Project will design and develop a sustainable framework to enable long-term preservation of digital text and images, digital audio and dynamic data sets Long-term access to the cultural and scientific heritage of Europe Co-ordinated by BL in partnership with other European national libraries and archives, Universities and commercial organisations
Digital Archiving: the British Library s proposition The British Library is committed to delivering a long-term digital architecture which will preserve the UK s electronic content in perpetuity with a key focus on legal deposit material The British Library is also working on services and business models to ensure that electronic content is also available to subscribers in the event that publisher content becoming unavailable (e.g. through system failure or company failure). This will give publishers and libraries far more confidence in the switch to digital-only content The British Library already provides backup services of this kind. Our electronic storage system already contains back-up copies of ten of the largest STM publishers and we are working with other publishers to obtain more content
A possible digital archiving service model Negotiated with publishers. Could include light & dark archiving, DRMbased access Level 3 additional publisher services Level 2 back-up & access service Negotiated with publishers. Aim for key publishers of UK & US content Level 1 All UK published content under legal deposit archiving for long-term preservation Access / Service Level 1: Under UK legal deposit regulations. Likely to be legal deposit library reading room use only. Level 2: Negotiated rights. Likely to be backup guarantees to publishers and rights to use for services (with payment) Level 3:Negotiated with publishers. Light & dark archiving, DRM-based access.
Conclusions The British Library has a statutory, UK-wide remit across all disciplines for the long-term preservation & access of both print & digital materials As such, our priority is ensuring long-term access to research materials regardless of format or publishing model We are committed to developing a robust infrastructure to support the perpetual preservation of the UK s published output but this is a complex and costly challenge In addition we are developing our current capabilities so that we are able to offer international archiving services to publishers We need to work in partnership to develop solutions to the challenges of long-term preservation We need to find alternative business models and funding sources for ongoing support and maintenance of the archive we cannot rely on public funding only