Building a master's degree on digital archiving and web archiving. Sara Aubry (IT department, BnF), Clément Oury (Legal Deposit department, BnF)


1 Building a master's degree on digital archiving and web archiving
Sara Aubry (IT department, BnF)
Clément Oury (Legal Deposit department, BnF)

2 Objectives of the presentation
> Present and discuss an experiment aimed at building an academic training course in web archiving
  > A master's degree on digital archives
  > With a large part dedicated to web archiving
  > As far as we know, the first master's degree of this type in France
> Present the framework, students, theoretical lessons and practical experiences
> Present and discuss the issues and difficulties of teaching and promoting web archiving in librarian or research courses

3 The framework: the digital archives master's degree
> Master's degree of the National Superior School for Information Sciences and Librarianship (Enssib)
  > Main librarian school in France
> One semester (October to January), 180 hours alternating theoretical lessons and practical work
> An internship to be done in the second half of the year
> Five learning units
  > Imagining the digital archive
  > Digital publishing
  > Records management
  > Preserving and archiving digital documents
  > Harvesting websites, referencing and accessing web archives

4 The students
> Profile in humanities and social sciences
  > History, philosophy, literature
> Expected jobs: digital humanities, digital curation / archiving, records management

5 Objectives of the learning unit on web archiving
> Offer a broad overview of web archiving goals, stakeholders, and methods
  > Not only focusing on national library experience
> Provide practical experience on how to:
  > Define and constitute a corpus
  > Crawl websites according to chosen parameters
  > Use and promote archived websites
> It was not an extensive training course on web archiving tools

6 The theoretical lessons
> Presenting a broad panorama of web archiving initiatives
  > Legal deposit and heritage purposes: BnF and INA
  > Research-driven archiving: Medialab of the French Institute for Political Sciences (Sciences Po)
  > Third-party archiving: Internet Memory Foundation
  > Nothing on corporate archiving for legal purposes
> Access and promotion of digital collections using semantic web
> 18 hours of theoretical lessons, 12 hours remaining for practical work

7 Practical experience: on the selection side
> Letting the students constitute a ready-for-harvesting corpus
  > Groups of 2 to 3 students
  > Corpus of manageable size: 20 to 40 URLs
> Choosing a subject or a theme
  > With a significant and specific presence on the web
  > Allowing the selection of very different websites (from an editorial point of view)
  > Related e.g. to a recent event or a local topic
> Defining
  > Seed URL
  > Relevant descriptors (specific and to be chosen by the group)
  > Harvesting parameters (depth and frequency)
> Filling a spreadsheet with this information
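
The spreadsheet described on this slide could look roughly like the sketch below. It is only an illustration: the column names (`seed_url`, `descriptors`, `depth`, `frequency`), the `SeedEntry` class and the example URLs are assumptions, not the template actually used in the course.

```python
import csv
import io
from dataclasses import dataclass, field
from typing import List


@dataclass
class SeedEntry:
    """One row of a hypothetical corpus spreadsheet."""
    seed_url: str                                          # starting point of the crawl
    descriptors: List[str] = field(default_factory=list)  # keywords chosen by the group
    depth: int = 1                                         # number of link levels to follow
    frequency: str = "once"                                # how often to re-harvest


def write_corpus(entries, out):
    """Write the corpus as CSV so it can be opened in any spreadsheet tool."""
    writer = csv.writer(out)
    writer.writerow(["seed_url", "descriptors", "depth", "frequency"])
    for entry in entries:
        writer.writerow([entry.seed_url, "; ".join(entry.descriptors),
                         entry.depth, entry.frequency])


# Placeholder seeds; a real corpus would hold 20 to 40 URLs.
corpus = [
    SeedEntry("http://example.org/", ["local news", "association"], depth=2, frequency="weekly"),
    SeedEntry("http://example.net/blog/", ["personal blog"], depth=1, frequency="monthly"),
]

buffer = io.StringIO()
write_corpus(corpus, buffer)
print(buffer.getvalue())
```

Keeping depth and frequency as explicit columns makes each group state its harvesting parameters per seed instead of leaving them implicit.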

10 Practical experience: on the harvesting side
> Review of web basics (HTML, URL, HTTP) and website architectures
> Presentation of the different web archiving methods
  > server side
  > transactional
  > client side
  > how does it work
  > advantages/drawbacks
> focus on web crawlers (exercising with HTTrack)
  > software companies, non profits, online services
> ARC and WARC formats (exercising with wget)
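
As an illustration of the wget exercise, the sketch below builds a wget command line that crawls a seed to a chosen depth and records the traffic in a WARC file. The options shown (`--recursive`, `--level`, `--page-requisites`, `--wait`, `--warc-file`) are standard wget options, but the seed URL, file name and parameter values are placeholders, and this is not necessarily the command used in class.

```python
import shlex
import subprocess


def build_wget_command(seed_url, warc_name, depth=1, wait_seconds=1):
    """Return a wget argument list that crawls seed_url and writes a WARC file."""
    return [
        "wget",
        "--recursive",                # follow links from the seed
        f"--level={depth}",           # harvesting depth
        "--page-requisites",          # also fetch images, CSS and scripts
        f"--wait={wait_seconds}",     # pause between requests, to be polite
        f"--warc-file={warc_name}",   # base name of the WARC output
        seed_url,
    ]


def run_crawl(command):
    """Run the crawl; requires wget to be installed and network access."""
    subprocess.run(command, check=True)


command = build_wget_command("http://example.org/", "example-crawl", depth=2)
print(shlex.join(command))  # inspect the command before running it
```

The crawl itself is deliberately not started here: calling `run_crawl(command)` would launch it.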

11 Practical experience: on the access side
> The harder part of the training?
  > We can hardly explain to students how to install a Wayback Machine!
> Viewing selected websites with IA's Wayback Machine (when possible)
> Using the Gephi tool to visualize interactions between websites
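
One possible way to prepare data for Gephi is sketched below: hyperlinks are extracted from a page, reduced to host names, and written as an edge list with `Source` and `Target` columns, which Gephi's spreadsheet import recognises. The `LinkCollector` class, the `site_edges` function and the sample HTML are hypothetical and do not describe the workflow used in the course.

```python
import csv
import io
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def site_edges(page_url, html):
    """Return sorted (source_host, target_host) pairs for links leaving the site."""
    collector = LinkCollector()
    collector.feed(html)
    source = urlparse(page_url).netloc
    edges = set()
    for href in collector.hrefs:
        target = urlparse(urljoin(page_url, href)).netloc
        if target and target != source:  # skip internal and non-web links
            edges.add((source, target))
    return sorted(edges)


# Placeholder page content standing in for an archived page.
sample_html = (
    '<a href="/about">About</a> '
    '<a href="http://example.net/page">Partner</a> '
    '<a href="http://example.com/">Other</a>'
)
edges = site_edges("http://example.org/index.html", sample_html)

out = io.StringIO()
writer = csv.writer(out)
writer.writerow(["Source", "Target"])  # column names expected for an edge table
writer.writerows(edges)
print(out.getvalue())
```

Working at host level rather than page level keeps the graph small enough to read for a corpus of 20 to 40 sites.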

14 Results and future directions
> Globally, positive feedback from students
> Working with local institutions to make the work even more concrete and to demonstrate its usefulness
> Working with the local library?
  > Enssib is located in Lyon
  > The main library of Lyon has a partnership with BnF for website selection and harvesting
  > It will get remote access to BnF web archives

Web Archiving at BnF September 2006

Web Archiving at BnF September 2006 Hosting the IIPC Steering Committee gives us the opportunity to give an update on BnF s organisation and projects. In 2006, we have been mostly focusing on building our own organisation and internal dissemination

More information

Analysis of Web Archives. Vinay Goel Senior Data Engineer

Analysis of Web Archives. Vinay Goel Senior Data Engineer Analysis of Web Archives Vinay Goel Senior Data Engineer Internet Archive Established in 1996 501(c)(3) non profit organization 20+ PB (compressed) of publicly accessible archival material Technology partner

More information

Archiving the Internet

Archiving the Internet Archiving the Internet American University History and New Media February 6, 2014! Abbie Grotke Library of Congress @agrotke abgr@loc.gov Why Preserve the Internet? Information published on the Web today

More information

PASIG San Diego March 13, 2015

PASIG San Diego March 13, 2015 Archive-It Archiving & Preserving Web Content PASIG San Diego March 13, 2015 We are a Digital Library Founded in 1996 by Brewster Kahle In 2007 of@icially designated a library by the state of California

More information

Collecting and Providing Access to Large Scale Archived Web Data. Helen Hockx-Yu Head of Web Archiving, British Library

Collecting and Providing Access to Large Scale Archived Web Data. Helen Hockx-Yu Head of Web Archiving, British Library Collecting and Providing Access to Large Scale Archived Web Data Helen Hockx-Yu Head of Web Archiving, British Library Web Archives key characteristics Snapshots of web resources, taken at given point

More information

INTRODUCTION. by Kristine Hanna Director of Archiving Services at the Internet Archive https://archive.org/about/bios.php

INTRODUCTION. by Kristine Hanna Director of Archiving Services at the Internet Archive https://archive.org/about/bios.php THEME 7 The Web Archiving Life Cycle Model by Kristine Hanna Director of Archiving Services at the Internet Archive https://archive.org/about/bios.php INTRODUCTION The technological tools for archiving

More information

Capturing the Web WEB ARCHIVING. Tools for the Capture of Digital Assets on Websites March 27, 2006 Kelly Eubank

Capturing the Web WEB ARCHIVING. Tools for the Capture of Digital Assets on Websites March 27, 2006 Kelly Eubank WEB ARCHIVING Tools for the Capture of Digital Assets on Websites March 27, 2006 Kelly Eubank Why Capture Websites? Websites now the primary way that North Carolina state agencies communicate with the

More information

Practical Options for Archiving Social Media

Practical Options for Archiving Social Media Practical Options for Archiving Social Media Content Summary for ALGIM Web-Symposium Presentation 03/05/11 Euan Cochrane, Senior Advisor, Digital Continuity, Archives New Zealand, The Department of Internal

More information

ALEX THURMAN. PCC Participants Meeting ALA Annual June 30, 2013. Columbia University Libraries

ALEX THURMAN. PCC Participants Meeting ALA Annual June 30, 2013. Columbia University Libraries ALEX THURMAN PCC Participants Meeting ALA Annual June 30, 2013 Columbia University Libraries Overview Web archiving context (Who) Benefits of curated web archives (Why) Columbia University Libraries Web

More information

How collaboration can save [more of] the web: recent progress in collaborative web archiving initiatives

How collaboration can save [more of] the web: recent progress in collaborative web archiving initiatives How collaboration can save [more of] the web: recent progress in collaborative web archiving initiatives Anna Perricci Columbia University Libraries METRO Conference 2014 January 15, 2014 Overview Web

More information

Scholarly Use of Web Archives

Scholarly Use of Web Archives Scholarly Use of Web Archives Helen Hockx-Yu Head of Web Archiving British Library 15 February 2013 Web Archiving initiatives worldwide http://en.wikipedia.org/wiki/file:map_of_web_archiving_initiatives_worldwide.png

More information

Michele Kimpton, Internet Archive. April 7, 2006. Written Response to Section 4, Section 108

Michele Kimpton, Internet Archive. April 7, 2006. Written Response to Section 4, Section 108 Michele Kimpton, Internet Archive April 7, 2006 Written Response to Section 4, Section 108 TOPIC 4: Given the ephemeral nature of websites and their importance in documenting the historical record, should

More information

The International Internet Preservation Consortium (IIPC)

The International Internet Preservation Consortium (IIPC) The International Internet Preservation Consortium (IIPC) The International Internet Preservation Consortium (IIPC) was formally chartered in Paris on July 24, 2003 with 12 participating institutions,

More information

Web Archiving Tools: An Overview

Web Archiving Tools: An Overview Web Archiving Tools: An Overview JISC, the DPC and the UK Web Archiving Consortium Workshop Missing links: the enduring web Helen Hockx-Yu Web Archiving Programme Manager July 2009 Shape of the Web: HTML

More information

How To Manage Pandora

How To Manage Pandora PANDORA - past, present, and future National web archiving in Australia Dr Paul Koerbin Manager Web Archiving National Library of Australia National Conference on eresources in Malaysia Penang, Malaysia,

More information

Tools for Web Archiving: The Java/Open Source Tools to Crawl, Access & Search the Web. NLA Gordon Mohr March 28, 2012

Tools for Web Archiving: The Java/Open Source Tools to Crawl, Access & Search the Web. NLA Gordon Mohr March 28, 2012 Tools for Web Archiving: The Java/Open Source Tools to Crawl, Access & Search the Web NLA Gordon Mohr March 28, 2012 Overview The tools: Heritrix crawler Wayback browse access Lucene/Hadoop utilities:

More information

Web Archiving and Scholarly Use of Web Archives

Web Archiving and Scholarly Use of Web Archives Web Archiving and Scholarly Use of Web Archives Helen Hockx-Yu Head of Web Archiving British Library 15 April 2013 Overview 1. Introduction 2. Access and usage: UK Web Archive 3. Scholarly feedback on

More information

Archiving the Social Web MARAC Spring 2013 Conference

Archiving the Social Web MARAC Spring 2013 Conference Archiving the Social Web MARAC Spring 2013 Conference April 2013 Lori Donovan Partner Specialist Internet Archive About Internet Archive We are a Digital Library Mission Statement: Universal access to

More information

Advanced Archive- It Applica2on Training: Archiving Social Networking and Social Media Sites

Advanced Archive- It Applica2on Training: Archiving Social Networking and Social Media Sites Advanced Archive- It Applica2on Training: Archiving Social Networking and Social Media Sites 1 Agenda Overview of Social Networking/Media sites Why archive these sites? Typical Challenges Best Prac2ces:

More information

How collaboration can save [more of] the web: recent progress in collaborative web archiving initiatives

How collaboration can save [more of] the web: recent progress in collaborative web archiving initiatives How collaboration can save [more of] the web: recent progress in collaborative web archiving initiatives Anna Perricci Columbia University Libraries Best Practices Exchange November 14, 2013 Overview Web

More information

Web and Twitter Archiving at the Library of Congress

Web and Twitter Archiving at the Library of Congress Web and Twitter Archiving at the Library of Congress Web Archive Globalization Workshop June 16, 2011 Nicholas Taylor (@nullhandle) Web Archiving Team Library of Congress why archive the web? preserve

More information

Report on the Crawl and Harvest of the Whole Australian Web Domain Undertaken during June and July 2005

Report on the Crawl and Harvest of the Whole Australian Web Domain Undertaken during June and July 2005 Report on the Crawl and Harvest of the Whole Australian Web Domain Undertaken during June and July 2005 Paul Koerbin 10 October 2005 1. Executive Summary In June and July 2005 the National Library of Australia

More information

Archiving the Web: the mass preservation challenge

Archiving the Web: the mass preservation challenge Archiving the Web: the mass preservation challenge Catherine Lupovici Chargée de Mission auprès du Directeur des Services et des Réseaux Bibliothèque nationale de France 1-, Koninklijke Bibliotheek, Den

More information

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data David Minor 1, Reagan Moore 2, Bing Zhu, Charles Cowart 4 1. (88)4-104 minor@sdsc.edu San Diego Supercomputer Center

More information

Investigating Hadoop for Large Spatiotemporal Processing Tasks

Investigating Hadoop for Large Spatiotemporal Processing Tasks Investigating Hadoop for Large Spatiotemporal Processing Tasks David Strohschein dstrohschein@cga.harvard.edu Stephen Mcdonald stephenmcdonald@cga.harvard.edu Benjamin Lewis blewis@cga.harvard.edu Weihe

More information

Indexing big data with Tika, Solr, and map-reduce

Indexing big data with Tika, Solr, and map-reduce Indexing big data with Tika, Solr, and map-reduce Scott Fisher, Erik Hetzner California Digital Library 8 February 2012 Scott Fisher, Erik Hetzner (CDL) Indexing big data 8 February 2012 1 / 19 Outline

More information

ACCESSING WEB ARCHIVES

ACCESSING WEB ARCHIVES ACCESSING WEB ARCHIVES There are two collections of archived websites available via Explore the British Library: The Legal Deposit UK Web Archive, archived through legal deposit regulations introduced

More information

Information and documentation Statistics and Quality Indicators for Web Archiving

Information and documentation Statistics and Quality Indicators for Web Archiving ISO 2012 All rights reserved ISO/TC 46/SC 8 N Date: 2012-09-28 ISO/DTR 14873 ISO/TC 46/SC 8/WG Secretariat: DIN Information and documentation Statistics and Quality Indicators for Web Archiving Information

More information

Archive-IT Services Andrea Mills Booksgroup Collections Specialist

Archive-IT Services Andrea Mills Booksgroup Collections Specialist Getting Started with Archive-IT Services Andrea Mills Booksgroup Collections Specialist Internet Archive Micro History Text Archive Update Archive-IT Services 1996 The Internet Archive is created, with

More information

Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collecting Composers' Websites

Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collecting Composers' Websites Contemporary Composers Web Archive (CCWA): Progress in Collaboratively Collecting Composers' Websites June 24, 2015 IAML/IMS Anna Perricci, Columbia University Laura Stokes, Brown University What is CCWA?

More information

How To Understand Web Archiving Metadata

How To Understand Web Archiving Metadata Web Archiving Metadata Prepared for RLG Working Group The following document attempts to clarify what metadata is involved in / required for web archiving. It examines: The source of metadata The object

More information

Collaborative Open Source Software Production & APIs. Tom Cramer Stanford University @tcramer

Collaborative Open Source Software Production & APIs. Tom Cramer Stanford University @tcramer Collaborative Open Source Software Production & APIs Tom Cramer Stanford University @tcramer Agenda Flavors of Open Source Software 3 Case Studies in Collaborative OSS The Power of APIs What about Web

More information

How To Use The Internet Archive For A Library Collection

How To Use The Internet Archive For A Library Collection The Role of Grey Literature in Academic Library Collections: Discovering, Capturing, Preservation, & Access Patti Sherbaniuk & Sean Luyk University of Alberta Libraries What We Will Cover Grey literature

More information

Creation of Focused Web Archives for Scientists

Creation of Focused Web Archives for Scientists Creation of Focused Web Archives for Scientists, Thomas Risse and Gerhard Gossen L3S Research Center, Hannover, Germany ALEXANDRIA Workshop 15 / 16 September 2014 Hannover 15.09.2014 1 Web Archiving Web

More information

Web Site Collection Plan. for. Michigan State University Archives & Historical Collections. May 21, 2015

Web Site Collection Plan. for. Michigan State University Archives & Historical Collections. May 21, 2015 Web Site Collection Plan for Michigan State University Archives & Historical Collections May 21, 2015 Prepared by: Ed Busch Michigan State University buschedw@msu.edu Contents Section 1. Overview, Mission,

More information

ischool 2-Year Course Plan Summer 2015-Summer 2016 College Park Campus = CP; Shady Grove Campus = SG; SGO = Online

ischool 2-Year Course Plan Summer 2015-Summer 2016 College Park Campus = CP; Shady Grove Campus = SG; SGO = Online INFM 600 Information Environments CP, SG CP, SGO CP, SG CP, SGO INFM 603 Information Technology and Organizational Context CP, SG CP CP, SG SG INFM 605 Users and Use Context CP, SG CP, SGO CP, SG CP INFM

More information

Growing a web archiving program: A case study for evolving an organization-management plan

Growing a web archiving program: A case study for evolving an organization-management plan Submitted on: 10/06/2015 Growing a web archiving program: A case study for evolving an organization-management plan Todd Suomela Digital Initiatives, University of Alberta, Edmonton, Canada todd.suomela@ualberta.ca

More information

Crisis, Tragedy, and Recovery Network Digital Library (CTRnet) + Web Archiving in Qatar and VT

Crisis, Tragedy, and Recovery Network Digital Library (CTRnet) + Web Archiving in Qatar and VT Crisis, Tragedy, and Recovery Network Digital Library (CTRnet) + Web Archiving in Qatar and VT Edward A. Fox, Seungwon Yang, & CTRnet Team Department of Computer Science, Virginia Tech Workshop at WADL

More information

Report on Integration Strategy, Testing Plan and Test-bed Architecture

Report on Integration Strategy, Testing Plan and Test-bed Architecture European Commission Seventh Framework Programme Call: FP7-ICT-2007-1, Activity: ICT-1-4.1 Contract No: 216267 Report on Integration Strategy, Testing Plan and Test-bed Architecture Deliverable No: D6.3

More information

Knovel. Leveraging Electronic Resources. - Barbara Dixee Director of Global Academic Sales bdixee@knovel.com

Knovel. Leveraging Electronic Resources. - Barbara Dixee Director of Global Academic Sales bdixee@knovel.com Knovel Leveraging Electronic Resources - Barbara Dixee Director of Global Academic Sales bdixee@knovel.com What Knovel s Academic Customers are saying Challenges Providing quality information resources

More information

WebArchiving@UNT. Current Quality Assurance Practices in Web Archiving. Prepared By. Brenda Reyes Ayala Brenda.Reyes@unt.edu

WebArchiving@UNT. Current Quality Assurance Practices in Web Archiving. Prepared By. Brenda Reyes Ayala Brenda.Reyes@unt.edu WebArchiving@UNT Current Quality Assurance Practices in Web Archiving Prepared By Brenda Reyes Ayala Brenda.Reyes@unt.edu Mark E. Phillips Mark.Phillips@unt.edu Lauren Ko Lauren.Ko@unt.edu August 19, 2014

More information

PhD in Information Studies Goals

PhD in Information Studies Goals PhD in Information Studies Goals The goals of the PhD Program in Information Studies are to produce highly qualified graduates for careers in research, teaching, and leadership in the field; to contribute

More information

Web Harvesting and Archiving

Web Harvesting and Archiving Web Harvesting and Archiving João Miranda Instituto Superior Técnico Technical University of Lisbon jmcmi@mega.ist.utl.pt Keywords: web harvesting, digital archiving, internet preservation Abstract. This

More information

Kris Carpenter Negulescu, Director The Internet Archive, Web Group

Kris Carpenter Negulescu, Director The Internet Archive, Web Group Opportunities for Global Cooperation & Collaboration in Web Archiving Kris Carpenter Negulescu, Director The Internet Archive, Web Group Agenda Welcome & Overview The Internet Archive (IA) The International

More information

Oh My Blawg! Who Will Save the Legal Blogs? *

Oh My Blawg! Who Will Save the Legal Blogs? * LAW LIBRARY JOURNAL Vol. 105:4 [2013-26] Oh My Blawg! Who Will Save the Legal Blogs? * Caroline Young ** Legal professionals continue to need access to legal blogs for their scholarly, historical, and

More information

WEB ARCHIVING AT SCALE

WEB ARCHIVING AT SCALE WEB ARCHIVING AT SCALE (updated 12/19/14) by Rosalie Lack, Stephen Abrams, Trisha Cruse THE VALUE OF WEB CONTENT Web content holds a critical place in modern library collections. Web archiving is an essential

More information

First crawling of the Slovenian National web domain *.si: pitfalls, obstacles and challenges

First crawling of the Slovenian National web domain *.si: pitfalls, obstacles and challenges Submitted on: 1.7.2015 First crawling of the Slovenian National web domain *.si: pitfalls, obstacles and challenges Matjaž Kragelj Head, Digital Library Development Department and Head, Information Technology

More information

How To Use The Web Curator Tool On A Pc Or Macbook Or Ipad (For Macbook)

How To Use The Web Curator Tool On A Pc Or Macbook Or Ipad (For Macbook) User Manual Version 1.4.1... April 2009 Contents Contents... 2 Introduction... 4 About the Web Curator Tool... 4 About this document... 4 Where to find more information... 4 System Overview... 5 Background...

More information

Last edited on 7/30/07. Copyright Syncfusion., Inc 2001 2007.

Last edited on 7/30/07. Copyright Syncfusion., Inc 2001 2007. Enabling ClickOnce deployment for applications that use the Syncfusion libraries... 2 Requirements... 2 Introduction... 2 Configuration... 2 Verify Dependencies... 4 Publish... 6 Test deployment... 8 Trust

More information

Masters in Information Technology

Masters in Information Technology Computer - Information Technology MSc & MPhil - 2015/6 - July 2015 Masters in Information Technology Programme Requirements Taught Element, and PG Diploma in Information Technology: 120 credits: IS5101

More information

SHared Access Research Ecosystem (SHARE)

SHared Access Research Ecosystem (SHARE) SHared Access Research Ecosystem (SHARE) June 7, 2013 DRAFT Association of American Universities (AAU) Association of Public and Land-grant Universities (APLU) Association of Research Libraries (ARL) This

More information

of the public interface service and will also act as the national aggregator for Europeana.

of the public interface service and will also act as the national aggregator for Europeana. The Finnish National Digital Library: National Library of Finland developing a national infrastructure in collaboration with libraries, archives and museums Kristiina Hormia-Poutanen Deputy National Librarian

More information

Creating a billion-scale searchable web archive. Daniel Gomes, Miguel Costa, David Cruz, João Miranda and Simão Fontes

Creating a billion-scale searchable web archive. Daniel Gomes, Miguel Costa, David Cruz, João Miranda and Simão Fontes Creating a billion-scale searchable web archive Daniel Gomes, Miguel Costa, David Cruz, João Miranda and Simão Fontes Web archiving initiatives are spreading around the world At least 6.6 PB were archived

More information

A Platform for Large-Scale Machine Learning on Web Design

A Platform for Large-Scale Machine Learning on Web Design A Platform for Large-Scale Machine Learning on Web Design Arvind Satyanarayan SAP Stanford Graduate Fellow Dept. of Computer Science Stanford University 353 Serra Mall Stanford, CA 94305 USA arvindsatya@cs.stanford.edu

More information

Bibliothèque numérique de l enssib

Bibliothèque numérique de l enssib Bibliothèque numérique de l enssib Turning the library inside out, 4 au 7 juillet 2006 35 e congrès LIBER Organizational charts in a selection of LIBER libraries: analysis of current trends Jouguelet,

More information

Webarchiving: Legal Deposit of Internet in Denmark. A Curatorial Perspective

Webarchiving: Legal Deposit of Internet in Denmark. A Curatorial Perspective MDR, Vol. 41, pp. 110 120, December 2012 Copyright by Walter de Gruyter Berlin Boston. DOI 10.1515/mir-2012-0018 Webarchiving: Legal Deposit of Internet in Denmark. A Curatorial Perspective Sabine Schostag

More information

How To Build A Cloud Storage System

How To Build A Cloud Storage System Reference Architectures for Digital Libraries Keith Rajecki Education Solutions Architect Sun Microsystems, Inc. 1 Agenda Challenges Digital Library Solution Architectures > Open Storage/Open Archive >

More information

Research Data Management Policy. Glasgow School of Art

Research Data Management Policy. Glasgow School of Art Research Data Management Policy Glasgow School of Art Version 1.4 Last revision April 2013 Responsibility Research Information Manager Department Learning Resources Relevant legislation Data Protection

More information

Assignment 1 Briefing Paper on the Pratt Archives Digitization Projects

Assignment 1 Briefing Paper on the Pratt Archives Digitization Projects Twila Rios Digital Preservation Spring 2012 Assignment 1 Briefing Paper on the Pratt Archives Digitization Projects The Pratt library digitization efforts actually encompass more than one project, including

More information

An Introduction to Heritrix

An Introduction to Heritrix An Introduction to Heritrix An open source archival quality web crawler Gordon Mohr, Michael Stack, Igor Ranitovic, Dan Avery and Michele Kimpton Internet Archive Web Team {gordon,stack,igor,dan,michele}@archive.org

More information

Building Australia s eresearch Capability: the challenge of data management. Adrian Burton and Margaret Henty

Building Australia s eresearch Capability: the challenge of data management. Adrian Burton and Margaret Henty Building Australia s eresearch Capability: the challenge of data management Adrian Burton and Margaret Henty 1 Outline The ANDS context What do we mean by building capabilities? What is the ANDS constituency?

More information

Optimizing your research data management

Optimizing your research data management RESSOURCES HUMAINES SERVICE DE FORMATION DU PERSONNEL EPFL RI RH-F Téléphone : +41 21 693 34 30 Bâtiment BI Fax : +41 21 341 31 58 Station 7 CH-1015 Lausanne Site web : http://sfp.epfl.ch Optimizing your

More information

Archiving the Web and Beyond: A Look at Twi8er and Facebook (and some other things too)

Archiving the Web and Beyond: A Look at Twi8er and Facebook (and some other things too) Archiving the Web and Beyond: A Look at Twi8er and Facebook (and some other things too) July 23, 2014 Benn Joseph Manuscript Librarian Northwestern University Library Outline of discussion Digital preservanon

More information

KEYSTONE - Short scientific report for STSM visit in TU Delft

KEYSTONE - Short scientific report for STSM visit in TU Delft KEYSTONE - Short scientific report for STSM visit in TU Delft Visitor: Georgia Kapitsaki Host: TU Delft 1. Introduction This report covers the activities performed during my STSM at Delft University of

More information

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02) Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we

More information

Progress Made and Lessons Learned through Collaborative Web Archiving Projects

Progress Made and Lessons Learned through Collaborative Web Archiving Projects Progress Made and Lessons Learned through Collaborative Web Archiving Projects Anna Perricci Columbia University Libraries Archive-It Partner Meeting 2014 November 18, 2014 Web Resources Archiving Collaboration

More information

THE WEB ARCHIVING LIFE CYCLE MODEL

THE WEB ARCHIVING LIFE CYCLE MODEL THE WEB ARCHIVING LIFE CYCLE MODEL The Archive-It Team Internet Archive March 2013 Principle authors: Molly Bragg Kristine Hanna Contributors: Lori Donovan Graham Hukill Anna Peterson Introduction 1 Introduction

More information

Dominican University School Library Media Program

Dominican University School Library Media Program Dominican University School Library Media Program Frequently Asked Questions 1. What credentials do I need to work as a school librarian in Illinois public schools? A school librarian in an Illinois public

More information

Course Syllabus Fall 2015. S652 Digital Libraries

Course Syllabus Fall 2015. S652 Digital Libraries Course Syllabus Fall 2015 S652 Digital Libraries Indiana University- Purdue University at Indianapolis (IUPUI) School of Informatics and Computing - Dept. of Library and Information Science From schools

More information

Legal Issues in Building Social Media Collections

Legal Issues in Building Social Media Collections Association of Research Libraries May 2011 Legal Issues in Building Social Media Collections http://www.flickr.com/x/t/0098009/photos/cobalt/204902316 Hope O Keeffe Library of Congress Office of the General

More information

Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project

Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project Ahmet Suerdem Istanbul Bilgi University; LSE Methodology Dept. Science in the media project is funded

More information

Progress Report Template -

Progress Report Template - Progress Report Template - Text in italics is explanatory and should be deleted in completed documents. Project Name Project Website Report compiled by CHARTER UNIVERSITY OF EXETER http://projects.exeter.ac.uk/charter/

More information

THE CODE FOR SUCCESS. school of television and digital media. television and digital archive management television and digital media production

THE CODE FOR SUCCESS. school of television and digital media. television and digital archive management television and digital media production school of television and digital media THE CODE FOR SUCCESS television and digital archive management television and digital media production creating the future of your past school of television and digital

More information

Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS.

Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS. National Digital Stewardship Residency - Boston Project Summaries 2015-16 Residency Harvard Library Preparing for a Trustworthy Repository Certification of Harvard Library s DRS. Harvard Library s Digital

More information

HP Service Manager Architecture and Security HP Software-as-a-Service

HP Service Manager Architecture and Security HP Software-as-a-Service HP Service Manager Architecture and Security HP Software-as-a-Service Introduction...2 Architecture...2 Infrastructure Setup...4 Security Setup...4 Customer Infrastructure Requirements...5 Introduction

More information
