Managing Web Archiving in Australia A Case Study

Size: px
Start display at page:

Download "Managing Web Archiving in Australia A Case Study"

Transcription

1 Managing Web Archiving in Australia A Case Study Paul Koerbin Digital Archiving Branch National Library of Australia Canberra, Australia pkoerbin@nla.gov.au Abstract. This paper describes the workflows and processes undertaken by the National Library of Australia (NLA) in archiving web resources for PANDORA: Australia s Web Archive. It considers the workflows and processes with regard to how they are supported by the functionality of the web archiving management system developed by the NLA, the PANDORA Digital Archiving System (PANDAS). It includes a summary of the Australian approach to web archiving and an overview of the PANDAS system architecture. Australian approach to web archiving The function of the National Library of Australia as stated in the National Library Act is to maintain and develop a national collection of library material, including a comprehensive collection of library material relating to Australia and the Australian people; and, moreover, to make this material available. With the advent of the Internet as a major medium for the publication of such material, the Library, as the primary institution in Australia charged with the responsibility for preserving Australia s documentary heritage, established the PANDORA Archive as a means to collect and manage online resources. Currently in Australia, in the federal jurisdiction, there is no legal deposit provision covering electronic publications, so the National Library must seek the explicit permission of publishers to harvest the online resources and to make the archived copies publicly accessible from the PANDORA Archive. 1

2 2 Paul Koerbin The National Library of Australia established the PANDORA Archive in The first tasks involved determining what would be included in the Archive and the drafting of selection guidelines while harvesting began in early Selective approach The National Library of Australia adopted a selective approach to archiving Australian web resources. All agencies that are responsible for archiving web resources are faced with the decision as to what archiving approach to take: whether to attempt whole or substantial domain archiving, or to be selective. The decision the National Library made was largely pragmatic and not for reasons that discredit the larger domain archiving options indeed a combination of approaches is most likely to achieve the best overall results. The selective approach has allowed the Library to make the best use of very limited resources. It has also fostered a get in and do it attitude which in turn provides practical insight into web archiving issues and incentive to deal with the specific problems encountered. This is not to ignore the problems and shortcomings associated with selective archiving, which I will briefly mention in the concluding remarks of this paper. The development of an archiving system The development of PANDORA 3 as a web archiving system commenced as a proof-of-concept project; that is, as a practical rather than theoretical undertaking. This meant that practical procedures involved in archive building were adopted as they became possible at the same time as the technical infrastructure was developed. So, for example, the selection of sites for archiving was begun before archiving could commence. Initially harvesting was done using the Harvest 4 software developed by the University of Colorado. Harvest was an indexing software and was modified by the National Library for archiving purposes. A rudimentary user interface, known as Pantrack, was developed as a sub-system to Harvest to allow PANDORA staff to submit and schedule archiving request and log archiving problems. In order to manage administrative metadata, an Access database was subsequently created which operated independently of the archiving system and Pantrack. The lack of integration of the system was further compounded by the adoption of desktop harvesting robots 2 The earliest archivings appear in the Archive with instance dates indicating July 1997, the date of their importation into the archiving system. In fact, some were harvested in the preceding months. 3 The name PANDORA is commonly used as the name for the National Library of Australia s web archiving initiative, as in PANDORA : Australia s Web Archive, see: However, in respect of the system architecture that is discussed in this paper, PANDORA also refers to the specific system component that provides the public interface to the archived resources. 4 For some information about Harvest, see: artz.harvest.html

3 Managing Web Archiving in Australia 3 (initially the offline browsing software WebZIP 5 then HTTrack 6 ). The adoption of desktop archiving was made necessary by the increasing prevalence of JavaScript which the Harvest HTML parsing program could not handle. While recognising that this approach is piecemeal, it also achieved results in that there are archived instances dating back to early 1997 in the PANDORA Archive. What was understood from this process was the need for an integrated web-based management system. This need was made all the more urgent by the move to a distributed model for the responsibility for web archiving in Australia which saw the engagement of other interested organisations as PANDORA partners. 7 Partnerships distributed responsibilities The National Library of Australia has a role in leading and providing direction to the Australian library community. While it was natural that the Library initiate and lead the development of web archiving it was also important to engage other institutions in the process in order to make use of local identification and selection knowledge and share the workload and responsibility. The State Library of Victoria was the first to formally become a PANDORA partner followed by ScreenSound Australia, Australia s National Film and Sound Archive. As of today, all mainland state libraries, that is, in addition to the State Library of Victoria, the State Libraries of New South Wales, Queensland, South Australia and Western Australia, together with the Northern Territory Library and Information Service are PANDORA partners. In addition, the Australian War Memorial and the Australian Institute of Aboriginal and Torres Strait Islander Studies have now joined as PANDORA partners. The State Library of Tasmania, it should be noted, has for many years developed its own web archiving system known as Our Digital Island 8. The National Library is still responsible for more than 57% of the archived titles and more than 66% of the archived instances in PANDORA 9. PANDORA today statistics and content As of August 2004 an indicative size of the PANDORA Archive can be represented by the following figures. These figures only refer to the Archive s access display copy, an uncompressed copy of the Archive content maintained on the PANDORA web server for public access through the PANDORA home page. As For more background on the early development of PANDORA and PANDAS see, Cathro, Warwick. Archiving the Web: the PANDORA Archive at the National Library of Australia. (2001) For a brief and more recent overview of PANDORA web archiving activity see Phillips, Margaret. PANDORA, Australia s Web Archive, and the Digital Archiving System that Supports It. (2003) Based on figures as at June 2004.

4 4 Paul Koerbin noted below, the Archive actually contains two preservation copies of each archived instance together with associated metadata including a metadata master or shadow copy for each archive instance 10. Number of titles: 6,608 Number of instances: 13,165 Number of files: 21,117,595 Size in gigabytes: In regard to what is archived that is the content of the Archive this is dependent upon the selection decisions and objectives of the contributing partners. However, the content selected is not constrained by technical format, nor by any narrow definition of what an online publication may be 11. Therefore, the content of the PANDORA Archive the entities referred to as titles in the above statistics ranges from simple print like publications produced as single PDF files to complete complex web sites incorporating multimedia content (and everything in between). So too, a part of web site for example the pages of a government web site containing the speeches of the minister or officials may be selected as a PANDORA title just as readily as a single document-like title or a whole web site domain. As such, the content is not easily or simply characterised. The content reflects the combined selection objectives and activities of the PANDORA partner agencies activities and objectives which do indeed differ between the contributing partners and may differ over time within any one agency depending on such factors as available resources and identified priorities. PANDAS The PANDORA Digital Archiving System, known as PANDAS, was developed in-house following an unsuccessful attempt to find an off-the-shelf system (or systems) to provide an integrated, web-based web archiving management system. The need for such a system was evident as the scale of the Library s archiving activity increased and if the best possible efficiencies were to be achieved in building a collaborative, selective and quality assessed web archive. PANDAS was first implemented in June 2001 and a second much enhanced version was released in August The second version was more modular in design to facilitate future enhancement by allowing development to component parts of the system. Consequently the current development program includes a number of 10 For an explanation of the various preservation master and access copies maintained in the Archive, see below in the section headed Archiving. 11 For the definition of publication used by the National Library of Australia in regard to PANDORA, see

5 Managing Web Archiving in Australia 5 incremental upgrades as at the end of June 2004 version is in production and version 2.2 is under development as well as concurrent development of what for working purposes is characterised as PANDAS version 3 and which will include more systemic enhancement. PANDAS system architecture The system architecture consists of four layers: presentation, application, business and data (Figure 1.). 1. The Presentation Layer includes client applications providing a visual presentation of the system s application layer to the end user. The client interfaces have restricted access except for the public archive display client (PANDORA). 2. The Application Layer includes client-specific server implementations delivering application functionality and behaviour. Core applications such as the management system, PANDAS, and the public interface PANDORA share a common code base. 3. The Business Layer provides unified access to the heterogeneous data storage and communication infrastructure for the application layer. The PANDAS system uses the WebObjects Enterprise Objects Framework 12 which in turn provides an object-oriented interface to the storage architecture. 4. The Data Layer consists of third party infrastructure products providing industry standard interfaces to the business layer. It includes a Relational Database Management System (Oracle), database servers and WebDAV accessible file servers. 12

6 6 Paul Koerbin Figure 1: PANDAS System Architecture While it is not within the scope of this paper to describe the system architecture in detail, some characteristics of the system should be mentioned. What is referred to generally as PANDAS is in fact a combination of several individual server applications sharing a set of functionalities through application modules (see Figure 2. for the complete system model). These component modules include: PANDAS As a component module, PANDAS encapsulates the functionality required to provide a web-based user interface for managing the workflow activities. It is this component and its support for the web archiving workflows that is the main focus of this paper. PANDORA This component creates the public interface through title and subject listings and title entry pages for archived resources. These pages are

7 Managing Web Archiving in Australia 7 generated on the fly from metadata and display resources in a predefined format 13. Gatherer This component provides the ability to harvest titles as requested by users 14 through the PANDAS component. Currently the offline browser software HTTrack is connected to PANDAS to undertake the web harvesting. Scripter This module manages requests made from PANDAS and Gatherer components to run backend server tasks using Perl scripts. Access restrictor - This component is responsible for the implementation of access restrictions to titles identified for restricted access. Restrictor This component computes access restriction requirements for the title instances on a daily basis as well as when a user-initiated request is made to restrict a title. Notifier This component enables messages to be sent to PANDAS users delivered either by or through the PANDAS component user interface. Reporter This component generates reports on the PANDAS metadata stored in the database repository. 13 See footnote 3 for an explanation of PANDORA nomenclature. 14 The term user is used throughout this paper to refer to collection managers within PANDORA partner agencies who are registered on PANDAS as Standard Users or higher. That is they are the business users of the system who undertake the archiving activities. They may also be referred to as title owners throughout this paper.

8 8 Paul Koerbin Figure 2: PANDAS Architecture Components Model Processes and workflows PANDAS was designed to support the workflows defined by the staff of the Library s Digital Archiving Section 15. These workflows include: Identifying, selecting and registering candidate titles; Seeking and recording permissions to archive; Setting harvesting regimes; Gathering (harvesting) files; Undertaking quality assurance checking; Determining suitability for preservation and initiating archiving processes; and, 15 The PANDAS user manual providing screenshots of the system and procedural detail is available online at:

9 Managing Web Archiving in Australia 9 Organising access, display and discovery routes to, and metadata for, the archived resources. Figure 3: Simplified PANDORA web archiving workflow In broad terms PANDAS supports these workflows by means of the following functions: The management of administrative metadata about titles that have been either selected for national preservation, considered but rejected for national preservation or are being monitored pending a selection decision; The management of access restrictions; The scheduling and initiation of the harvesting of titles selected for archiving; The management of the quality checking and assurance process and associated problem reporting and fixing; The initiation of archiving processes; The preparation and organisation of archived instances for public display through specific title entry pages, collections of titles and subject and alphabetical listings; and,

10 10 Paul Koerbin The provision of pre-defined management reports. I will consider each of these functions in more detail in relation to PANDORA web archiving workflows. Figure 4: PANDAS Main Menu showing all options available to the PANDAS Administrator Figure 5: PANDAS Title View screen

11 Managing Web Archiving in Australia 11 Administrative metadata User level and agency responsibilities for registered titles The PANDAS user interface supports four user levels. Higher user levels inherit all the permissions of the preceding levels. They are: 1. Informational User this user is only able to search and view records; 2. Standard User this user can perform all the necessary archiving functions; 3. Agency Administrator this user is able to view management reports and administer agency specific details and users; and, 4. PANDAS Administrator this user can access all PANDAS functionality across all agencies. The task of identifying and selecting candidate titles for inclusion in PANDORA is currently undertaken outside the management system. All PANDORA partners have collection responsibilities, whether that is based on a specific jurisdiction (as for the respective state and territory libraries) or topical interest (as in the case of ScreenSound Australia, the Australian War Memorial and the Australian Institute of Aboriginal and Torres Strait Islander Studies). Responsibilities may not necessarily be clearly delineated however and users search PANDAS as part of the selection process to determine if a selection decision in regard to a specific publication has already been made by another PANDORA partner. PANDAS identifies the agency responsible for a registered title and its current status (e.g. national preservation, rejected, monitored or pending selection ). Titles registered on PANDAS by one agency can be viewed but not be edited by staff in another agency. However PANDAS includes functionality to transfer ownership of a title record between participating agencies. This allows, for example, for one agency to identify and register a title onto the system and then transfer that title to another agency (for whom responsibility may be more appropriate), for that agency to make the selection decision. This process initiates an notification that alerts the agency that a title has been transferred. The title also appears in a message box on the PANDAS main menu screen of the relevant Agency Administrator. Should an agency locate a title on PANDAS registered as rejected and wish to change that decision to being selected for national preservation the title can be transferred and the status changed and responsibility assumed by that agency. Categories of metadata recorded The administrative metadata recorded can, for the purpose of this paper, be considered under four broad categories. The metadata is not grouped under such categories on the PANDAS screens. 1. Original resource identification (title, URL, owning agency and individual, indexing agency that identified the title).

12 12 Paul Koerbin 2. Selection and archiving status (registration date, selection status, associated standing). 3. Access management (persistent identifier, subject listing, inclusion in collections, national bibliographic database number). 4. Rights management (publisher details, permission status, registry file reference, access restrictions). The metadata is recorded in four distinct record types that can be registered on PANDAS: 1. Title records; 2. Publisher records; 3. Collection records; and 4. Indexing agency records. I will expand on some of the metadata elements that may be less self-explanatory. Metadata identifying original resource In addition to the registering agency owning the record by default, an individual user registered with the agency can take ownership of the title. This action means that the individual will receive notification of completed harvesting events and their gathered instances will be listed in their personal Processing List. The Agency Administrator is able to assign ownership to agency staff and thus manage the distribution or work. Trusted indexing and abstracting agencies are registered in the system. When they notify the Digital Archiving Section (via a web form) of a candidate title for archiving, the identifying indexing agency is associated with the title. When the title has been archived the relevant indexing agency will receive an automated message sent to one or more addresses included in the PANDAS record for the indexing agency to notify them that the title is now archived and providing them with the persistent identifier URL. Selection and archiving status metadata A selection status such as national preservation or rejected or monitored will have a number of associated standings that further define the status. In the case of titles selected for national preservation the standings define the archiving status; for example, the standing current indicates that the title is being archived on an ongoing scheduled basis, while the standing complete indicates that the archiving process in not scheduled or ongoing. Other standings indicate such things as: that the title has disappeared from the web; permission to archive was not able to be obtained; or, that the site is unable to be archived (although still considered selected for national preservation). A number of these standings determine the wording that appears on the automatically generated title entry pages. So, a national preservation title with a standing of current will produce the text this title is archived regularly on the title entry page.

13 Managing Web Archiving in Australia 13 Access management metadata Access management metadata includes the allocation of a persistent identification number. This is a system generated running number allocated to the title record which is subsequently incorporated into archived resource s persistent URL (which will be discussed further below). There is also an option to include the title in a predetermined collection. Collection records are registered on PANDAS independently of titles. Once a collection record is established, title records can be associated with a collection and as a result will be listed on a collective title entry page. This functionality is commonly used to organise collections of web sites associated with events such as election campaigns where the sites may be a compilation of ephemeral (and substantial) web sites, but which the collection manager determines are more likely to be of interest as a collection. The concept of collection in PANDORA should be understood as distinct from the established subject listings. A title included in a collection and listed on a collection title entry page can also be listed as an individual title under a subject listing thus opening up multiple discovery routes. Managing access restrictions As mentioned at the beginning of this paper, electronic publications are not covered by legal deposit at the federal level in Australia. Therefore, archiving permission must be sought and the permission status (i.e. granted, denied, unknown) is recorded in the management system. The actual correspondence with publishers usually conducted exclusively by is maintained in an external system (at the National Library this is the electronic registry file system TRIM 16 ) and the registry file number is recorded as a local reference number. As part of the process of obtaining permission to archive it may be necessary to negotiate access restrictions (typically in the case of commercial publications). Access restrictions are set at the title level. Three types of access restriction can be applied: period restriction, date restriction and authenticated restriction. 1. Period restriction applies the restriction for a set time period from the date of archiving, e.g. three months, three years. Where titles are re-archived over a period of time, archived instances that no longer fall with the restriction period are automatically made available. 2. Date restriction applies the restriction between specified dates. Where there are multiple instances, all instances will be restricted for the period specified, irrespective of the date of archiving. 3. Authenticated restrictions allows for the application of specified username and password access. 16

14 14 Paul Koerbin Period and date restrictions are applied in conjunction with locations (a range of IP addresses and subnet masks) to which the access is limited. For example, access may be restricted to staff-only areas of the National Library or to a single PC in the Library s Main Reading Room. These restriction locations must be programmed into the system and are not maintained through the user interface. Multiple access restrictions can be applied to an individual title. So, for example, access may be restricted for three years in the National Library of Australia Reading Room and for two years in the State Library of Victoria Reading Room. Managing harvesting scheduling, filters PANDAS utilises HTTrack to harvest files from the web, however PANDAS provides the additional functionality of scheduling the harvesting process. An appropriate harvesting regime for the title is determined by the collection manager responsible for the title, although this may be in consultation with the publisher of the resource. In determining a schedule, consideration is given to the type of publication an integrating resource or e-journal will in most cases be archived on a regular basis, whereas a monographic type publication would only be archived once as well as to the stability of the publication and how long information is retained on the site. Generally, a longer harvesting schedule is preferred and, while no attempt is made to capture all changes to a site as they occur, it is the aim to ensure that all substantial content is captured. Three types of scheduling are supported by PANDAS: 1. Regular harvesting schedules such as weekly, monthly, annual (et al.). 2. Specific dates on which the title is to be harvested. 3. Immediate harvesting; this is in effect the same as a date specified gather, where the date specified is today, so the gather is initiated immediately. These scheduling methods can be used in combination. The PANDAS user interface includes functionality to specify gather filters, a range of settings and login/password requirements. These options largely emulate the options found on the WinHTTrack interface 17. However the PANDAS interface is intended to be essentially generic so that other harvesting software can be connected to PANDAS in a manner that is transparent to the user 18. A number of default settings are established which can be over-ridden by the user. These include: Instruction to ignore robot.txt rules (these are not required since permission is obtained before archiving); The harvesting of sub-directories by default; 17 WinHTTrack is the Windows version of the HTTrack software. See footnote 6 for URL. 18 This is yet to be tested.

15 Managing Web Archiving in Australia 15 The harvesting of near files (non-text files such as images linked from allowed pages); and, A limit on the depth of archiving. The limit on archiving depth is large enough to encompass any web site but is applied in order to thwart regressive interrogation of the server hosting the web site being harvested. Other limitations on the harvester are applied so as to reduce the likelihood of aggressive harvesting including a limit of six connections per server accessed and a transfer rate of 50 Kb per second. A download limit of one gigabyte is also applied although this is more to facilitate functionality within the bandwidth available to the National Library. As a web archiving system PANDAS is primarily designed to manage the harvesting (downloading) of files from the web. However the system also supports an uploading functionality. While the ability to upload files is an essential component of the quality assurance process associated with harvested resources (see below) it can also be used to ingest new resources (whether single files, multiple files or whole web sites) from a local drive. This procedure may typically be used when a site cannot be successfully harvested from the web and the publisher has supplied the files by other means (e.g. FTP or on CD). It is also commonly used for uploading publications supplied as attachments. The ability to upload from local drives to the working area server is achieved using the WebDAV protocol 19. In order to do this an empty archive instance (i.e. an archive directory path) must first be created to which the uploaded files can be added. This simply involves selecting this method from the options available in the gather module interface. As with harvesting, this upload process can be initiated either at the time of upload or can be scheduled. Uploading to the archive can only be done by authorised PANDAS users, not by external parties. The PANDAS interface allows the user to view the gather queues which identify titles in the process of harvesting, those waiting to be harvested and those that have finished harvesting and are awaiting quality assurance processing. The title owner or Agency Administrator can pause, stop or delete the harvest when displayed in the gather queues. To further control the use of bandwidth, currently PANDAS allows only four titles to be downloaded concurrently. If there are more titles set to be harvested than there are available connections, they will queue in a waiting list until a connection becomes available. Scheduled harvests are commenced after midnight on the day they are scheduled to run, so most (if not all) the harvesting is usually completed before staff commence work. However common practice is to also to initiate immediate harvest requests during working hours to suit agency or individual workflows. 19

16 16 Paul Koerbin Managing the quality assurance process The owner of a harvested title is notified of titles awaiting processing by means of a message on their personal view of the main menu. The user can link from this message directly to the processing screen for the title. Other routes to the processing of title are also available, either from the completed gather queue, the title record or the user s personalised processing list which includes newly harvested instances as well as those already accessed but yet to be archived. Harvested files are initially located on a file server designated as the PANDAS working area. That is, they are not considered archived at this stage. While the files are in this working area, the PANDAS user can delete the instance and can open write access to the files using the WebDAV protocol. The checking of an instance for functionality and the completeness or more correctly, the accuracy of the harvest is a manual process. This remains the most time-consuming aspect of the archiving process. However, such a process is considered a necessary aspect indeed one of the advantages of the selective archiving approach. Essentially this involves the user doing a visual check of the site, following links and noting the completeness and functionality of the site; and determining that the look and feel of the original has been retained. Since this is a visual checking process, some subtlety and experience is required, since there are a number of traps for the unwary. By way of example I will mention three such considerations: 1. While completeness is to be ascertained that is, what has been harvested correlates to the selected title entity it is also important to ensure that files that are not wanted are not kept, since permission may not have been obtained for this extraneous material. This can be done by checking external links (those outside the defined scope of the archived title) to ensure they have not been harvested. The user can access a directory view of the harvested files using WebDAV if necessary or desired, but this may not be done since opening WebDAV access is not essential to the quality assurance process The quality assessment must include looking for traps such as framed sites where the content is accessed by menu options not able to be parsed by the harvester and the original absolute URLs remain active. In such cases the archived instance may appear complete, but a close inspection of the link (by opening in a new browser window or holding the mouse over it) reveals that the content file is still being delivered from the original site, not the archive. 3. Metafiles, such as RealMedia.ram files are not parsed and so even in the archived version continue to deliver the media resource from the original 20 A desired enhancement to PANDAS aims to make this directory view more readily accessible.

17 Managing Web Archiving in Australia 17 host. Consequently the archived instance appears to retain functionality while in fact the resource has not been gathered into the archive. In some cases, where the archived version is incomplete or not fully functional, PANDAS users (that is, the collection managers) are able to put into effect solutions, such as downloading missing files and perhaps some minor re-referencing in the HTML. It depends on the skill of the user how much can be done, but PANDAS collection managers are expected to undertake basic repair work. More complex problems, such as those involving JavaScript or decompiling class files to identify missing resources, are referred to technical support drawn from staff in the National Library s Information Technology Division. PANDAS includes a reporting functionality within the processing screen to identify and describe problems and hand-over the title and associated error report to the technical support for analysis and resolution where possible. PANDAS includes a tracking system for these reports. When the problem has been dealt with by the IT technical support person it is handed back through the system to the title owner who will be notified through the previously mentioned message system on their personal view of the main menu. Archiving The decision whether or not to archive the instance remains with the collection manager who is the business owner of the title. In some cases it may be a matter of deleting the instance and re-setting gather filters to include or exclude certain files or directories. In cases where full functionality or all files cannot be obtained, the collection manager must determine if the instance should still be preserved in its incomplete state. When a collection manager decides to accept an instance they select the archive option on the processing screen. This initiates a script that packages the quality assessed instance as a gzip compressed tarball (TAR, tape archive format) file and moves it to the Library s Digital Object Storage System (DOSS) 21. Thus any changes made to this copy are retained as the display master copy. In addition to the display master, another unaltered copy is stored on the DOSS. This copy, known as the preservation master, is derived from the output of the harvesting software and does not include any human or machine interventions (other than that resulting from the harvester). A third master copy is stored as a metadata master which may be understood as a shadow copy of the harvested instance in which the contents of each original file has been replaced with the HTTP response header for that file from the original site s web server. This master may perhaps more correctly be termed a MIME (Multipurpose Internet Mail Extension) master since the significant content for 21 For a discussion of the development of the National Library of Australia s digital services architecture, including the DOSS, see Cathro, Warwick and Boston, Tony. Development of a Digital Services Architecture at the National Library of Australia. (2003)

18 18 Paul Koerbin preservation purpose of this shadow copy is the MIME type 22. Finally, when the archiving of an instance is initiated, the archive script also creates an access copy. This is an uncompressed copy of the display master which is moved to the PANDORA web server. This is the copy that is publicly accessible through the PANDORA home page. The archived copies can therefore be summarised as follows: 1. Preservation master an unaltered copy derived from the harvester which is stored on the DOSS in TAR format. 2. Display master a copy that includes changes made during the quality assessment process and stored on the DOSS in TAR format. 3. Metadata master a copy retaining the directory structure and file names of the original web site that includes, in place of the original content, metadata derived from the HTTP responses for each file. It is stored on DOSS in TAR format. 4. Access display copy an uncompressed copy of the display master maintained on a web server for public access. Managing Display and Resource Discovery The public interface component of the PANDORA Archive consists of a combination of pages generated on the fly from the PANDAS database together with a number of static HTML pages. The latter include various manuals and informational pages. The pages generated from the database provide the browsable access to the PANDORA resources by means of alphabetical title listings and portal-like subject listings. Each archived resource, corresponding to a registered title on PANDAS, has a title entry page (TEP). The TEP (see Figure 5) includes the following information, some of which is derived from the administrative metadata recorded on the title record, while other components are edited though the Display Details screens: Publication title this can be edited to be different from the title as registered in PANDAS if, for example, it needs to be made more meaningful for public display; Branding (i.e. an agency logo) and wording to identify which PANDORA partner agency has selected and archived the title; Wording to indicate whether the resource is being archived in an ongoing manner or has ceased, etc.; 22 One limitation of the system, which is designed primarily as a downloading system, is that when resources are uploaded to the Archive the Perl scripts that are normally run with harvesting are not initiated. Thus, for example the shadow archive for preservation metadata is not created.

19 Managing Web Archiving in Australia 19 Links to archived instances these can be displayed or not, for example if there are links to journal issues they may not be necessary; however it is common when there are links to issues to retain the instance link as a means to allow the user to navigate the whole archived site which may include earlier issues; Links to specific parts of an archived instance usually this will mean links to specific journal issues, but it may also be to component parts of a site that can be labelled as desired; A link to the archived version of the publisher s copyright statement; Other free text notes as considered appropriate there is facility to add these notes at the top of the TEP and at the bottom of the TEP in association with the copyright statement. It is possible to include HTML mark-up elements in the text to apply hypertext links and other formatting; A link to the publisher s site (if still available); A link to a PANDORA collection TEP if the title is associated with a collection; A link to the TEP of a former or later title if the resource is an e-journal that has changed title; The persistent URL for citation; An option to generate persistent URLs for pages within the archived resource; An icon indicating access restrictions apply to specific instances detail about the restriction is viewable by clicking on the icon; An icon indicating that browser or plug-in requirements apply to specific instances details, which are automatically extracted based on the file extensions, are viewable by clicking on the icon; An icon indicating specific information about the archived instance this is free text that can be added by the collection manager and can be used, for example, if there are significant deficiencies in the archived instance that need to be publicly noted. Again, the details are viewable by clicking on the icon.

20 20 Paul Koerbin Figure 6: PANDORA title entry page As already noted, PANDAS assigns a persistent identifier (a system generated running number) when a title is registered. This number is incorporated into the persistent URL applicable to the title s TEP. The PANDAS persistent identifier is incorporated into a schema developed by the National Library of Australia for its various digital collections 23. This schema incorporates a collection identification element for the archive (nla.arc), and the PANDAS persistent identification number. An example, as for the title in Figure 5 is This scheme can be logically extended to deep resources within the archive by adding elements to the base identifier to identify the archived instance and the resource file. Thus: is the persistent identifier for the archived editorial page of the September issue of the Indo-Pacific Journal of Phenomenology as archived on 22 August In order to facilitate the identification of these deep persistent identifiers, a citation generator is accessible from the PANDORA TEPs and the PANDORA citation generator page For more information on the schema and associated resolver service, see: Persistent Identifier Scheme for Digital Collections at the National Library of Australia

21 Managing Web Archiving in Australia 21 This allows a reader to copy and paste the Archive URL of a deep resource into the generator box which will calculate the persistent URL. All titles archived in PANDORA are catalogued with full MARC records onto Kinetica, the Australian National Bibliographic Database (NBD), and the National Library s own online catalogue. When the MARC record has been added to the NBD it is possible to initiate from PANDAS a Z39.50 query to extract MARC data which is then cross-walked to Dublin Core metadata elements 25 which are then embedded in the TEP. Currently this query has to be initiated by the PANDAS user. One other access element to mention in regard to access and discovery is the ability to limit the PANDORA home page listings (title and subject) to specified partner views. Thus, for example, it is possible to view only those titles archived by the National Library of Australia or only those archived by the State Library of New South Wales. While this functionality is present it is somewhat underdeveloped at present especially in regard to clear branding. Reports PANDAS includes a reports module in which a number of pre-defined reports are viewable through the user interface at any time by users with administrator privileges. These reports can also be programmed to run at defined times, in which case a notification is sent to the relevant agency s address. These pre-defined reports include: A report on the total number of archived titles and total number of archived instances; A report on scheduled harvests set to run in the coming week; A monthly statistical report, reporting such things as new titles registered, new titles archived and titles re-archived (i.e. scheduled titles archived); Titles for which permission has been sought but not received; New title instances archived in the previous two months and a subset of this, new government titles archived in the same period; and, Titles being monitored for selection. These reports can be viewed in relation to all agencies or for a specified agency. Ad hoc reports cannot be run through the PANDAS interface; however SQL queries can be run over the Oracle database by a collection manager with appropriate access rights and software (e.g. Oracle Report Builder). For user defined statistical 25 The AGLS standard is used. See:

22 22 Paul Koerbin reporting the National Library has been testing the ProClarity Analytics Platform 26 which provides data cube manipulation and analysis capabilities. In order to report on broken publisher URLs which are displayed on the title entry pages, the link checking software LinkScan 27 is used. These reports identify broken links (404 errors) and other URL problems such as invalid URL schemes. Acting on these reports can involve considerable work in order to determine if the site has disappeared from the Internet or if it has a new location. In addition, these changes will require information in PANDAS to be updated and in many cases editing of the MARC catalogue record. Future developments and directions Future developments and directions for PANDAS and web archiving in Australia address both current system issues debugging, enhancements to existing modules and more systemic issues. The process of enhancing PANDAS is understood as an ongoing one. Planning for PANDAS version 2, for example, was well underway as PANDAS version 1 was being implemented. Since PANDAS version 2.0 was released in August 2002 there has been a considerable amount of debugging work and minor enhancements deployed in incremental upgrades. Desired enhancements to existing modules have been identified and are currently being progressively implemented. The sorts of enhancements encompassed in this work include changes to existing functionality to better support workflows and improve system performance. With increased PANDORA partner participation, heavier archiving loads and the regular archiving of large web sites, some problems in respect of the robustness and stability of the system have become evident. These problems relate in some part to the available hardware capabilities, such as memory and processing capacity, but also stem from the way the development environment, WebObjects, has been used. At the time that PANDAS was developed some five years ago, training and support for WebObjects was not readily available in Australia and one consequence of this was some sub-optimal use of the development environment which has been identified as the cause of some problems of stability in the production environment. The National Library is now proposing to re-engineer the PANDAS software to address this issue and migrate PANDAS to the later and better supported WebObjects 5.2. As this version of WebObjects is compliant with Java Enterprise Standards this will allow for the software to be deployed on standard Java Application Servers, a factor that will better enable possible collaborative development. A number of factors driving future developments will have significant implications for the system and workflows. These include:

23 Managing Web Archiving in Australia 23 The need to be able to automatically ingest and process a larger volume of online publications and associated metadata; The need to comply with the standards and adopt the tools being developed by the International Internet Preservation Consortium (IIPC); The need to improve access and discovery paths to the Archive as it continues to grow; The need to be able to incorporate data acquired by methods other than the current harvesting method, including partial or whole domain harvests, collecting deep web resources (e.g. databases) and enhanced deposit capability; and, The need for the automatic collection of more preservation metadata, together with a collection manager interface to the preservation metadata and mechanisms to support preservation processes. In respect of the ingest of larger volumes of online publications, the Library has begun working towards automating the process of identifying resources using MARC records derived from metadata emanating from Australian Commonwealth Government agencies and using this to automatically register and harvest resource through PANDAS. The National Library as a member of the International Internet Preservation Consortium 28 is currently leading the Consortium s Deep Web Working Group researching and developing strategies and tool for archiving web content that is inaccessible to crawlers. Some concluding remarks on the system and approach In adopting a selective approach to archiving and developing an archive system initially as a proof-of-concept model, the National Library of Australia has been able to achieve tangible web archiving outcomes while continuing to develop, extend and refine its systems and workflows. The scalability of the selective approach, which allows for the negotiation of permission to archive with rights owners and quality assurance in the archiving process, supports the building of an accessible, functional and undoubtedly valuable web archive. In taking up practical web archiving at an early stage while concurrently developing the system it is true to say that the management system that was created was designed to specifically meet the workflows that were being established by the National Library of Australia. The shortcomings of selective archiving must however be recognised. While the Australian domain is perhaps of a size to make selective archiving a valuable pursuit, nevertheless much the greater part of the Australian domain remains un-archived 28

24 24 Paul Koerbin by PANDORA. This puts great onus on the selection process, but it will always be a fraught undertaking to try and pre-empt all that future researchers may require or desire. Moreover as Catherine Lupovici has emphasised, 29 the web or any specified domain within it is an entity is in which the linkages form part of the essential character that ought to be preserved. This cannot be achieved with selective archiving of the type demonstrated by PANDORA. It must also be recognised that selective archiving as done in Australia does retain a degree of manual intervention, specifically in the selection, quality assurance and cataloguing processes, that requires considerable resources in order to achieve archiving on any useful scale. The ambition for web archiving in Australia is to develop a system able to automate as many of the web archiving processes as possible while retaining a commitment to quality archiving outcomes. In doing this we aim to be able to increase the scale of archiving and be able to incorporate a range of archiving approaches including quality assessed selective archiving and large scale domain or partial domain harvesting. The Australian experience at least demonstrates that, even with minimal and inevitably inadequate resources, a practical approach can achieve valuable results in the form of a working, accessible web archive while at the same time serving to promote system development and useful engagement in the ongoing work to address the problems of this essential pursuit. Acknowledgements I wish to acknowledge and thank Vinita Tuteja of the IT Applications Branch at the National Library of Australia for her documentation of the PANDAS system architecture from which much of the description in this paper is derived; and for her diagrams of the PANDAS architecture, system model and workflow. I also wish to thank Steven McPhillips of the Business Systems Support Branch at the Library for clarification of a number of technical aspects in regard to the system. This work is licensed under a Creative Commons License Catherine Lupovici, Head of Digital Library Department, Bibliothèque nationale du France, made this point in her unpublished keynote address at the 2004 VALA (Victorian Association for Library Automation) Conference in Melbourne, 3-5 February Conference web site at:

How To Manage Pandora

How To Manage Pandora PANDORA - past, present, and future National web archiving in Australia Dr Paul Koerbin Manager Web Archiving National Library of Australia National Conference on eresources in Malaysia Penang, Malaysia,

More information

How To Use The Web Curator Tool On A Pc Or Macbook Or Ipad (For Macbook)

How To Use The Web Curator Tool On A Pc Or Macbook Or Ipad (For Macbook) User Manual Version 1.4.1... April 2009 Contents Contents... 2 Introduction... 4 About the Web Curator Tool... 4 About this document... 4 Where to find more information... 4 System Overview... 5 Background...

More information

NETWRIX EVENT LOG MANAGER

NETWRIX EVENT LOG MANAGER NETWRIX EVENT LOG MANAGER ADMINISTRATOR S GUIDE Product Version: 4.0 July/2012. Legal Notice The information in this publication is furnished for information use only, and does not constitute a commitment

More information

Archiving Web Resources: Guidelines for Keeping Records of Web-based Activity in the Commonwealth Government

Archiving Web Resources: Guidelines for Keeping Records of Web-based Activity in the Commonwealth Government Archiving Web Resources: Guidelines for Keeping Records of Web-based Activity in the Commonwealth Government GOVERNMENT RECORDKEEPING MARCH 2001 ISBN 0 642 34440 X Commonwealth of Australia 2001 This work

More information

Functional Requirements for Digital Asset Management Project version 3.0 11/30/2006

Functional Requirements for Digital Asset Management Project version 3.0 11/30/2006 /30/2006 2 3 4 5 6 7 8 9 0 2 3 4 5 6 7 8 9 20 2 22 23 24 25 26 27 28 29 30 3 32 33 34 35 36 37 38 39 = required; 2 = optional; 3 = not required functional requirements Discovery tools available to end-users:

More information

Queensland recordkeeping metadata standard and guideline

Queensland recordkeeping metadata standard and guideline Queensland recordkeeping metadata standard and guideline June 2012 Version 1.1 Queensland State Archives Department of Science, Information Technology, Innovation and the Arts Document details Security

More information

zen Platform technical white paper

zen Platform technical white paper zen Platform technical white paper The zen Platform as Strategic Business Platform The increasing use of application servers as standard paradigm for the development of business critical applications meant

More information

How To Use Open Source Software For Library Work

How To Use Open Source Software For Library Work USE OF OPEN SOURCE SOFTWARE AT THE NATIONAL LIBRARY OF AUSTRALIA Reports on Special Subjects ABSTRACT The National Library of Australia has been a long-term user of open source software to support generic

More information

DiskPulse DISK CHANGE MONITOR

DiskPulse DISK CHANGE MONITOR DiskPulse DISK CHANGE MONITOR User Manual Version 7.9 Oct 2015 www.diskpulse.com info@flexense.com 1 1 DiskPulse Overview...3 2 DiskPulse Product Versions...5 3 Using Desktop Product Version...6 3.1 Product

More information

Vector HelpDesk - Administrator s Guide

Vector HelpDesk - Administrator s Guide Vector HelpDesk - Administrator s Guide Vector HelpDesk - Administrator s Guide Configuring and Maintaining Vector HelpDesk version 5.6 Vector HelpDesk - Administrator s Guide Copyright Vector Networks

More information

Virtual Data Centre. User Guide

Virtual Data Centre. User Guide Virtual Data Centre User Guide 2 P age Table of Contents Getting Started with vcloud Director... 8 1. Understanding vcloud Director... 8 2. Log In to the Web Console... 9 3. Using vcloud Director... 10

More information

Email Data Protection. Administrator Guide

Email Data Protection. Administrator Guide Email Data Protection Administrator Guide Email Data Protection Administrator Guide Documentation version: 1.0 Legal Notice Legal Notice Copyright 2015 Symantec Corporation. All rights reserved. Symantec,

More information

NS DISCOVER 4.0 ADMINISTRATOR S GUIDE. July, 2015. Version 4.0

NS DISCOVER 4.0 ADMINISTRATOR S GUIDE. July, 2015. Version 4.0 NS DISCOVER 4.0 ADMINISTRATOR S GUIDE July, 2015 Version 4.0 TABLE OF CONTENTS 1 General Information... 4 1.1 Objective... 4 1.2 New 4.0 Features Improvements... 4 1.3 Migrating from 3.x to 4.x... 5 2

More information

WebSpy Vantage Ultimate 2.2 Web Module Administrators Guide

WebSpy Vantage Ultimate 2.2 Web Module Administrators Guide WebSpy Vantage Ultimate 2.2 Web Module Administrators Guide This document is intended to help you get started using WebSpy Vantage Ultimate and the Web Module. For more detailed information, please see

More information

Copyright 2014 Jaspersoft Corporation. All rights reserved. Printed in the U.S.A. Jaspersoft, the Jaspersoft

Copyright 2014 Jaspersoft Corporation. All rights reserved. Printed in the U.S.A. Jaspersoft, the Jaspersoft 5.6 Copyright 2014 Jaspersoft Corporation. All rights reserved. Printed in the U.S.A. Jaspersoft, the Jaspersoft logo, Jaspersoft ireport Designer, JasperReports Library, JasperReports Server, Jaspersoft

More information

Sisense. Product Highlights. www.sisense.com

Sisense. Product Highlights. www.sisense.com Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze

More information

Novell ZENworks Asset Management 7.5

Novell ZENworks Asset Management 7.5 Novell ZENworks Asset Management 7.5 w w w. n o v e l l. c o m October 2006 USING THE WEB CONSOLE Table Of Contents Getting Started with ZENworks Asset Management Web Console... 1 How to Get Started...

More information

State Records Guideline No 15. Recordkeeping Strategies for Websites and Web pages

State Records Guideline No 15. Recordkeeping Strategies for Websites and Web pages State Records Guideline No 15 Recordkeeping Strategies for Websites and Web pages Table of Contents 1 Introduction... 4 1.1 Purpose... 4 1.2 Authority... 5 2 Recordkeeping business requirements... 5 2.1

More information

SAS Business Data Network 3.1

SAS Business Data Network 3.1 SAS Business Data Network 3.1 User s Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2014. SAS Business Data Network 3.1: User's Guide. Cary,

More information

Quick Start Guide. Installation and Setup

Quick Start Guide. Installation and Setup Quick Start Guide Installation and Setup Introduction Velaro s live help and survey management system provides an exciting new way to engage your customers and website visitors. While adding any new technology

More information

Coveo Platform 7.0. Microsoft Dynamics CRM Connector Guide

Coveo Platform 7.0. Microsoft Dynamics CRM Connector Guide Coveo Platform 7.0 Microsoft Dynamics CRM Connector Guide Notice The content in this document represents the current view of Coveo as of the date of publication. Because Coveo continually responds to changing

More information

Netwrix Auditor for Windows Server

Netwrix Auditor for Windows Server Netwrix Auditor for Windows Server Quick-Start Guide Version: 7.0 7/7/2015 Legal Notice The information in this publication is furnished for information use only, and does not constitute a commitment from

More information

Software Development Kit

Software Development Kit Open EMS Suite by Nokia Software Development Kit Functional Overview Version 1.3 Nokia Siemens Networks 1 (21) Software Development Kit The information in this document is subject to change without notice

More information

NETWRIX USER ACTIVITY VIDEO REPORTER

NETWRIX USER ACTIVITY VIDEO REPORTER NETWRIX USER ACTIVITY VIDEO REPORTER ADMINISTRATOR S GUIDE Product Version: 1.0 January 2013. Legal Notice The information in this publication is furnished for information use only, and does not constitute

More information

The Australian War Memorial s Digital Asset Management System

The Australian War Memorial s Digital Asset Management System The Australian War Memorial s Digital Asset Management System Abstract The Memorial is currently developing an Enterprise Content Management System (ECM) of which a Digital Asset Management System (DAMS)

More information

Netwrix Auditor for Exchange

Netwrix Auditor for Exchange Netwrix Auditor for Exchange Quick-Start Guide Version: 8.0 4/22/2016 Legal Notice The information in this publication is furnished for information use only, and does not constitute a commitment from Netwrix

More information

Acunetix Web Vulnerability Scanner. Getting Started. By Acunetix Ltd.

Acunetix Web Vulnerability Scanner. Getting Started. By Acunetix Ltd. Acunetix Web Vulnerability Scanner Getting Started V8 By Acunetix Ltd. 1 Starting a Scan The Scan Wizard allows you to quickly set-up an automated scan of your website. An automated scan provides a comprehensive

More information

NETWRIX EVENT LOG MANAGER

NETWRIX EVENT LOG MANAGER NETWRIX EVENT LOG MANAGER QUICK-START GUIDE FOR THE ENTERPRISE EDITION Product Version: 4.0 July/2012. Legal Notice The information in this publication is furnished for information use only, and does not

More information

Installing and Administering VMware vsphere Update Manager

Installing and Administering VMware vsphere Update Manager Installing and Administering VMware vsphere Update Manager Update 1 vsphere Update Manager 5.1 This document supports the version of each product listed and supports all subsequent versions until the document

More information

Installing, Uninstalling, and Upgrading Service Monitor

Installing, Uninstalling, and Upgrading Service Monitor CHAPTER 2 Installing, Uninstalling, and Upgrading Service Monitor This section contains the following topics: Preparing to Install Service Monitor, page 2-1 Installing Cisco Unified Service Monitor, page

More information

Net Services: File System Monitor

Net Services: File System Monitor Net Services: File System Monitor Settings for ExtremeZ-IP file server volumes...1 Setup of the Net Services server...2 Configuring and testing the Net Services server...3 Installing File System Monitor...4

More information

1. Digital Asset Management User Guide... 2 1.1 Digital Asset Management Concepts... 2 1.2 Working with digital assets... 4 1.2.1 Importing assets in

1. Digital Asset Management User Guide... 2 1.1 Digital Asset Management Concepts... 2 1.2 Working with digital assets... 4 1.2.1 Importing assets in 1. Digital Asset Management User Guide....................................................... 2 1.1 Digital Asset Management Concepts.................................................... 2 1.2 Working with

More information

Customer Control Panel Manual

Customer Control Panel Manual Customer Control Panel Manual Contents Introduction... 2 Before you begin... 2 Logging in to the Control Panel... 2 Resetting your Control Panel password.... 3 Managing FTP... 4 FTP details for your website...

More information

Service Performance Management: Pragmatic Approach by Jim Lochran

Service Performance Management: Pragmatic Approach by Jim Lochran www.pipelinepub.com Volume 3, Issue 12 Service Performance Management: Pragmatic Approach by Jim Lochran As the mix of service provider offerings become more IP centric, the need to overhaul existing service

More information

Dell Active Administrator 8.0

Dell Active Administrator 8.0 What s new in Dell Active Administrator 8.0 January 2016 Dell Active Administrator 8.0 is the upcoming release of Dell Software's complete solution for managing Microsoft Active Directory security auditing,

More information

Novo Knowledge Base Software

Novo Knowledge Base Software Customer Support & Knowledge Management Solutions Novo Solutions for KNOWLEDGE MANAGEMENT What Will It Do For You? Better & Faster Customer Support: provides quicker problem resolution and 24 x 7 Web customer

More information

Vector Asset Management User Manual

Vector Asset Management User Manual Vector Asset Management User Manual This manual describes how to set up Vector Asset Management 6.0. It describes how to use the: Vector AM Console Vector AM Client Hardware Inventory Software Inventory

More information

NatureServe s Environmental Review Tool

NatureServe s Environmental Review Tool NatureServe s Environmental Review Tool A Repeatable Online Software Solution for Agencies For More Information, Contact: Lori Scott Rob Solomon lori_scott@natureserve.org rob_solomon@natureserve.org 703-908-1877

More information

Government of Saskatchewan Executive Council. Oracle Sourcing isupplier User Guide

Government of Saskatchewan Executive Council. Oracle Sourcing isupplier User Guide Executive Council Oracle Sourcing isupplier User Guide Contents 1 Introduction to Oracle Sourcing and isupplier...6 1.0 Oracle isupplier...6 1.1 Oracle Sourcing...6 2 Customer Support...8 2.0 Communications

More information

8.7. NET SatisFAXtion Email Gateway Installation Guide. For NET SatisFAXtion 8.7. Contents

8.7. NET SatisFAXtion Email Gateway Installation Guide. For NET SatisFAXtion 8.7. Contents NET SatisFAXtion Email Gateway Installation Guide For NET SatisFAXtion 8.7 Contents Install Microsoft Virtual SMTP Server 2 XP and 2003 2 2008 and 2008 R2 2 Windows 7 2 Upgrade Path 2 Configure Microsoft

More information

TIBCO Spotfire Automation Services 6.5. User s Manual

TIBCO Spotfire Automation Services 6.5. User s Manual TIBCO Spotfire Automation Services 6.5 User s Manual Revision date: 17 April 2014 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO

More information

IBM Campaign Version-independent Integration with IBM Engage Version 1 Release 3 April 8, 2016. Integration Guide IBM

IBM Campaign Version-independent Integration with IBM Engage Version 1 Release 3 April 8, 2016. Integration Guide IBM IBM Campaign Version-independent Integration with IBM Engage Version 1 Release 3 April 8, 2016 Integration Guide IBM Note Before using this information and the product it supports, read the information

More information

How To Use Gfi Mailarchiver On A Pc Or Macbook With Gfi Email From A Windows 7.5 (Windows 7) On A Microsoft Mail Server On A Gfi Server On An Ipod Or Gfi.Org (

How To Use Gfi Mailarchiver On A Pc Or Macbook With Gfi Email From A Windows 7.5 (Windows 7) On A Microsoft Mail Server On A Gfi Server On An Ipod Or Gfi.Org ( GFI MailArchiver for Exchange 4 Manual By GFI Software http://www.gfi.com Email: info@gfi.com Information in this document is subject to change without notice. Companies, names, and data used in examples

More information

VMware Mirage Web Manager Guide

VMware Mirage Web Manager Guide Mirage 5.1 This document supports the version of each product listed and supports all subsequent versions until the document is replaced by a new edition. To check for more recent editions of this document,

More information

Dell Enterprise Reporter 2.5. Configuration Manager User Guide

Dell Enterprise Reporter 2.5. Configuration Manager User Guide Dell Enterprise Reporter 2.5 2014 Dell Inc. ALL RIGHTS RESERVED. This guide contains proprietary information protected by copyright. The software described in this guide is furnished under a software license

More information

Netwrix Auditor for Active Directory

Netwrix Auditor for Active Directory Netwrix Auditor for Active Directory Quick-Start Guide Version: 7.1 10/26/2015 Legal Notice The information in this publication is furnished for information use only, and does not constitute a commitment

More information

WS_FTP Professional 12

WS_FTP Professional 12 WS_FTP Professional 12 Tools Guide Contents CHAPTER 1 Introduction Ways to Automate Regular File Transfers...5 Check Transfer Status and Logs...6 Building a List of Files for Transfer...6 Transfer Files

More information

System Administration Training Guide. S100 Installation and Site Management

System Administration Training Guide. S100 Installation and Site Management System Administration Training Guide S100 Installation and Site Management Table of contents System Requirements for Acumatica ERP 4.2... 5 Learning Objects:... 5 Web Browser... 5 Server Software... 5

More information

How To Set Up Total Recall Web On A Microsoft Memorybook 2.5.2.2 (For A Microtron)

How To Set Up Total Recall Web On A Microsoft Memorybook 2.5.2.2 (For A Microtron) Total Recall Web Web Module Manual and Customer Quick Reference Guides COPYRIGHT NOTICE Copyright 1994-2009 by DHS Associates, Inc. All Rights Reserved. All TOTAL RECALL, TOTAL RECALL SQL, TOTAL RECALL

More information

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories

DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories DSpace: An Institutional Repository from the MIT Libraries and Hewlett Packard Laboratories MacKenzie Smith, Associate Director for Technology Massachusetts Institute of Technology Libraries, Cambridge,

More information

NETWRIX FILE SERVER CHANGE REPORTER

NETWRIX FILE SERVER CHANGE REPORTER NETWRIX FILE SERVER CHANGE REPORTER ADMINISTRATOR S GUIDE Product Version: 3.3 April/2012. Legal Notice The information in this publication is furnished for information use only, and does not constitute

More information

Netwrix Auditor for SQL Server

Netwrix Auditor for SQL Server Netwrix Auditor for SQL Server Quick-Start Guide Version: 7.1 10/26/2015 Legal Notice The information in this publication is furnished for information use only, and does not constitute a commitment from

More information

8.6. NET SatisFAXtion Email Gateway Installation Guide. For NET SatisFAXtion 8.6. Contents

8.6. NET SatisFAXtion Email Gateway Installation Guide. For NET SatisFAXtion 8.6. Contents NET SatisFAXtion Email Gateway Installation Guide For NET SatisFAXtion 8.6 Contents 1.0 - Install Microsoft Virtual SMTP Server 2 XP and 2003 2 2008 and 2008 R2 2 Windows 7 2 Upgrade Path 2 Configure Microsoft

More information

HP ProLiant Essentials Vulnerability and Patch Management Pack Planning Guide

HP ProLiant Essentials Vulnerability and Patch Management Pack Planning Guide HP ProLiant Essentials Vulnerability and Patch Management Pack Planning Guide Product overview... 3 Vulnerability scanning components... 3 Vulnerability fix and patch components... 3 Checklist... 4 Pre-installation

More information

IBM Campaign and IBM Silverpop Engage Version 1 Release 2 August 31, 2015. Integration Guide IBM

IBM Campaign and IBM Silverpop Engage Version 1 Release 2 August 31, 2015. Integration Guide IBM IBM Campaign and IBM Silverpop Engage Version 1 Release 2 August 31, 2015 Integration Guide IBM Note Before using this information and the product it supports, read the information in Notices on page 93.

More information

SCOPE OF SERVICE Hosted Cloud Storage Service: Scope of Service

SCOPE OF SERVICE Hosted Cloud Storage Service: Scope of Service Hosted Cloud Storage Service: Scope of Service 1. Definitions 1.1 For the purposes of this Schedule: Access Account is an End User account with Data Storage requiring authentication via a username and

More information

Citrix EdgeSight Administrator s Guide. Citrix EdgeSight for Endpoints 5.3 Citrix EdgeSight for XenApp 5.3

Citrix EdgeSight Administrator s Guide. Citrix EdgeSight for Endpoints 5.3 Citrix EdgeSight for XenApp 5.3 Citrix EdgeSight Administrator s Guide Citrix EdgeSight for Endpoints 5.3 Citrix EdgeSight for enapp 5.3 Copyright and Trademark Notice Use of the product documented in this guide is subject to your prior

More information

User Guidelines for QFES e-lodgement

User Guidelines for QFES e-lodgement Guidelines to assist with electronically registering, submitting, receiving and viewing applications for QFES Referral Agency Advice under the Sustainable Planning Act 2009. State of Queensland (Queensland

More information

Your complete guide to installing the info@hand Self-Service Portal and estore.

Your complete guide to installing the info@hand Self-Service Portal and estore. Your complete guide to installing the info@hand Self-Service Portal and estore. Install the Portal & estore as shrink-wrapped software, or as add-ons to an existing Joomla! installation. Then configure

More information

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data

Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data Archiving, Indexing and Accessing Web Materials: Solutions for large amounts of data David Minor 1, Reagan Moore 2, Bing Zhu, Charles Cowart 4 1. (88)4-104 minor@sdsc.edu San Diego Supercomputer Center

More information

Dashboard Admin Guide

Dashboard Admin Guide MadCap Software Dashboard Admin Guide Pulse Copyright 2014 MadCap Software. All rights reserved. Information in this document is subject to change without notice. The software described in this document

More information

Improved document archiving speeds; data enters the FileNexus System at a faster rate! See benchmark test spreadsheet.

Improved document archiving speeds; data enters the FileNexus System at a faster rate! See benchmark test spreadsheet. Feature Sheet Version 6.100.14 FileNexus Major Advances Client Server Communication - Dependency on Windows DCOM protocols eliminated which means NO additional configuration required on Client PCs after

More information

Connection Broker Managing User Connections to Workstations, Blades, VDI, and More. Quick Start with Microsoft Hyper-V

Connection Broker Managing User Connections to Workstations, Blades, VDI, and More. Quick Start with Microsoft Hyper-V Connection Broker Managing User Connections to Workstations, Blades, VDI, and More Quick Start with Microsoft Hyper-V Version 8.1 October 21, 2015 Contacting Leostream Leostream Corporation http://www.leostream.com

More information

LEA Monitoring User Guide

LEA Monitoring User Guide LEA Monitoring User Guide v. 3.0 September 2012 Contents Contents... 2 Introduction... 4 Acknowledgements... 4 Questions... 4 What is DMI Tracker?... 5 Monitoring... 5 End User System Guidelines... 5 Accessing

More information

The Recipe for Sarbanes-Oxley Compliance using Microsoft s SharePoint 2010 platform

The Recipe for Sarbanes-Oxley Compliance using Microsoft s SharePoint 2010 platform The Recipe for Sarbanes-Oxley Compliance using Microsoft s SharePoint 2010 platform Technical Discussion David Churchill CEO DraftPoint Inc. The information contained in this document represents the current

More information

Documentum Content Distribution Services TM Administration Guide

Documentum Content Distribution Services TM Administration Guide Documentum Content Distribution Services TM Administration Guide Version 5.3 SP5 August 2007 Copyright 1994-2007 EMC Corporation. All rights reserved. Table of Contents Preface... 7 Chapter 1 Introducing

More information

National Fire Incident Reporting System (NFIRS 5.0) Configuration Tool User's Guide

National Fire Incident Reporting System (NFIRS 5.0) Configuration Tool User's Guide National Fire Incident Reporting System (NFIRS 5.0) Configuration Tool User's Guide NFIRS 5.0 Software Version 5.6 1/7/2009 Department of Homeland Security Federal Emergency Management Agency United States

More information

CA Performance Center

CA Performance Center CA Performance Center Single Sign-On User Guide 2.4 This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to as the Documentation ) is

More information

Password Reset Tool for Service Desk Operators Version 2.0

Password Reset Tool for Service Desk Operators Version 2.0 www.telnetport25.com Password Reset Tool for Service Desk Operators Version 2.0 Installation & User Guide Author: Andy Grogan 2 www.telnetport25.com Password Reset Tool Installation Guide Contents Overview...

More information

Taleo Enterprise. Taleo Reporting Getting Started with Business Objects XI3.1 - User Guide

Taleo Enterprise. Taleo Reporting Getting Started with Business Objects XI3.1 - User Guide Taleo Enterprise Taleo Reporting XI3.1 - User Guide Feature Pack 12A January 27, 2012 Confidential Information and Notices Confidential Information The recipient of this document (hereafter referred to

More information

Glyma Deployment Instructions

Glyma Deployment Instructions Glyma Deployment Instructions Version 0.8 Copyright 2015 Christopher Tomich and Paul Culmsee and Peter Chow Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except

More information

HP Insight Diagnostics Online Edition. Featuring Survey Utility and IML Viewer

HP Insight Diagnostics Online Edition. Featuring Survey Utility and IML Viewer Survey Utility HP Industry Standard Servers June 2004 HP Insight Diagnostics Online Edition Technical White Paper Featuring Survey Utility and IML Viewer Table of Contents Abstract Executive Summary 3

More information

FileMaker Server 14. Custom Web Publishing Guide

FileMaker Server 14. Custom Web Publishing Guide FileMaker Server 14 Custom Web Publishing Guide 2004 2015 FileMaker, Inc. All Rights Reserved. FileMaker, Inc. 5201 Patrick Henry Drive Santa Clara, California 95054 FileMaker and FileMaker Go are trademarks

More information

Management Software. Web Browser User s Guide AT-S106. For the AT-GS950/48 Gigabit Ethernet Smart Switch. Version 1.0.0. 613-001339 Rev.

Management Software. Web Browser User s Guide AT-S106. For the AT-GS950/48 Gigabit Ethernet Smart Switch. Version 1.0.0. 613-001339 Rev. Management Software AT-S106 Web Browser User s Guide For the AT-GS950/48 Gigabit Ethernet Smart Switch Version 1.0.0 613-001339 Rev. A Copyright 2010 Allied Telesis, Inc. All rights reserved. No part of

More information

Fax User Guide 07/31/2014 USER GUIDE

Fax User Guide 07/31/2014 USER GUIDE Fax User Guide 07/31/2014 USER GUIDE Contents: Access Fusion Fax Service 3 Search Tab 3 View Tab 5 To E-mail From View Page 5 Send Tab 7 Recipient Info Section 7 Attachments Section 7 Preview Fax Section

More information

OneStop Reporting 3.7 Installation Guide. Updated: 2013-01-31

OneStop Reporting 3.7 Installation Guide. Updated: 2013-01-31 OneStop Reporting 3.7 Installation Guide Updated: 2013-01-31 Copyright OneStop Reporting AS www.onestopreporting.com Table of Contents System Requirements... 1 Obtaining the Software... 2 Obtaining Your

More information

GFI Product Guide. GFI MailArchiver Archive Assistant

GFI Product Guide. GFI MailArchiver Archive Assistant GFI Product Guide GFI MailArchiver Archive Assistant The information and content in this document is provided for informational purposes only and is provided "as is" with no warranty of any kind, either

More information

Best Practices: Extending Enterprise Applications to Mobile Devices

Best Practices: Extending Enterprise Applications to Mobile Devices Best Practices: Extending Enterprise Applications to Mobile Devices by Kulathumani Hariharan Summary: Extending enterprise applications to mobile devices is increasingly becoming a priority for organizations

More information

Migrating to vcloud Automation Center 6.1

Migrating to vcloud Automation Center 6.1 Migrating to vcloud Automation Center 6.1 vcloud Automation Center 6.1 This document supports the version of each product listed and supports all subsequent versions until the document is replaced by a

More information

Basic & Advanced Administration for Citrix NetScaler 9.2

Basic & Advanced Administration for Citrix NetScaler 9.2 Basic & Advanced Administration for Citrix NetScaler 9.2 Day One Introducing and deploying Citrix NetScaler Key - Brief Introduction to the NetScaler system Planning a NetScaler deployment Deployment scenarios

More information

Groove Management Server

Groove Management Server Groove Management Server Version 3.1 Domain Administrator s Guide Copyright Copyright 2001-2005, Groove Networks, Inc. All rights reserved. You may not reproduce or distribute any part of this document

More information

IBM Aspera Add-in for Microsoft Outlook 1.3.2

IBM Aspera Add-in for Microsoft Outlook 1.3.2 IBM Aspera Add-in for Microsoft Outlook 1.3.2 Windows: 7, 8 Revision: 1.3.2.100253 Generated: 02/12/2015 10:58 Contents 2 Contents Introduction... 3 System Requirements... 5 Setting Up... 6 Account Credentials...6

More information

Audit Management Reference

Audit Management Reference www.novell.com/documentation Audit Management Reference ZENworks 11 Support Pack 3 February 2014 Legal Notices Novell, Inc., makes no representations or warranties with respect to the contents or use of

More information

POINT OF SALES SYSTEM (POSS) USER MANUAL

POINT OF SALES SYSTEM (POSS) USER MANUAL Page 1 of 24 POINT OF SALES SYSTEM (POSS) USER MANUAL System Name : POSI-RAD System Release Version No. : V4.0 Total pages including this covering : 23 Page 2 of 24 Table of Contents 1 INTRODUCTION...

More information

Novell ZENworks Asset Management 7.5

Novell ZENworks Asset Management 7.5 Novell ZENworks Asset Management 7.5 w w w. n o v e l l. c o m October 2006 INSTALLATION GUIDE Table Of Contents 1. Installation Overview... 1 If you are upgrading... 1 Installation Choices... 1 ZENworks

More information

Brown County Information Technology Aberdeen, SD. Request for Proposals For Document Management Solution. Proposals Deadline: Submit proposals to:

Brown County Information Technology Aberdeen, SD. Request for Proposals For Document Management Solution. Proposals Deadline: Submit proposals to: Brown County Information Technology Aberdeen, SD Request for Proposals For Document Management Solution Proposals Deadline: 9:10am, January 12, 2016 Submit proposals to: Brown County Auditor 25 Market

More information

Chapter 6 Virtual Private Networking Using SSL Connections

Chapter 6 Virtual Private Networking Using SSL Connections Chapter 6 Virtual Private Networking Using SSL Connections The FVS336G ProSafe Dual WAN Gigabit Firewall with SSL & IPsec VPN provides a hardwarebased SSL VPN solution designed specifically to provide

More information

Optum Patient Portal. 70 Royal Little Drive. Providence, RI 02904. Copyright 2002-2013 Optum. All rights reserved. Updated: 3/7/13

Optum Patient Portal. 70 Royal Little Drive. Providence, RI 02904. Copyright 2002-2013 Optum. All rights reserved. Updated: 3/7/13 Optum Patient Portal 70 Royal Little Drive Providence, RI 02904 Copyright 2002-2013 Optum. All rights reserved. Updated: 3/7/13 Table of Contents 1 Patient Portal Activation...1 1.1 Pre-register a Patient...1

More information

1. Nuxeo DAM User Guide... 2 1.1 Nuxeo DAM Concepts... 2 1.2 Working with digital assets... 3 1.2.1 Import assets in Nuxeo DAM... 3 1.2.

1. Nuxeo DAM User Guide... 2 1.1 Nuxeo DAM Concepts... 2 1.2 Working with digital assets... 3 1.2.1 Import assets in Nuxeo DAM... 3 1.2. Nuxeo DAM User Guide...................................................................................... 2 1 Nuxeo DAM Concepts....................................................................................

More information

Easy Manage Helpdesk Guide version 5.4

Easy Manage Helpdesk Guide version 5.4 Easy Manage Helpdesk Guide version 5.4 Restricted Rights Legend COPYRIGHT Copyright 2011 by EZManage B.V. All rights reserved. No part of this publication or software may be reproduced, transmitted, stored

More information

Installation Instructions

Installation Instructions Installation Instructions 25 February 2014 SIAM AST Installation Instructions 2 Table of Contents Server Software Requirements... 3 Summary of the Installation Steps... 3 Application Access Levels... 3

More information

Papermule Workflow. Workflow and Asset Management Software. Papermule Ltd

Papermule Workflow. Workflow and Asset Management Software. Papermule Ltd Papermule Workflow Papermule Workflow - the power to specify adaptive and responsive workflows that let the business manage production problems in a resilient way. Workflow and Asset Management Software

More information

BlueJ Teamwork Tutorial

BlueJ Teamwork Tutorial BlueJ Teamwork Tutorial Version 2.0 for BlueJ Version 2.5.0 (and 2.2.x) Bruce Quig, Davin McCall School of Engineering & IT, Deakin University Contents 1 OVERVIEW... 3 2 SETTING UP A REPOSITORY... 3 3

More information

The Requirements Compliance Matrix columns are defined as follows:

The Requirements Compliance Matrix columns are defined as follows: 1 DETAILED REQUIREMENTS AND REQUIREMENTS COMPLIANCE The following s Compliance Matrices present the detailed requirements for the P&I System. Completion of all matrices is required; proposals submitted

More information

BillQuick Agent 2010 Getting Started Guide

BillQuick Agent 2010 Getting Started Guide Time Billing and Project Management Software Built With Your Industry Knowledge BillQuick Agent 2010 Getting Started Guide BQE Software, Inc. 2601 Airport Drive Suite 380 Torrance CA 90505 Support: (310)

More information

Citrix Access Gateway Plug-in for Windows User Guide

Citrix Access Gateway Plug-in for Windows User Guide Citrix Access Gateway Plug-in for Windows User Guide Access Gateway 9.2, Enterprise Edition Copyright and Trademark Notice Use of the product documented in this guide is subject to your prior acceptance

More information

Symantec Backup Exec 12.5 for Windows Servers. Quick Installation Guide

Symantec Backup Exec 12.5 for Windows Servers. Quick Installation Guide Symantec Backup Exec 12.5 for Windows Servers Quick Installation Guide 13897290 Installing Backup Exec This document includes the following topics: System requirements Before you install About the Backup

More information

EVault for Data Protection Manager. Course 361 Protecting Linux and UNIX with EVault

EVault for Data Protection Manager. Course 361 Protecting Linux and UNIX with EVault EVault for Data Protection Manager Course 361 Protecting Linux and UNIX with EVault Table of Contents Objectives... 3 Scenario... 3 Estimated Time to Complete This Lab... 3 Requirements for This Lab...

More information

Business Insight Report Authoring Getting Started Guide

Business Insight Report Authoring Getting Started Guide Business Insight Report Authoring Getting Started Guide Version: 6.6 Written by: Product Documentation, R&D Date: February 2011 ImageNow and CaptureNow are registered trademarks of Perceptive Software,

More information