Last updated on 21 April 2017

July 2002-October 2002

compiled by Michael Day (UKOLN, University of Bath) and Gerard Clifton (National Library of Australia)


This is a summary of selected recent activity in the field of digital preservation compiled from the Preserving Access to Digital Information (PADI) Gateway and the digital-preservation and padiforum-l mailing lists.


1. Organisations

1.1 IFLA and the International Publishers Association

IFLA (International Federation of Library Associations and Institutions) and the International Publishers Association (IPA) have issued a joint statement on the archiving and preservation of digital information. It advocates seven principles and recommendations for the long-term preservation of digital materials. Both organisations recognise the need for action and will seek to make long-term preservation a key agenda item internationally and the focus of standards development, research and support. Libraries are considered best placed to take responsibility for long-term preservation; arrangements with publishers and joint initiatives are forecast.

1.2 The Arts and Humanities Data Service

The Archaeology Data Service (ADS) has published two more of the good practice guides being produced by the UK Arts and Humanities Data Service (AHDS). The first of these concerns archaeological geophysics and the variety of data that it produces, including raw measurement data, processed data and interpretative drawings. The second guide looks at the use of computer-aided design (CAD) software in archaeological and architectural projects. The CAD guide is aimed at those who create CAD files, at project managers and at those considering the management of CAD archives.

2. Projects

2.1 InterPARES project

The international InterPARES project has published a report on the findings of the first phase of the project, which ran from 1999 to 2001. The report includes contributions from the project's task forces on authenticity, appraisal, preservation and strategy, as well as from the national research teams. Work on the second phase of InterPARES is continuing.

2.2 The Universal Virtual Computer (UVC)

An article published in the New York Times on 29 August 2002 publicised the Universal Virtual Computer (UVC) approach to the preservation of digital data developed by Raymond Lorie of the IBM Almaden Research Center. Lorie's approach is an alternative to emulation based on the creation of programs that can extract data from an associated bit stream and return it to the caller in an understandable way. The program is written for a Universal Virtual Computer, so all that should be needed in the future for executing the program is an interpreter of the UVC instructions. A pilot project undertaken by IBM Netherlands and the Koninklijke Bibliotheek was successful in using UVC techniques to render the data structures of PDF files.

The New York Times article describes the UVC concept and includes quotes from Jeff Rothenberg, Johan Steenbakkers and Lorie himself. Another news article, published in the March/April issue of IEEE Software, discusses the UVC concept in more detail and adds a review of the NARA-sponsored work on persistent archive technology that was undertaken by the San Diego Supercomputer Center. Lorie also delivered an update on the UVC approach in a paper delivered at the Joint Conference on Digital Libraries 2002 (JCDL 2002).
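The idea of pairing a bit stream with a decoding program written for a simple virtual machine can be illustrated with a toy example. The following is only a minimal sketch under stated assumptions: the seven-instruction machine, the names `run` and `DECODER`, and the run-length-encoded data format are all hypothetical and are not taken from Lorie's actual UVC specification. The point is simply that, once the decoder program is archived with the data, a future user needs only a small interpreter to recover the content.

```python
# Toy illustration of the "program plus interpreter" idea.
# NOTE: this instruction set and data format are invented for illustration;
# they are NOT the real UVC instruction set.

# Archived decoder program for a hypothetical run-length format:
# the stream is a sequence of (count, byte) pairs.
DECODER = [
    ("READ", 0),     # 0: r0 <- next byte of the stream (the count), or -1 at end
    ("JNEG", 0, 7),  # 1: end of stream reached, jump to HALT
    ("READ", 1),     # 2: r1 <- next byte of the stream (the value)
    ("JZ", 0, 0),    # 3: count used up, go back and read the next pair
    ("EMIT", 1),     # 4: return one decoded byte to the caller
    ("DEC", 0),      # 5: r0 <- r0 - 1
    ("JMP", 3),      # 6: repeat for the remaining count
    ("HALT",),       # 7: stop
]


def run(program, stream):
    """Interpret `program` against `stream` and return the decoded bytes.

    Assumes a well-formed stream (an even number of bytes).
    """
    regs = [0, 0]      # two general-purpose registers
    pos = 0            # read position in the archived bit stream
    pc = 0             # program counter
    out = bytearray()  # data handed back to the caller
    while True:
        op, *args = program[pc]
        pc += 1
        if op == "READ":
            if pos < len(stream):
                regs[args[0]] = stream[pos]
                pos += 1
            else:
                regs[args[0]] = -1
        elif op == "JNEG":
            if regs[args[0]] < 0:
                pc = args[1]
        elif op == "JZ":
            if regs[args[0]] == 0:
                pc = args[1]
        elif op == "EMIT":
            out.append(regs[args[0]])
        elif op == "DEC":
            regs[args[0]] -= 1
        elif op == "JMP":
            pc = args[0]
        elif op == "HALT":
            return bytes(out)
        else:
            raise ValueError(f"unknown instruction: {op}")


# Archived data: 3 x "a", 1 x "b", 2 x "c"
archived = bytes([3, ord("a"), 1, ord("b"), 2, ord("c")])
print(run(DECODER, archived))  # prints b'aaabcc'
```

In this sketch only `run` would ever need to be rewritten for a future platform; the decoder program and the data it interprets are stored unchanged.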

2.3 New reports from the Belgian DAVID project

DAVID (Digital Archiving in Flemish Institutions and Administrations) is a project on the digital preservation of records funded by the Fund for Scientific Research - Flanders (Belgium). The project is a co-operation between Antwerp City Archives and the Interdisciplinary Centre for Law and Informatics (ICRI) of the Katholieke Universiteit Leuven (KU Leuven).

Two reports have recently been made available in English from the DAVID Web site. One is a report by Filip Boudrez and Sofie Van den Eynde on the archiving of Web sites, first published in Dutch in May 2002. This report elaborates why Web sites need archiving, provides information on organisational and technical approaches to their collection and management, and looks at a range of legal issues (copyright, personal data, liability, etc.). The second report is the English version of a deliverable by the same authors on the archiving of e-mails, first published in Dutch in October 2001. This looks in detail at the technical and legal issues related to the preservation of e-mail records.

2.4 Reports on e-mail and XML from the Dutch Digital Preservation Testbed initiative

The Dutch Digital Preservation Testbed (Testbed Digitale Bewaring) initiative has recently published three items relating to XML and digital preservation in English:

2.5 Archiving the Avant Garde: Documenting and Preserving Variable Media Art

Over the past few years, museums and galleries have increasingly begun to collect and exhibit art based on digital technologies, including multimedia installations and Internet art. Some of these are 'born-digital' in that they are not based on any previously existing non-digital form. Following older initiatives like the Guggenheim Museum's Variable Media Initiative and Rhizome.org's ArtBase, a consortium project called "Archiving the Avant Garde" is now beginning to investigate some of the issues associated with the preservation and documentation of variable media artworks. Specifically, the project will attempt to "develop, document, and disseminate strategies for describing and preserving non-traditional, intermedia, and variable media art forms, such as performance, installation, conceptual, and digital art." The collaboration includes the Berkeley Art Museum and Pacific Film Archive, the Solomon R. Guggenheim Museum, the Walker Art Center, Rhizome.org, the Franklin Furnace Archive, and the Cleveland Performance Art Festival and Archive.

3. Events

A number of conferences and other events have taken place since the production of the last bulletin. Some of these were concerned entirely with digital preservation issues, while others were more general in scope but included some papers relevant to the subject. A number of these are listed here, together with links to relevant papers and event reports.

3.1 The State of Digital Preservation, Washington, D.C., 24-25 April 2002

The Council on Library and Information Resources (CLIR) has now published the proceedings of this important international conference. Papers include an overview of technological approaches to digital preservation (Kenneth Thibodeau) and a research agenda (Margaret Hedstrom). The volume also includes activity reports from OCLC (Meg Bellinger), the US National Digital Infrastructure Initiative (Laura Campbell), the National Library of the Netherlands (Titia van der Werf) and the National Library of Australia (Colin Webb). A final paper reflects on some recent developments (Donald Waters).

3.2 DPC Forum with Industry, London, UK, 5 June 2002

This web page provides links to a summary meeting report and presentations by speakers from both the public and private sectors, as presented at the Digital Preservation Coalition Forum with Industry, held in London on 5 June 2002. Topics covered long-term trends and research and development issues for preservation and access to digital material in both government and industry.

3.3 RLG Open Forum, Atlanta, Georgia, 16 June 2002

This page provides links to presentations made at the RLG Open Forum held as part of the American Library Association (ALA) Annual Conference in Atlanta, Georgia, USA. Topics include standards work in preservation metadata for various media, the Metadata Encoding and Transmission Standard (METS) and work with the Open Archival Information System (OAIS) reference model.

3.4 ACS National Meeting, Boston, Massachusetts, 18-22 August 2002

The 224th American Chemical Society (ACS) National Meeting was held in Boston on 18-22 August 2002. This year the meeting included a symposium on digital archiving. Presentations included descriptions of Elsevier Science's approach to digital preservation policies, partnership relations, technologies, etc., and an introduction to the DSpace institutional repository project of MIT Libraries.

3.5 IFLA 2002, Glasgow, UK, 18-24 August 2002

The 68th IFLA General Conference and Council was held on 18-24 August 2002 in Glasgow. There were a number of presentations on digital preservation issues on the main programme (these are listed below), but the conference also hosted an important announcement from the Koninklijke Bibliotheek (National Library of the Netherlands) and Elsevier Science. An agreement between the two organisations will allow the Koninklijke Bibliotheek (KB) to receive digital copies of approximately 1,500 journals covering all areas of science, technology and medicine for deposit into the KB's e-Deposit system. The agreement also allows the KB to provide access to the archived journals within the library itself, with permission to extend this to remote access if Elsevier or its successor/s cease providing the journals online.

Selected presentations:

3.6 ECDL 2002, Rome, Italy, 16-18 September 2002

Several papers at ECDL 2002 concerned digital preservation issues. A session on the opening day included papers describing parts of the Web archiving experiments undertaken in France and Austria. The paper on archiving the French Web concentrated on describing issues relating to acquisition, while the paper on the Austrian Online Archive (AOLA) was concerned more with the analysis of the resulting archives. Other relevant papers described the 'migration on request' technique developed by researchers involved in the CAMiLEON project and the DSpace repository built by MIT Libraries and Hewlett Packard Laboratories.

3.7 2nd ECDL Workshop on Web Archiving, Rome, Italy, 19 September 2002

Presentation slides and event reports from the 2nd ECDL Workshop on Web Archiving are now becoming available. The workshop concentrated on three main themes: technologies, collection policies and platforms for collaboration between Web archiving initiatives.

3.8 Erpanet Workshop on XML as a Preservation Strategy, Urbino, Italy, 9-11 October 2002

3.9 DC-2002, Florence, Italy, 13-17 October 2002

There was one paper with relevance to digital preservation given at the latest Dublin Core conference held in Florence. A team of researchers based in Japan gave a paper on how a collection management framework for digital resources might influence the metadata granularity required for digital resource management within a repository:

3.10 DPC Forum: Preservation of e-Learning Materials and Cost Models for Digital Preservation, London, 15 October 2002

4. Other recent publications

An article describing the phased approach taken by the University of North Texas (UNT) Libraries to ensure long-term access to its digital resources. It argues that formulating preservation policies is important, as is the creation and maintenance of preservation metadata for objects and collections.

A paper on the NCSTRL (Networked Computer Science Technical Reference Library), a US federation of digital libraries providing resources on computer science materials. The paper describes how the project team approached the migration of the NCSTRL collection to OAI (Open Archives Initiative) principles.

This is a report of the US General Accounting Office (GAO-02-586) focusing on the activities of the National Archives and Records Administration (NARA) with regard to the management and preservation of electronic records. This includes an evaluation of NARA's Electronic Records Archives (ERA) project. The appendices include a useful introduction to digital preservation strategies.

A short article describing the problems of understanding software code with little or no documentation. It ends with some suggestions as to how current developers could make code easier to work with in the future.

This is ICPSR's guide to the task of preparing data for deposit in a public archive. It is mostly focused on quantitative data generated by statistical software like SPSS or SAS.

This short document explains how collaboration between agencies, the National Library of Australia, the National Archives of Australia and the National Office for the Information Economy will help to ensure that online Commonwealth Government publications remain accessible. It provides advice on the steps that Commonwealth agencies are required or recommended to take, provides definitions of 'publications' and 'records,' and elaborates in more detail the roles of the three national organisations that seek to work with agencies.

This article describes the development of an e-print archive at the University of Glasgow, including the aims of the archive, decisions about implementation, metadata and presentation. The article also notes the role of the library in encouraging submissions, providing advice on copyright, and arranging deposit, conversion of formats and reviews of metadata quality.

This is a study of how American perceptions of the Internet were changed by the terrorist attacks on the United States on 11 September 2001. The report has two main strands. Firstly, it contains the results of a survey on how the attacks had affected opinions on the public availability of sensitive online information, i.e. that which could be of potential use to terrorists. Secondly, the report uses a preliminary analysis of archived Web resources (those that make up the "September 11 Web Archive", collected in association with the Internet Archive, the Library of Congress, et al.) to explain how use of the World Wide Web had evolved in response to the attacks.

This report is an e-commerce impact assessment of the UK publishing media commissioned by the UK Department of Trade and Industry (DTI) and co-managed by the DTI and UK Publishing Media (UKPM). Chapter 17 concerns digital media and recommends that the archiving of digital research should be organised at national level by the government.

This is the final report from a project initiated by the Special Collections Department of Edinburgh University Library. The project aimed to act as a focus for raising digital preservation issues across the whole university.

A progress report on Yale University Library and Elsevier Science's joint Digital Preservation Planning Project on the preservation of electronic journals, funded by the Andrew W. Mellon Foundation. The project sought to "understand better the scope and scale of digital journal preservation" and to "identify practical next steps" for a large collection of commercially published journals. The report recognises the need for co-operative efforts and standards and relationships between publishers and repositories, and discusses the building of a prototype archive.

This news article discusses the creation of online 'institutional repositories' to preserve and provide access to the intellectual output of institutions like universities. These outputs include published and unpublished research papers, data sets, field notes, images, etc. Models that are very briefly discussed in the article include MIT's DSpace project, the California Institute of Technology's digital repository, the University of California's Scholarly Repository, Ohio State University's Knowledge Bank, and the University of Southampton's initiative.

