Preservation Health Check
#phcpilot
Impact
The desired impact is to help libraries and archives in materializing their digital preservation duties and responsibilities. This pilot will:
- demonstrate the value of producing and maintaining preservation metadata,
- help institutions to gain better insight in the need for preservation metadata,
- help institutions to make informed choices about which metadata are useful to produce and which are less useful,
- demonstrate the use of tools that can identify and validate file formats and extract technical metadata, and
- testify to the importance of using the latest versions of such tools and of keeping registries up to date, as sources of information about file formats.
These outcomes will provide useful feed-back to the further development of standards, best practices and tools and repository certification efforts.
The pilot will deliver individual private Preservation Health Check reports to the pilot sites and an overall public report with recommendations on:
- the value and usefulness of preservation metadata;
- the necessary measures and steps to improve the quality of preservation metadata;
- the necessary measures to improve the quality of tools in use;
- the need for regular health checks, based on preservation metadata; and
- the basic parameters/pre-conditions for setting-up a preservation health check service.
Interim results will be published on the OCLC Research and OPF web pages.
Progress and Outputs
2014
- Report: Preservation Health Check: Monitoring Threats to Digital Repository Content, by Wouter Kool, Brian Lavoie and Titia van der Werf #phcpilot
- 8.5x11" format (.pdf: 307K/20pp.)
- A4 format (.pdf: 302K/20pp.)
2013
- OCLC Research gave an update of the Preservation Health Check Pilot at the PREMIS Implementation Fair (iPres2013) on 5 September 2013 at the Lisbon Technical University.
- See the Preservation Health Check: Work in Progress presentation (.pdf: 1.5MB/24 pp.) [embedded above on this page] given by Titia van der Werf at that event.
- Also see the agenda and links to the other slides that were presented at the Fair.
- The Bibliothèque nationale de France (BnF), a charter member of OPF, agreed to become a pilot site for the Preservation Health Check Pilot. In June 2013 the BnF shipped a copy of its preservation metadata from the BnF SPAR system to OCLC Research in Leiden.
2012
- OCLC Research and OPF held a Preservation Health Check Workshop at the PREMIS Implementation Fair (iPres2012) on 2 October 2012 at the University of Toronto.
- See the Preservation Health Check iPRES Workshop report (.pdf: 415K/1 pp.) for an overview of this event.
- Also see the following slides that were presented at the event:
- Introduction to the Pilot slides by Titia van der Werf (.pdf: 281K/16 pp.)
- Standards, Risk Analysis, Common Sense and Evidence slides by Bram van der Werf (.pdf: 496K/10 pp.)
- BnF and OCLC/OPF Pilot: Why BnF is Interested slides by Sébastien Peyrard (.pdf: 843K/10 pp.)
- The SPOT Model for Risk Assessment slides by Priscilla Caplan (.pdf: 191K/9 pp.)
- Preservation Metadata as an Evidence Base for Risk Assessment slides by Brian Lavoie (.pdf: 670K/15 pp.)
- Preservation Health Check Plenary Discussion slides by Titia van der Werf (.pdf: 195K/3 pp.)
- Introduction to the Pilot slides by Titia van der Werf (.pdf: 281K/16 pp.)
OCLC Research and the Open Planets Foundation (OPF) are conducting a Preservation Health Check pilot to analyze the quality of preservation metadata created and in use by operational repository and deposit systems and evaluate the potential of such metadata for assessing digital preservation risks.
OPF, representing the digital preservation community needs, will contribute pilot sites and datasets, provide feed-back to interim-research findings and organize workshops/hackathons to disseminate and advance the take-up of research findings.
OCLC Research, with expertise in preservation metadata and skills in risk assessment, will design the research methodology, carry out the research activities, provide the technical infrastructure, develop data analysis tools and risk assessment methods and contribute to the dissemination of results.
Other parties will be involved during the pilot, in particular maintainers of preservation metadata schemas (LoC), format registration tools (UK National Archive) and risk assessment tools (DCC, NARA, NESTOR).
Background
An important function of preservation metadata is to understand what exactly is in the repository and to provide information that enables periodic check-ups and screenings for risks to long-term access. Several schemas (PREMIS, MIXED, etc.), best practices, tools and registries (PRONOM, JHOVE, DROID, UDFR, etc.) for preservation metadata have been developed in the past 15 years and are in use by most repositories. Preservation risk assessment toolkits and checklists (DRAMBORA, TRAC, etc.) and standards (Metrics for digital repository audit and certification, CCSDS 2009, etc.) have been devised to help repositories assess the preservation risks they run. However, there is little evidence that such risk assessment has become a part of the preservation management process of repositories, nor is there evidence that assessment results are fed back into the development of standards, best practices and tools.
OPF, which represents major libraries and archives with a long-term access mandate, has identified a shared need for supporting repositories in carrying out their preservation management tasks. One of these tasks is to perform regular preservation risk assessment (health check), a task that could be offered as a service, independently from specific repository systems in use.
Team Members
- Brian Lavoie, OCLC Research Scientist
- Titia van der Werf, OCLC Research Senior Program Officer
- Wooter Kool, OCLC EMEA Metadata Specialist
Most recent updates: Page content: 2013-10-11