Harvesting metadata via the Web

This activity is now closed. The information on this page is provided for historical purposes only.

Goal

The goal of the Metadata Switch Harvesting project is to harvest, over the Web, metadata that complies with the Open Archives Initiative's Protocol for Metadata Harvesting (OAI-PMH) and to produce an integrated catalog from the harvested metadata. This project is one component of the OCLC Research Metadata Switch project.
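As an illustration of what OAI-PMH harvesting involves, the following minimal Python sketch builds a ListRecords request URL and extracts identifiers, Dublin Core titles, and the resumption token from a response. It is not the project's own code. The repository address http://example.org/oai, the sample response, and the function names are hypothetical; only the verb, the request parameters, and the XML namespaces come from the OAI-PMH 2.0 specification.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
OAI_NS = "http://www.openarchives.org/OAI/2.0/"
DC_NS = "http://purl.org/dc/elements/1.1/"


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL (hypothetical helper).

    Per the protocol, resumptionToken is an exclusive argument: when it is
    present, metadataPrefix must not be sent alongside it.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iter(f"{{{OAI_NS}}}record"):
        identifier = rec.findtext(f"{{{OAI_NS}}}header/{{{OAI_NS}}}identifier")
        titles = [t.text for t in rec.iter(f"{{{DC_NS}}}title")]
        records.append({"identifier": identifier, "titles": titles})
    token = root.findtext(f".//{{{OAI_NS}}}resumptionToken")
    # An empty or missing token means the list is complete.
    return records, (token.strip() if token and token.strip() else None)


# Invented sample response, trimmed to the elements the parser reads.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A Sample Thesis</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

first_url = list_records_url("http://example.org/oai")
records, token = parse_list_records(SAMPLE_RESPONSE)
next_url = list_records_url("http://example.org/oai", resumption_token=token)
```

A harvester would fetch `first_url`, parse the response, and keep requesting `next_url` until no resumption token is returned; the fetching step is left out here so the sketch runs without network access.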

Overview

We will investigate and attempt to solve the problems inherent in integrating a diverse set of metadata resources. Our goal is to produce the most useful aggregate of harvested academic metadata available.

Research methodology

Our work building a union catalog of theses, part of our Networked Digital Library of Theses and Dissertations (NDLTD) activities, will be expanded to cover not only theses but also other harvestable academic material. Where possible, we will encourage the formation of communities that actively participate in creating metadata for harvesting.

We will employ Web services developed for other Metadata Switch activities to create authoritative forms wherever possible, experiment with embedding services within records (e.g. links to a resolver service), and make the records both searchable and harvestable by others. Data collected by harvesting will be used to develop and extend these services.

Duration

The NDLTD work is already underway. We expect to have a significantly more diverse collection available by summer 2003. The total duration of the project is expected to span 24 months.

Relationship to other projects

OCLC

Other

Research team
