OCLC Research will harvest DSpace metadata

 
OCLC Research will periodically harvest OAI-compliant metadata from the institutional repositories of interested DSpace users. OCLC Research will convert the harvested metadata into a format suitable for re-harvesting by non-OAI services.

Much of the scholarly material on the Web is missed by harvesters. This includes metadata in OAI-PMH repositories, which DSpace uses. Google has several problems harvesting OAI repositories, which are different from standard Web pages.

The standard DSpace uses the Handle system (www.handle.net) for identifying items, which (purposely) mask the identity of the host, making harvesting difficult to schedule. The OAI protocol uses possibly non-persistent URLs to link pages of metadata. This also interferes with standard methods of harvesting.

OCLC Research is working with Google and MIT to periodically harvest interested DSpace users' metadata and transform it into a harvest-friendly format, resolve the handles so that institutions can be identified, and make the resulting URLs harvestable by search services such as Google.

For more information:

Bob Bolander
Communications & Programs Manager
OCLC Research
bolander@oclc.org
+1-614-761-5207

We are a worldwide library cooperative, owned, governed and sustained by members since 1967. Our public purpose is a statement of commitment to each other—that we will work together to improve access to the information held in libraries around the globe, and find ways to reduce costs for libraries through collaboration.