Data Science & Metadata Research
To be discoverable by today’s online users, traditional library data must be transformed. OCLC Research analyzes bibliographic data to derive new meaning, insights, and services for use by libraries and information seekers. This work includes special projects, data science research, engagement with metadata communities, publications and presentations, and the creation of illustrative experimental applications.
Applications
Please note: these are demonstrations. If something is not working, please let us know and try again later.
ArchiveGrid
ArchiveGrid is a growing collection of over two million archival material descriptions. It provides a foundation for OCLC Research collaboration and interactions with the archival community, and serves as the basis for our experimentation and testing in text mining, data analysis, and discovery system applications and interfaces.
Learn more »
assignFAST
assignFAST is a Web interface for FAST subject selection. It explores using autosuggest technology to automate the manual selection of Authorized and Use For headings.
Learn more »
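To illustrate the autosuggest idea behind assignFAST, the following is a minimal Python sketch of prefix matching over a small, made-up list of headings. The sample data, the `suggest` function, and the pairing of each entry term with an authorized heading are assumptions for illustration only; they do not reflect the actual assignFAST service or its API.

```python
from bisect import bisect_left

# Hypothetical sample data: (entry term, authorized heading).
# An entry whose term differs from its heading plays the role of a "Use For" reference.
ENTRIES = [
    ("Cookery", "Cooking"),
    ("Cooking", "Cooking"),
    ("Cooking (Bread)", "Cooking (Bread)"),
    ("Movies", "Motion pictures"),
    ("Motion pictures", "Motion pictures"),
]

# Sort once by case-folded term so prefix lookups can use binary search.
_INDEX = sorted((term.casefold(), term, heading) for term, heading in ENTRIES)
_KEYS = [key for key, _, _ in _INDEX]


def suggest(prefix, limit=5):
    """Return up to `limit` (matched term, authorized heading) pairs for a typed prefix."""
    needle = prefix.casefold()
    start = bisect_left(_KEYS, needle)  # first entry that could match the prefix
    results = []
    for key, term, heading in _INDEX[start:]:
        if len(results) >= limit or not key.startswith(needle):
            break
        results.append((term, heading))
    return results


print(suggest("cook"))
print(suggest("mov"))
```

Typing "mov" in this sketch matches the entry term "Movies" and offers the heading "Motion pictures", which is the kind of redirection a Use For reference provides.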
Classify
Classify is a FRBR-based prototype designed to support the assignment of classification numbers and subject headings for books, DVDs, CDs, and other types of materials.
Learn more »
Cookbook Finder
Cookbook Finder is a works-based application that provides access to thousands of cookbooks and other works about food and nutrition described in library records. You can search by person, place, or topic (e.g., course, ingredient, method) and browse related works by author and topic (supplied by the Kindred Works/Recommender API). Results include links to full text when available from HathiTrust and Project Gutenberg.
Learn more »
FAST Converter
The FAST Converter is a Web interface for converting LCSH headings to FAST headings. Either single headings or small sets of bibliographic records can be converted. This Web site is intended as a learning tool that familiarizes users with FAST and the differences between FAST and LCSH.
Learn more »
FictionFinder: A FRBR-based Prototype for Fiction in WorldCat
FictionFinder is a FRBR-based prototype that provides access to over 2.9 million bibliographic records for fiction books, eBooks, and audio materials described in OCLC WorldCat.
Learn more »
IIIF Explorer
The IIIF Explorer is an OCLC ResearchWorks prototype that uses data from the IIIF Image and Presentation Manifest APIs to provide a single index of IIIF API-accessible images from CONTENTdm hosted collections, with a user interface for searching and an embedded Project Mirador viewer for examining images in great detail. There are currently over 11 million images accessible through the prototype.
Learn more »
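As a rough illustration of the two IIIF APIs mentioned for the IIIF Explorer, the Python sketch below builds an Image API request URL and collects image service IDs from a manifest. It assumes a Presentation API 2.x style manifest in which each image resource carries a single `service` object. The function names, the `example.org` URLs, and the sample manifest are hypothetical and are not taken from the prototype.

```python
from urllib.parse import quote


def image_url(service_base, identifier, region="full", size="max",
              rotation="0", quality="default", fmt="jpg"):
    """Build an IIIF Image API request URL of the form
    {base}/{identifier}/{region}/{size}/{rotation}/{quality}.{format}
    """
    ident = quote(identifier, safe="")  # slashes in identifiers must be percent-encoded
    return f"{service_base.rstrip('/')}/{ident}/{region}/{size}/{rotation}/{quality}.{fmt}"


def image_services(manifest):
    """Collect image service IDs from a Presentation API 2.x style manifest dict."""
    ids = []
    for sequence in manifest.get("sequences", []):
        for canvas in sequence.get("canvases", []):
            for image in canvas.get("images", []):
                service = image.get("resource", {}).get("service", {})
                # Assumption: one service object per resource (some manifests use a list).
                if isinstance(service, dict) and "@id" in service:
                    ids.append(service["@id"])
    return ids


# Hypothetical manifest fragment, for illustration only.
SAMPLE_MANIFEST = {
    "sequences": [{
        "canvases": [
            {"images": [{"resource": {"service": {"@id": "https://example.org/iiif/img1"}}}]},
            {"images": [{"resource": {"service": {"@id": "https://example.org/iiif/img2"}}}]},
        ]
    }]
}

print(image_services(SAMPLE_MANIFEST))
print(image_url("https://example.org/iiif/", "coll/42", size="400,"))
```

A single index such as the one described above could, in principle, be built by running a function like `image_services` over many manifests, although the prototype's actual indexing approach is not documented here.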
Info URI
The "info" URI Registry was set up on behalf of NISO to identify and describe registered "info" URIs.
Learn more »
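For readers unfamiliar with the shape of an "info" URI, here is a small Python sketch that splits one into its namespace and identifier, following the general form `info:namespace/identifier`. The `parse_info_uri` function and the `InfoURI` type are hypothetical names introduced for this example, the sample URIs are illustrative, and the sketch does not check whether a namespace is actually registered or treat fragments separately.

```python
from typing import NamedTuple


class InfoURI(NamedTuple):
    namespace: str
    identifier: str


def parse_info_uri(uri):
    """Split an "info" URI into (namespace, identifier).

    Assumes the general form info:namespace/identifier; the scheme and
    namespace are compared case-insensitively and returned in lowercase.
    """
    scheme, colon, rest = uri.partition(":")
    if not colon or scheme.lower() != "info":
        raise ValueError(f"not an info URI: {uri!r}")
    namespace, slash, identifier = rest.partition("/")
    if not slash or not namespace or not identifier:
        raise ValueError(f"missing namespace or identifier: {uri!r}")
    return InfoURI(namespace.lower(), identifier)


print(parse_info_uri("info:lccn/2002022641"))
print(parse_info_uri("info:ddc/22/eng//004.678"))
```

Only the first slash separates the namespace from the identifier, so any later slashes remain part of the identifier.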
Kindred Works
Kindred Works is a demonstration interface built upon an experimental content-based recommender service. Various characteristics associated with a sample resource, such as classification numbers, subject headings, and genre terms, are matched to WorldCat to provide a list of recommendations.
Learn more »
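The matching of characteristics described for Kindred Works can be pictured with a simple content-based sketch in Python. This example scores overlap between feature sets using Jaccard similarity, which is a stand-in technique chosen for illustration and not necessarily what the experimental recommender service uses. The feature strings, the catalog entries, and the `recommend` function are all hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity: size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(sample_features, catalog, top_n=3):
    """Rank catalog works by feature overlap with the sample resource."""
    scored = [(jaccard(sample_features, feats), title) for title, feats in catalog.items()]
    scored = [(score, title) for score, title in scored if score > 0]  # drop non-matches
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # best score first, then by title
    return [(title, score) for score, title in scored[:top_n]]


# Hypothetical sample resource and catalog, for illustration only.
SAMPLE = {"class:641.5", "subject:Cooking", "genre:Cookbooks"}
CATALOG = {
    "Work A": {"class:641.5", "subject:Cooking", "genre:Cookbooks"},
    "Work B": {"class:641.5", "subject:Baking"},
    "Work C": {"class:813", "genre:Fiction"},
}

print(recommend(SAMPLE, CATALOG))
```

In this sketch a work sharing all three characteristics ranks first, a work sharing only the classification number ranks lower, and a work with nothing in common is left out.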
The NDLTD Union Catalog
The NDLTD Union Catalog project focused on thesis metadata via the Open Archives Initiative's Protocol for Metadata Harvesting (OAI-PMH). OAI-PMH is a lightweight protocol for moving or sharing metadata that allows synchronization of loosely coupled databases and mandates XML Dublin Core as the default metadata format.
Learn more »
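To show what an OAI-PMH exchange looks like in practice, the Python sketch below builds a `ListRecords` request for Dublin Core (`oai_dc`) metadata and extracts identifiers and titles from a response. The base URL, the sample response, and the function names are assumptions made for this example; a real harvester would also need to fetch the URL and follow resumption tokens, which this sketch leaves out.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespaces used by OAI-PMH 2.0 responses carrying Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None):
    """Build a ListRecords request URL for a repository's base URL."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date  # selective harvesting by datestamp
    return f"{base_url}?{urlencode(params)}"


def parse_records(xml_text):
    """Extract the identifier and Dublin Core titles from each record."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        titles = [el.text for el in record.iterfind(".//dc:title", NS)]
        records.append({"identifier": identifier, "titles": titles})
    return records


# Hypothetical, trimmed-down response for illustration only.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:thesis-1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An Example Thesis</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

print(list_records_url("https://example.org/oai", from_date="2020-01-01"))
print(parse_records(SAMPLE_RESPONSE))
```

Because every conforming repository must support `oai_dc`, a harvester written along these lines can request the same metadata format from any participating source.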
Genres
Genre profiles allow users to browse genre terms for hundreds of titles, authors, subjects, characters, places, and more, ranked by popularity in WorldCat.
Learn more »
WorldCat Identities
WorldCat Identities has a summary page for every name in WorldCat.
Learn more »