Data Science

The internet is the native environment of information seekers. OCLC Research recognizes that to be integrated into the internet, traditional library data must be transformed in various ways. We are analyzing the data in WorldCat and other sources to derive new meaning, insights, and services for use by libraries and others on the internet. Our work includes:

Applications

Please note: these are demonstrations. If something is not working, let us know and please come back and try again later.

ArchiveGrid

ArchiveGrid is a growing collection of over two million archival material descriptions that provides a foundation for OCLC Research collaboration and interactions with the archival community, and also serves as the basis for our experimentation and testing in text mining, data analysis, and discovery system applications and interfaces.
Learn more »

 


assignFAST

A Web interface for FAST Subject selection, assignFAST explores automating the manual selection of the Authorized and Use For headings based on autosuggest technology.
Learn more »

 


Classify

Classify is a FRBR-based prototype designed to support the assignment of classification numbers and subject headings for books, DVDs, CDs, and other types of materials.
Learn more »

 


Cookbook Finder

Cookbook Finder is a works-based application that provides access to thousands of cookbooks and other works about food and nutrition described in library records. You can search by person, place, topic (e.g., course, ingredient, method, and more) and browse related works by author and topic (supplied by the Kindred Works/Recommender API). Results include links to full-text when available from HathiTrust and Project Gutenberg.
Learn more »

 


FAST Converter

The FAST Converter is a Web interface for the conversion of LCSH headings to FAST headings. Either single headings or small sets of bibliographic records can be converted. The intent of this Web site is to provide a learning tool to help familiarize users with FAST and the differences between FAST and LCSH.
Learn more »

 


FictionFinder: A FRBR-based Prototype for Fiction in WorldCat

FictionFinder is a FRBR-based prototype that provides access to over 2.9 million bibliographic records for fiction books, eBooks, and audio materials described in OCLC WorldCat.
Learn more »

 


info URI Registry

The "info" URI Registry was set up on behalf of NISO to identify and describe registered "info" URIs.
Learn more »

 


Kindred Works

Kindred Works is a demonstration interface built upon an experimental content-based recommender service. Various characteristics associated with a sample resource, such as classification numbers, subject headings, and genre terms, are matched to WorldCat to provide a list of recommendations.
Learn more »

 


The NDLTD Union Catalog

The NDLTD Union Catalog project focused on thesis metadata via the Open Archives Initiative's Protocol for Metadata Harvesting (OAI-PMH). This was a lightweight protocol for moving or sharing metadata that allowed synchronization of loosely coupled databases and mandates XML Dublin Core as the default metadata format.
Learn more »

 


Genres

Genre profiles allow users to browse genre terms for hundreds of titles, authors, subjects, characters, places, and more, ranked by popularity in WorldCat.
Learn more »

 


WorldCat Identities

WorldCat Identities has a summary page for every name in WorldCat.
Learn more »