Data Science

The internet is the native environment of information seekers. OCLC Research recognizes that to be integrated into the internet, traditional library data must be transformed in various ways. We are analyzing the data in WorldCat and other sources to derive new meaning, insights, and services for use by libraries and others on the internet. Our work includes:

Projects

Ariadne's Thread: Interactive Context Explorer for Bibliographic Data

Ariadne's Thread visualizes the networks of entities associated with bibliographic records and lets users interactively explore the local context of the entities that interest them.
Learn more »


CatVis: Visual Analytics for the World's Library Data

The CatVis project addresses the following questions: How can librarians use data visualizations to manage, analyze, and present library collections? How can visualizations of large bibliographic datasets and other complex data help researchers in the e-Humanities to ask and answer new research questions?
Learn more »


Classify

Classify is a FRBR-based prototype designed to support the assignment of classification numbers and subject headings for books, DVDs, CDs, and other types of materials.
Learn more »


Cookbook Finder

Cookbook Finder is a works-based application that provides access to thousands of cookbooks and other works about food and nutrition described in library records. You can search by person, place, or topic (e.g., course, ingredient, or method) and browse related works by author and topic (supplied by the Kindred Works/Recommender API). Results include links to the full text when it is available from HathiTrust or Project Gutenberg.
Learn more »


FAST (Faceted Application of Subject Terminology)

FAST is an enumerative, faceted subject heading schema derived from the Library of Congress Subject Headings (LCSH). FAST is designed to be easier to apply than LCSH and can be used successfully by non-professionals.
Learn more »


IIIF: Improving the Interoperability of Digital Materials

IIIF (the International Image Interoperability Framework) is an emerging set of standards for sharing structural metadata about digital materials. Learn more about IIIF and OCLC Research's work with this framework.


Kindred Works

Kindred Works is a demonstration interface built upon an experimental content-based recommender service. Various characteristics associated with a sample resource, such as classification numbers, subject headings, and genre terms, are matched to WorldCat to provide a list of recommendations.
Learn more »
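
As a rough illustration of the content-based matching idea described above, the following Python sketch scores candidate works by how many characteristics (classification numbers, subject headings, genre terms) they share with a sample resource. It is a minimal sketch under stated assumptions, not the Kindred Works implementation: the Jaccard overlap measure, the `recommend` function, the `class:`/`subject:`/`genre:` labels, and the tiny catalog are all hypothetical and chosen only for the example.

```python
from typing import Dict, List, Set, Tuple


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Share of characteristics two resources have in common (0.0 to 1.0)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def recommend(
    sample: Set[str], catalog: Dict[str, Set[str]], top_n: int = 3
) -> List[Tuple[str, float]]:
    """Rank catalog works by overlap with the sample's characteristics."""
    scored = [(title, jaccard(sample, feats)) for title, feats in catalog.items()]
    # Drop works with nothing in common with the sample.
    scored = [(title, score) for title, score in scored if score > 0]
    # Highest score first; ties broken alphabetically by title.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:top_n]


# Hypothetical sample resource and catalog (illustrative values only).
sample = {"class:641.5", "subject:Cooking, French", "genre:Cookbooks"}
catalog = {
    "Work A": {"class:641.5", "subject:Cooking, French", "genre:Cookbooks"},
    "Work B": {"class:641.5", "subject:Cooking, Italian", "genre:Cookbooks"},
    "Work C": {"class:813.54", "genre:Fiction"},
}

results = recommend(sample, catalog)
for title, score in results:
    print(f"{title}: {score:.2f}")
```

Here "Work A" shares every characteristic with the sample, "Work B" shares two of four distinct characteristics, and "Work C" shares none and is dropped. A production recommender would likely weight characteristics differently and match against a far larger catalog.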


OCLC Linked Data Research

OCLC production units and OCLC Research support the collaborative and the larger community through Linked Data research and standards activities, and are exploring Linked Data applications.
Learn more »


Multilingual Bibliographic Structure

This activity is designed to leverage the multilingual content of WorldCat® so that bibliographic information can be presented in the preferred language and script of the user.
Learn more »


What in the WorldCat

At OCLC Research, we're exploring records and mining data from WorldCat, the world's largest library catalog, to highlight interesting and different views of the world's library collections.
Learn more »


WorldCat Identities

WorldCat Identities has a summary page for every name in WorldCat.
Learn more »


WorldCat Identities Network

The WorldCat Identities Network gives users the opportunity to visually explore the interconnectivity and relationships between WorldCat Identities.
Learn more »


Completed Projects

OCLC Research has many recently completed projects you can explore and use in your research.

View Completed Projects >

OCLC Research Archive

OCLC Research continually evolves what we investigate, research, and report on as the field's needs change. For historical project information, explore the OCLC Research Archive, which holds a wealth of information about the work OCLC has produced over the decades.

Access the OCLC Research Archive >