Linked Data

Linked Data is about communities agreeing on the meaning of their data and sharing it in a massively networked information space. This vision is taking shape in many sectors, including e-commerce, medicine, scientific research, and government services. OCLC Research is a leader in driving this transformation in the library community.

Linked Data Research

The goal of the OCLC's research program is to advance the state of the art in modeling, storage, and delivery of linked data in the service of the library community's use cases: creating cleaner, more normalized, more machine-understandable data that works harder to connect library patrons to the resources they seek. Outcomes of our work include publications, presentations, webinars, datasets, and software demos that address researchers, engineers, and subject-matter experts engaged in the same work. The list on this page highlights key outcomes not discussed in Library Linked Data in the Cloud.

Explore all our work, or jump to a specific area below:

Experiments with Wikibase
International Image Interoperability Format (IIIF)
From Headings to Identities
Creating Linked Data for Use
Transition from MARC and Other Legacy Standards
Ontologies and Models

The International Image Interoperability Framework

IIIF (the International Image Interoperability Framework) is an emerging set of standards for sharing structural metadata about digital materials. Learn more about IIIF and OCLC Research's work with this framework on the IIIF: Improving the Interoperability of Digital Materials page.

How IIIF standards improve search and discovery for Cultural Heritage collections
By Jeff Mixter
15 October 2019

DLF Forum
Tampa, Florida, USA

IIIF is an emerging standard for sharing digital structural metadata. OCLC is an active member of the IIIF community and has been working to integrate the standard in is services/products. This talk discusses the experimental IIIF work being done by OCLC Research to help test evolving IIIF standards and help integrate them into production services.

Managing Entities in Wikibase

Lessons from Representing Library Metadata in OCLC Research’s Linked Data Wikibase Prototype (video)
By Karen Smith-Yoshimura
27 November 2019

Semantic Web in Libraries (SWIB) 2019
Hamburg, Germany

This presentation highlights key lessons from OCLC Research’s Linked Data Wikibase Prototype (“Project Passage”), a 10-month pilot done in 2018 in collaboration with metadata specialists in 16 US libraries.

Additional Materials:
PowerPoint Slides (11MB)

What are the entities that matter, and how much should we say about them?
By Jean Godby
13 November 2019

NISO Webinar: Implementing Library Linked Data
Virtual

This presentation discusses the work of catalogers who participated in OCLC's Project Passage in 2018. It develops the theme of identification of "the entities that matter" and concludes with a brief update on OCLC's post-Passage activities involving resource description in Wikibase.

Introducing the CONTENTdm Linked Data Pilot Project
By Jeff Mixter, Bruce Washburn
7 August 2019

CONTENTdm User Group Meeting
Indianapolis, IN, USA

The CONTENTdm Linked Data pilot explores how to convert CONTENTdm data into linked data, how to curate the data in the Wikibase infrastructure, and how to use the data to improve end-user experiences in CONTENTdm. This presentation covers the background research that led to the development of the pilot, the plans for the 3 phases of the pilot, and some early feedback from one of the pilot participants.

From Headings to Identities

From Authorities to Identifiers—Bridging the Silos
By Karen Smith-Yoshimura
3 December 2019

OCLC Research Mini-Symposium
Barcelona, Spain

In this keynote presentation, Karen Smith-Yoshimura explains how library practices are shifting from authority control to using identifiers and how identifiers can disambiguate and control names more expeditiously and make library data more web friendly. She notes the emergence of identity hubs to address identity management across domains, aggregating names from different types of resources.

File: pptx, 12MB Topics: Linked Data

OCLC Research Authorities Identifiers Annotated Bibliography Flyer

A reverse chronological annotated bibliography that provided an overview of OCLC Research’s work on authorities and identifiers.

National Strategy for Shareable Local Name Authorities National Forum: White Paper
White paper | March 2018
By Michele Casalini, Chew Chiat Naun, Chad Cluff, Michelle Durocher, Steven Folsom, Paul Frank, Janifer Gatenby, Jean Godby, Jason Kovari, Nancy Lorimer, Clifford Lynch, Peter Murray, Jeremy Myntti, Anna Neatrour, Cory Nimer, Suzanne Pilsk, Daniel Pitti, Isabel Quintana, Jing Wang, and Simeon Warner
The white paper identifies the need to increase the quality, quantity, and efficiency of workflows that produce disambiguated names of people, organizations, and other important entities associated with library resources. In the process, the practice of library authority control will be modernized to focus on the management of identities instead of authorized name strings. The result is more congruent with principles of linked data. Follow-up work is being conducted in the PCC Task Group on Identity Management in NACO (.pdf). Jean Godby and Diane Vizine-Goetz are participating.
An outcome of the IMLS grant Shareable Local Authorities awarded to Cornell University; Chew Chiat Naun and Jason Kovari, principal investigators.

Creating Linked Data for Use

Linking Your Data
Presentation (.pdf) | 2016
By Jeff Mixter
Mixter describes an editing environment that was prototyped with CONTENTdm users, which allows metadata subject-matter experts to improve collections of Dublin Core descriptions by normalizing or remediating text strings and associating them with URIs. Results are available either as enriched Dublin Core records, or as linked data expressed in Schema.org.
Presented at the Canadian Linked Data Summit, 24-25 October 2016, Montréal (Canada).

Looking Inside the Library Knowledge Vault
Works in Progress Webinar | 12 August 2015
By Bruce Washburn and Jeff Mixter
The webinar evaluates Google research as the basis for an experimental Library Knowledge Vault, a resource containing clean, authoritative data organized as entities and relationships and expressed as linked data. This resource is exposed through the EntityJS prototype, which supports a rich user experience for browsing and knowledge discovery. Experts can use EntityJS as an editing interface, thus enriching the Vault by filling in gaps and correcting errors.
Presented by OCLC Research.

Screenshot: Exploring ways to use linked data

"Exploring Ways to Use Linked Data" from the webinar Looking Inside the Library Knowledge Vault by OCLC Research, CC BY 4.0

The Transition from MARC and Other Legacy Standards

From Records to Things: Managing the Transition from Legacy Library Metadata to Linked Data
Article | 2017
By Jean Godby and Karen Smith-Yoshimura
Using translated works as an example, the authors show how MARC can be converted to linked data more easily if catalogers use MARC’s machine-understandable features to encode descriptions of important entities and relationships. To enact the changes, the Program for Cooperative Cataloging (PCC) can be enlisted to promote the recommendations of individuals to best practices that can be implemented on a large scale. This article is a follow-up to the November 2015 webinar How You Can Make the Transition from MARC to Linked Data Easier, which was a springboard for Jean’s participation in the PCC Task Group for URIs in MARC.
Published in the Bulletin of the Association for Information Science and Technology. 43 (2): 18-23.

"Making MARC 'Linked Data Ready'" from the webinar How You can make the Transition
from MARC to Linked Data Easier by OCLC Research, CC BY 4.0

Challenges of Mapping Digital Collections Metadata to Schema. org: Working with CONTENTdm
Conference Paper | 2016
By Patricia Lampron, Jeff Mixter and Myung-Ja K Han
This paper describes the challenges of mapping unique digital collections metadata to Schema.org and identifies lessons learned from linked data experiments on CONTENTdm metadata at the University of Illinois at Champaign-Urbana Library and OCLC.
Presentation at the Metadata and Semantics Research: 10th International Conference, MTSR 2016, Göttingen, Germany, November 22-25, 2016. Proceedings edited by Emmanouel Garoufallou, Imma Subirats Coll, Armando Stellato and Jane Greenberg,181-186. Cham, Switzerland.

Using a Common Model: Mapping VRA Core 4.0 Into an RDF Ontology
Article | 2014
By Jeff Mixter
Mixter describes a procedure for creating an RDF ontology for VRA Core, a vocabulary maintained by the Visual Resources Association and used for describing works of visual culture and associated images. The ontology is expressed in Schema.org, with extensions from several commonly used RDF vocabularies. This work is a foundation for Jeff’s continued involvement in the VRA community.
Published in the Journal of Library Metadata, Volume 14, 2014, Issue 1.

Ontologies and Models

BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
Presentation (.pptx) | 25 June 2017
By Jean Godby and Diane Vizine-Goetz
This presentation describes the alignment of OCLC's and LCs "work" clusters algorithmically derived from MARC bibliographic records and is a companion to OCLC's Work on Works (.ppt), which gives a brief overview of work descriptions in VIAF. Both presentations contain material that is described in the white paper (pdf) being prepared by the PCC SCS/LDAC Task Group on the Work Entity.
Presented at the BIBFRAME Update Forum at ALA Annual Conference, Chicago, IL (USA).

"OCLC's models of 'Work'" from BIBFRAME and OCLC Works by OCLC Research, CC BY 4.0

A Division of Labor: The Role of Schema.org in a Semantic Model of Library Resources
Book chapter (.pdf) | 2016
By Carol Jean Godby
Books, sound recordings, and other objects in a library collection can be described in ordinary language. But librarians also need a specialist’s language that defines concepts involved in their curation. Since current library linked data models are less focused on curation than the ordinary user’s experience of library resources, Schema.org (the language of webmasters) easily matches the expressiveness of ontologies designed for librarianship. This relationship is expected to change as curation models mature.
Published as part of Linked Data for Cultural Heritage, an ALCTS Monograph, edited by Ed Jones and Michele Seikel. Chicago, IL: ALA Editions, an imprint of the American Library Association.

Common Ground: Exploring Compatibilities Between the Linked Data Models of the Library of Congress and OCLC
OCLC Research Report (.pdf) | 2015
By Carol Jean Godby and Ray Denenberg
A joint statement with the Library of Congress discusses the high-level alignment between BIBFRAME and OCLC’s model of creative works based on Schema.org.
Published by Library of Congress and OCLC Research.

Describing Theses and Dissertations Using Schema.org
Conference paper (.pdf) | 2014
By Jeff Mixter, Patrick OBrien, and Kenning Arlitsch
The authors describe an extension vocabulary for expressing a model of theses and dissertations, using Schema.org as a foundation. This work is a deliverable from the IMLS grant Getting Found, awarded to Kenning Arlitsch, Dean of the Library, Montana State University.
Published in DCMI'14 Proceedings of the 2014 International Conference on Dublin Core and Metadata Applications, Austin, Texas, October 8-11, 2017.

Linked Data Blog Posts

Explore OCLC Research's Linked Data ideas, thoughts, data, and projects on our blog Hanging Together.

Read Linked Data Posts

OCLC linked data strategy

Expand influence and reach with linked data and OCLC

Learn more

Linked Data

Linked Data Research

The International Image Interoperability Framework

Managing Entities in Wikibase

From Headings to Identities

Creating Linked Data for Use

The Transition from MARC and Other Legacy Standards

Ontologies and Models

Linked Data Blog Posts

OCLC linked data strategy

Related OCLC Research Projects

Follow OCLC Research

OCLC Research

Related sites

Subscribe to our blog