Menu
Search

Data Science & Metadata Research

To be discoverable by today’s online users, traditional library data must be transformed. OCLC Research analyzes bibliographic data to derive new meaning, insights, and services for use by library and information seekers. This work includes special projects in metadata enrichment, authorities & identities, linked data, subjects & classification, and data analysis.

Presentations

An Innovative Approach to Scalable Semantic Embedding

An Innovative Approach to Scalable Semantic Embedding

By Shenghui Wang, Rob Koopman

AIDR 2019: Artificial Intelligence for Data Discovery and Reuse
Pittsburgh, Pennsylvania, USA

Semantic search, in addition to keyword based search, is a desirable feature for many digital library systems. Even in the largely structured library data world, there is still a lot of tacit information locked in the free-text fields. Embedding words and texts in compact, semantically meaningful vector spaces allows for computable semantic similarity/relatedness which would make search more intelligent.

Topics: Semantic Embedding

Ideation to Prototype: Turning new ideas into useful services

Ideation to Prototype: Turning new ideas into useful services

By Andrew Pace

LD4 Conference on Linked Data in Libraries
Boston, Massachusetts, USA

Using the Wikibase Linked Data Prototype as an example, Pace will outline 5 simple steps for managing a complex project that will improve your chances for getting from an experiment to a production service.

Topics: Linked Data

Taking Advantage of Multilingualism Support in Wikidata

Taking Advantage of Multilingualism Support in Wikidata

By Karen Smith-Yoshimura and Xiaioli Li

LD4 Conference on Linked Data in Libraries
Boston, MA (USA)

View highlights of some key lessons from the OCLC Research Linked Data Wikibase Prototype (“Project Passage”) regarding Wikidata’s multilingualism support.

Topics: Wikimedia, Linked Data

Digging into the Research: An Overview of Models and Networks

Adoption and Use of IIIF for Digital Resource Sharing in CONTENTdm

By Shane Huddleston, Jeff Mixter

Best Practices Exchange 2019 Conference
Columbus, OH (USA)

Huddleston and Mixter provide an overview of IIIF Application Programming Interfaces (APIs), and how OCLC is using them across services, as well as our work in supporting standards with other organizations.

Topics: IIIF

Amplifying Metadata as Entities to Support Multilingualism

Amplifying Metadata as Entities to Support Multilingualism

By Karen Smith-Yoshimura

OCLC EMEA Regional Council Meeting
Marseille (France)

Karen Smith-Yoshimura provides an overview of the "Project Passage" linked data Wikibase prototype and a dive into use cases using the prototype for multilingualism.  

Topics: Linked Data, Metadata

Wikidata Lessons Learned

Wikidata Lessons Learned

By Jeff Mixter

code4lib
San Jose, CA (USA)

Jeff Mixter shares lessons learned during the Linked Data Wikibase Prototype pilot project, including features of MediaWiki and Wikibase and a variety of use cases.

Topics: Linked Data

OCLC Research Update: Emerging Trends

OCLC Research Update: Emerging Trends

By Lynn Silipigni-Connaway, Betha Gutsche, and Karen Smith-Yoshimura

ALA Midwinter
Seattle, WA (USA)

Lynn Silipigni Connaway provides overviews of several active projects, Karen Smith-Yoshimura presents emerging trends in linked data that were revealed in a recent survey, and Betha Gutsche shares the inspirational transformations that small public libraries made to their libraries as part of an IMLS grant-funded project. Watch a recording of the update on YouTube.

Topics: Linked Data, WebJunction

From Prototype to Production: Turning Good Ideas into Useful Library Services

From Prototype to Production: Turning Good Ideas into Useful Library Services

By Andrew K. Pace and Holly Tomren

CNI Fall 2018 Membership Meeting
Washington, DC (USA)

This session explores two projects at different points of the prototype-to-production workflow: IIIF (integration of the International Image Interoperability Framework) into a digital discovery environment; and a Linked Data Wikibase prototype (reconciliation tool and editor built to match library metadata workflows) transitioning to production. Temple University presents from the experimenter and practitioner point of view.

Topics: IIIF, Linked Data

CONTENTdm IIIF Discovery API

CONTENTdm IIIF Discovery API

By Jeff Mixter

IIIF Technical Meeting
Edinburgh, Scotland (UK)

Jeff Mixter details how OCLC Research built a Change Discovery API for all 14.4 million CONTENTdm items using the current IIIF Change Discovery API v0.2 spec, as well as current and future plans for the service.

Topics: IIIF