English

DevConnect Online 2020
MAY 20

Building an OpenRefine Reconciliation Endpoint for a Wikibase project: Lessons Learned

This webinar will review the system architecture for the CONTENTdm Wikibase project, describe the Wikidata OpenRefine endpoint, update progress from the W3C community group, and explore the OCLC alternative OpenRefine endpoint in more detail, with a discussion of lessons learned.

This event has passed.

OCLC is carrying out a pilot project to evaluate Wikibase as a metadata management and discovery environment for data from CONTENTdm digital content management systems.

An important part of this pilot is “reconciliation” – matching the text strings for people, organizations, concepts, places, and events to their corresponding entities in the Wikibase. Recognizing that OpenRefine is a widely used tool for reconciling metadata, the Wikidata community has developed an OpenRefine-compatible API endpoint which can be modified to work with any Wikibase.

After adapting that application for our project, we looked for an alternative that had fewer dependencies on the Wikibase system infrastructure, that could increase the speed with which matching is done, and could give us more control over ranking results.

This investigation is also in support of the W3C Entity Reconciliation Community Group which is developing an entity reconciliation standard.

This webinar will review the system architecture for the CONTENTdm Wikibase project, describe the Wikidata OpenRefine endpoint, update progress from the W3C community group, and explore the OCLC alternative OpenRefine endpoint in more detail, with a discussion of lessons learned.

View Recording

View Slides

Date

20 May 2020

Time

11:00 AM – 12:00 PM
Eastern Daylight Time, North America [UTC -4]