An analysis of crosswalks from research data schemas to Schema.org

Mingfang Wu, Stephen M. Richard, Chantelle Verhey, Leyla Jael Castro, Baptiste Cecconi, Nick Juty

Research output: Contribution to journalArticlepeer-review

Abstract

The increased number of data repositories has greatly increased the availability of open data. To enable broad discovery and access to research dataset, some data repositories have begun leveraging data discovery services from commercial search engines by embedding structured metadata markup in dataset web landing pages using vocabularies from Schema.org and extensions. This paper aims to examine metadata interoperability for supporting global data discovery. Specifically, the paper reports a survey on which metadata schema has been adopted by participating data repositories, and presents an analysis of crosswalks from fourteen research data schemas to Schema.org. The analysis indicates most descriptive metadata are interoperable among the schemas, the most inconsistent mapping is the rights metadata, and a large gap exists in the structural metadata and controlled vocabularies to specify various property values. The analysis and collated crosswalks can serve as a reference for data repositories when they develop crosswalks from their own schemasto Schema.org, and provide the research data community a benchmark of structured metadata implementation.
Original languageEnglish
Article number186
Pages (from-to)1-21
Number of pages21
JournalData Intelligence
Early online date7 Oct 2022
DOIs
Publication statusPublished - 7 Oct 2022

Keywords

  • mapping
  • schemas
  • metadata

Fingerprint

Dive into the research topics of 'An analysis of crosswalks from research data schemas to Schema.org'. Together they form a unique fingerprint.

Cite this