eLife (Mar 2020)

Wikidata as a knowledge graph for the life sciences

  • Andra Waagmeester,
  • Gregory Stupp,
  • Sebastian Burgstaller-Muehlbacher,
  • Benjamin M Good,
  • Malachi Griffith,
  • Obi L Griffith,
  • Kristina Hanspers,
  • Henning Hermjakob,
  • Toby S Hudson,
  • Kevin Hybiske,
  • Sarah M Keating,
  • Magnus Manske,
  • Michael Mayers,
  • Daniel Mietchen,
  • Elvira Mitraka,
  • Alexander R Pico,
  • Timothy Putman,
  • Anders Riutta,
  • Nuria Queralt-Rosinach,
  • Lynn M Schriml,
  • Thomas Shafee,
  • Denise Slenter,
  • Ralf Stephan,
  • Katherine Thornton,
  • Ginger Tsueng,
  • Roger Tu,
  • Sabah Ul-Hasan,
  • Egon Willighagen,
  • Chunlei Wu,
  • Andrew I Su

DOI
https://doi.org/10.7554/eLife.52614
Journal volume & issue
Vol. 9

Abstract

Read online

Wikidata is a community-maintained knowledge base that has been assembled from repositories in the fields of genomics, proteomics, genetic variants, pathways, chemical compounds, and diseases, and that adheres to the FAIR principles of findability, accessibility, interoperability and reusability. Here we describe the breadth and depth of the biomedical knowledge contained within Wikidata, and discuss the open-source tools we have built to add information to Wikidata and to synchronize it with source databases. We also demonstrate several use cases for Wikidata, including the crowdsourced curation of biomedical ontologies, phenotype-based diagnosis of disease, and drug repurposing.

Keywords