The FAIR data point populator: collaborative FAIRification and population of FAIR data points

Daphne Wijnbergen; Rajaram Kaliyaperumal; Kees Burger; Luiz Olavo Bonino da Silva Santos; Barend Mons; Marco Roos; Eleni Mina

doi:10.1186/s12911-025-03022-7

BMC Medical Informatics and Decision Making (Jun 2025)

The FAIR data point populator: collaborative FAIRification and population of FAIR data points

Daphne Wijnbergen,
Rajaram Kaliyaperumal,
Kees Burger,
Luiz Olavo Bonino da Silva Santos,
Barend Mons,
Marco Roos,
Eleni Mina

Affiliations

Daphne Wijnbergen: Human Genetics, Leiden University Medical Center
Rajaram Kaliyaperumal: Human Genetics, Leiden University Medical Center
Kees Burger: Human Genetics, Leiden University Medical Center
Luiz Olavo Bonino da Silva Santos: Human Genetics, Leiden University Medical Center
Barend Mons: Human Genetics, Leiden University Medical Center
Marco Roos: Human Genetics, Leiden University Medical Center
Eleni Mina: Human Genetics, Leiden University Medical Center

DOI: https://doi.org/10.1186/s12911-025-03022-7
Journal volume & issue: Vol. 25, no. S1
pp. 1 – 11

Abstract

Read online

Abstract Background Use of the FAIR principles (Findable, Accessible, Interoperable and Reusable) allows the rapidly growing number of biomedical datasets to be optimally (re)used. An important aspect of the FAIR principles is metadata. The FAIR Data Point specifications and reference implementation have been designed as an example on how to publish metadata according to the FAIR principles. Metadata can be added to a FAIR Data Point with the FDP’s web interface or through its API. However, these methods are either limited in scalability or only usable by users with a background in programming. We aim to provide a new tool for populating FDPs with metadata that addresses these limitations with the FAIR Data Point Populator. Results The FAIR Data Point Populator consists of a GitHub workflow together with Excel templates that have tooltips, validation and documentation. The Excel templates are targeted towards non-technical users, and can be used collaboratively in online spreadsheet software. A more technical user then uses the GitHub workflow to read multiple entries in the Excel sheets, and transform it into machine readable metadata. This metadata is then automatically uploaded to a connected FAIR Data Point. We applied the FAIR Data Point Populator on the metadata of two datasets, and a patient registry. We were then able to run a query on the FAIR Data Point Index, in order to retrieve one of the datasets. Conclusion The FAIR Data Point Populator addresses the limitations of the other metadata publication methods by allowing the bulk creation of metadata entries while remaining accessible for users without a background in programming. Additionally, it allows efficient collaboration. As a result of this, the barrier of entry for FAIRification is lower, which allows the creation of FAIR data by more people.

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com

About the journal

Abstract

Keywords