Daily life in the Open Biologist’s second job, as a Data Curator [version 2; peer review: 3 approved, 1 approved with reservations]

Tomasz Zieliński; Irina Kalita; Andrew J. Millar; Livia C.T. Scorza; Meriem El Karoui; Alessia Lepore

Wellcome Open Research (Dec 2024)

Daily life in the Open Biologist’s second job, as a Data Curator [version 2; peer review: 3 approved, 1 approved with reservations]

Tomasz Zieliński,
Irina Kalita,
Andrew J. Millar,
Livia C.T. Scorza,
Meriem El Karoui,
Alessia Lepore

Affiliations

Tomasz Zieliński: ORCiD; Centre for Engineering Biology and School of Biological Sciences, University of Edinburgh, Edinburgh, Scotland, EH9 3BF, UK
Irina Kalita: Centre for Engineering Biology and School of Biological Sciences, University of Edinburgh, Edinburgh, Scotland, EH9 3BF, UK
Andrew J. Millar: ORCiD; Centre for Engineering Biology and School of Biological Sciences, University of Edinburgh, Edinburgh, Scotland, EH9 3BF, UK
Livia C.T. Scorza: ORCiD; Centre for Engineering Biology and School of Biological Sciences, University of Edinburgh, Edinburgh, Scotland, EH9 3BF, UK
Meriem El Karoui: Centre for Engineering Biology and School of Biological Sciences, University of Edinburgh, Edinburgh, Scotland, EH9 3BF, UK
Alessia Lepore: ORCiD; Centre for Engineering Biology and School of Biological Sciences, University of Edinburgh, Edinburgh, Scotland, EH9 3BF, UK

Journal volume & issue: Vol. 9

Abstract

Read online

Background Data reusability is the driving force of the research data life cycle. However, implementing strategies to generate reusable data from the data creation to the sharing stages is still a significant challenge. Even when datasets supporting a study are publicly shared, the outputs are often incomplete and/or not reusable. The FAIR (Findable, Accessible, Interoperable, Reusable) principles were published as a general guidance to promote data reusability in research, but the practical implementation of FAIR principles in research groups is still falling behind. In biology, the lack of standard practices for a large diversity of data types, data storage and preservation issues, and the lack of familiarity among researchers are some of the main impeding factors to achieve FAIR data. Past literature describes biological curation from the perspective of data resources that aggregate data, often from publications. Methods Our team works alongside data-generating, experimental researchers so our perspective aligns with publication authors rather than aggregators. We detail the processes for organizing datasets for publication, showcasing practical examples from data curation to data sharing. We also recommend strategies, tools and web resources to maximize data reusability, while maintaining research productivity. Conclusion We propose a simple approach to address research data management challenges for experimentalists, designed to promote FAIR data sharing. This strategy not only simplifies data management, but also enhances data visibility, recognition and impact, ultimately benefiting the entire scientific community.

Published in Wellcome Open Research

ISSN: 2398-502X (Online)
Publisher: Wellcome
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://wellcomeopenresearch.org/

About the journal

Abstract

Keywords