Learning Health Systems (Jan 2024)
Toward a common standard for data and specimen provenance in life sciences
- Rudolf Wittner,
- Petr Holub,
- Cecilia Mascia,
- Francesca Frexia,
- Heimo Müller,
- Markus Plass,
- Clare Allocca,
- Fay Betsou,
- Tony Burdett,
- Ibon Cancio,
- Adriane Chapman,
- Martin Chapman,
- Mélanie Courtot,
- Vasa Curcin,
- Johann Eder,
- Mark Elliot,
- Katrina Exter,
- Carole Goble,
- Martin Golebiewski,
- Bron Kisler,
- Andreas Kremer,
- Simone Leo,
- Sheng Lin‐Gibson,
- Anna Marsano,
- Marco Mattavelli,
- Josh Moore,
- Hiroki Nakae,
- Isabelle Perseil,
- Ayat Salman,
- James Sluka,
- Stian Soiland‐Reyes,
- Caterina Strambio‐De‐Castillia,
- Michael Sussman,
- Jason R. Swedlow,
- Kurt Zatloukal,
- Jörg Geiger
Affiliations
- Rudolf Wittner
- BBMRI‐ERIC Graz Austria
- Petr Holub
- BBMRI‐ERIC Graz Austria
- Cecilia Mascia
- CRS4—Center for Advanced Studies Research and Development in Sardinia Pula Italy
- Francesca Frexia
- CRS4—Center for Advanced Studies Research and Development in Sardinia Pula Italy
- Heimo Müller
- Medical University Graz Graz Austria
- Markus Plass
- Medical University Graz Graz Austria
- Clare Allocca
- National Institute of Standards and Technology Gaithersburg Maryland USA
- Fay Betsou
- Biological Resource Center of Institut Pasteur (CRBIP) Paris France
- Tony Burdett
- EMBL's European Bioinformatics Institute (EMBL‐EBI) Cambridge UK
- Ibon Cancio
- Plentzia Marine Station (PiE‐UPV/EHU) University of the Basque Country, EMBRC‐Spain Bilbao Spain
- Adriane Chapman
- University of Southampton Southampton UK
- Martin Chapman
- King's College London London UK
- Mélanie Courtot
- Ontario Institute for Cancer Research Toronto Ontario Canada
- Vasa Curcin
- King's College London London UK
- Johann Eder
- University of Klagenfurt Klagenfurt Austria
- Mark Elliot
- Department of Social Statistics, School of Social Sciences University of Manchester Manchester UK
- Katrina Exter
- Flanders Marine Institute (VLIZ), EMBRC‐Belgium Ostend Belgium
- Carole Goble
- Department of Computer Science University of Manchester Manchester UK
- Martin Golebiewski
- Heidelberg Institute for Theoretical Studies (HITS gGmbH) Heidelberg Germany
- Bron Kisler
- Independent consultant
- Andreas Kremer
- ITTM S.A. Esch‐sur‐Alzette Luxembourg
- Simone Leo
- CRS4—Center for Advanced Studies Research and Development in Sardinia Pula Italy
- Sheng Lin‐Gibson
- Biosystems and Biomaterials Division NIST Gaithersburg Maryland USA
- Anna Marsano
- Department of Biomedicine University of Basel Basel Switzerland
- Marco Mattavelli
- SCI‐STI‐MM École Politechnique Fédérale de Lausanne Lausanne Switzerland
- Josh Moore
- Centre for Gene Regulation and Expression and Division of Computational Biology, School of Life Sciences University of Dundee Dundee UK
- Hiroki Nakae
- Japan bio‐Measurement and Analysis Consortium Tokyo Japan
- Isabelle Perseil
- INSERM–Institut National de la Sante et de la Recherche Medicale Paris France
- Ayat Salman
- Standards Council of Canada Ottawa Ontario Canada
- James Sluka
- Biocomplexity Institute Indiana University Bloomington Indiana USA
- Stian Soiland‐Reyes
- Department of Computer Science University of Manchester Manchester UK
- Caterina Strambio‐De‐Castillia
- Program in Molecular Medicine University of Massachusetts Chan Medical School Worcester Massachusetts USA
- Michael Sussman
- US Department of Agriculture Washington District of Columbia USA
- Jason R. Swedlow
- Centre for Gene Regulation and Expression and Division of Computational Biology, School of Life Sciences University of Dundee Dundee UK
- Kurt Zatloukal
- Medical University Graz Graz Austria
- Jörg Geiger
- Interdisciplinary Bank of Biomaterials and Data Würzburg (ibdw) Würzburg Germany
- DOI
- https://doi.org/10.1002/lrh2.10365
- Journal volume & issue
-
Vol. 8,
no. 1
pp. n/a – n/a
Abstract
Abstract Open and practical exchange, dissemination, and reuse of specimens and data have become a fundamental requirement for life sciences research. The quality of the data obtained and thus the findings and knowledge derived is thus significantly influenced by the quality of the samples, the experimental methods, and the data analysis. Therefore, a comprehensive and precise documentation of the pre‐analytical conditions, the analytical procedures, and the data processing are essential to be able to assess the validity of the research results. With the increasing importance of the exchange, reuse, and sharing of data and samples, procedures are required that enable cross‐organizational documentation, traceability, and non‐repudiation. At present, this information on the provenance of samples and data is mostly either sparse, incomplete, or incoherent. Since there is no uniform framework, this information is usually only provided within the organization and not interoperably. At the same time, the collection and sharing of biological and environmental specimens increasingly require definition and documentation of benefit sharing and compliance to regulatory requirements rather than consideration of pure scientific needs. In this publication, we present an ongoing standardization effort to provide trustworthy machine‐actionable documentation of the data lineage and specimens. We would like to invite experts from the biotechnology and biomedical fields to further contribute to the standard.
Keywords