BMC Bioinformatics (Feb 2022)

A data management infrastructure for the integration of imaging and omics data in life sciences

  • Luis Kuhn Cuellar,
  • Andreas Friedrich,
  • Gisela Gabernet,
  • Luis de la Garza,
  • Sven Fillinger,
  • Adrian Seyboldt,
  • Tobias Koch,
  • Sven zur Oven-Krockhaus,
  • Friederike Wanke,
  • Sandra Richter,
  • Wolfgang M. Thaiss,
  • Marius Horger,
  • Nisar Malek,
  • Klaus Harter,
  • Michael Bitzer,
  • Sven Nahnsen

DOI
https://doi.org/10.1186/s12859-022-04584-3
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Background As technical developments in omics and biomedical imaging increase the throughput of data generation in life sciences, the need for information systems capable of managing heterogeneous digital assets is increasing. In particular, systems supporting the findability, accessibility, interoperability, and reusability (FAIR) principles of scientific data management. Results We propose a Service Oriented Architecture approach for integrated management and analysis of multi-omics and biomedical imaging data. Our architecture introduces an image management system into a FAIR-supporting, web-based platform for omics data management. Interoperable metadata models and middleware components implement the required data management operations. The resulting architecture allows for FAIR management of omics and imaging data, facilitating metadata queries from software applications. The applicability of the proposed architecture is demonstrated using two technical proofs of concept and a use case, aimed at molecular plant biology and clinical liver cancer research, which integrate various imaging and omics modalities. Conclusions We describe a data management architecture for integrated, FAIR-supporting management of omics and biomedical imaging data, and exemplify its applicability for basic biology research and clinical studies. We anticipate that FAIR data management systems for multi-modal data repositories will play a pivotal role in data-driven research, including studies which leverage advanced machine learning methods, as the joint analysis of omics and imaging data, in conjunction with phenotypic metadata, becomes not only desirable but necessary to derive novel insights into biological processes.

Keywords