Frontiers in Genetics (Oct 2020)

A Review of Integrative Imputation for Multi-Omics Datasets

  • Meng Song,
  • Jonathan Greenbaum,
  • Joseph Luttrell,
  • Weihua Zhou,
  • Chong Wu,
  • Hui Shen,
  • Ping Gong,
  • Chaoyang Zhang,
  • Hong-Wen Deng

DOI
https://doi.org/10.3389/fgene.2020.570255
Journal volume & issue
Vol. 11

Abstract

Read online

Multi-omics studies, which explore the interactions between multiple types of biological factors, have significant advantages over single-omics analysis for their ability to provide a more holistic view of biological processes, uncover the causal and functional mechanisms for complex diseases, and facilitate new discoveries in precision medicine. However, omics datasets often contain missing values, and in multi-omics study designs it is common for individuals to be represented for some omics layers but not all. Since most statistical analyses cannot be applied directly to the incomplete datasets, imputation is typically performed to infer the missing values. Integrative imputation techniques which make use of the correlations and shared information among multi-omics datasets are expected to outperform approaches that rely on single-omics information alone, resulting in more accurate results for the subsequent downstream analyses. In this review, we provide an overview of the currently available imputation methods for handling missing values in bioinformatics data with an emphasis on multi-omics imputation. In addition, we also provide a perspective on how deep learning methods might be developed for the integrative imputation of multi-omics datasets.

Keywords