BMC Bioinformatics (May 2023)

Bayesian compositional regression with microbiome features via variational inference

  • Darren A. V. Scott,
  • Ernest Benavente,
  • Julian Libiseller-Egger,
  • Dmitry Fedorov,
  • Jody Phelan,
  • Elena Ilina,
  • Polina Tikhonova,
  • Alexander Kudryavstev,
  • Julia Galeeva,
  • Taane Clark,
  • Alex Lewin

DOI
https://doi.org/10.1186/s12859-023-05219-x
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 29

Abstract

Read online

Abstract The microbiome plays a key role in the health of the human body. Interest often lies in finding features of the microbiome, alongside other covariates, which are associated with a phenotype of interest. One important property of microbiome data, which is often overlooked, is its compositionality as it can only provide information about the relative abundance of its constituting components. Typically, these proportions vary by several orders of magnitude in datasets of high dimensions. To address these challenges we develop a Bayesian hierarchical linear log-contrast model which is estimated by mean field Monte-Carlo co-ordinate ascent variational inference (CAVI-MC) and easily scales to high dimensional data. We use novel priors which account for the large differences in scale and constrained parameter space associated with the compositional covariates. A reversible jump Monte Carlo Markov chain guided by the data through univariate approximations of the variational posterior probability of inclusion, with proposal parameters informed by approximating variational densities via auxiliary parameters, is used to estimate intractable marginal expectations. We demonstrate that our proposed Bayesian method performs favourably against existing frequentist state of the art compositional data analysis methods. We then apply the CAVI-MC to the analysis of real data exploring the relationship of the gut microbiome to body mass index.

Keywords