Frontiers in Genetics (Oct 2024)

Simplified methods for variance estimation in microbiome abundance count data analysis

  • Yiming Shi,
  • Lili Liu,
  • Jun Chen,
  • Kristine M. Wylie,
  • Todd N. Wylie,
  • Molly J. Stout,
  • Chan Wang,
  • Haixiang Zhang,
  • Ya-Chen T. Shih,
  • Xiaoyi Xu,
  • Ai Zhang,
  • Sung Hee Park,
  • Hongmei Jiang,
  • Lei Liu

DOI
https://doi.org/10.3389/fgene.2024.1458851
Journal volume & issue
Vol. 15

Abstract

Read online

The complex nature of microbiome data has made the differential abundance analysis challenging. Microbiome abundance counts are often skewed to the right and heteroscedastic (also known as overdispersion), potentially leading to incorrect inferences if not properly addressed. In this paper, we propose a simple yet effective framework to tackle the challenges by integrating Poisson (log-linear) regression with standard error estimation through the Bootstrap method and Sandwich robust estimation. Such standard error estimates are accurate and yield satisfactory inference even if the distributional assumption or the variance structure is incorrect. Our approach is validated through extensive simulation studies, demonstrating its effectiveness in addressing overdispersion and improving inference accuracy. Additionally, we apply our approach to two real datasets collected from the human gut and vagina, respectively, demonstrating the wide applicability of our methods. The results highlight the efficacy of our covariance estimators in addressing the challenges of microbiome data analysis. The corresponding software implementation is publicly available at https://github.com/yimshi/robustestimates.

Keywords