mSystems (Dec 2017)

Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes

  • Lingjing Jiang,
  • Amnon Amir,
  • James T. Morton,
  • Ruth Heller,
  • Ery Arias-Castro,
  • Rob Knight

DOI
https://doi.org/10.1128/mSystems.00092-17
Journal volume & issue
Vol. 2, no. 6

Abstract

Read online

ABSTRACT Differential abundance testing is a critical task in microbiome studies that is complicated by the sparsity of data matrices. Here we adapt for microbiome studies a solution from the field of gene expression analysis to produce a new method, discrete false-discovery rate (DS-FDR), that greatly improves the power to detect differential taxa by exploiting the discreteness of the data. Additionally, DS-FDR is relatively robust to the number of noninformative features, and thus removes the problem of filtering taxonomy tables by an arbitrary abundance threshold. We show by using a combination of simulations and reanalysis of nine real-world microbiome data sets that this new method outperforms existing methods at the differential abundance testing task, producing a false-discovery rate that is up to threefold more accurate, and halves the number of samples required to find a given difference (thus increasing the efficiency of microbiome experiments considerably). We therefore expect DS-FDR to be widely applied in microbiome studies. IMPORTANCE DS-FDR can achieve higher statistical power to detect significant findings in sparse and noisy microbiome data compared to the commonly used Benjamini-Hochberg procedure and other FDR-controlling procedures.

Keywords