Genes (Mar 2024)

Computing Power and Sample Size for the False Discovery Rate in Multiple Applications

  • Yonghui Ni,
  • Anna Eames Seffernick,
  • Arzu Onar-Thomas,
  • Stanley B. Pounds

DOI
https://doi.org/10.3390/genes15030344
Journal volume & issue
Vol. 15, no. 3
p. 344

Abstract

Read online

The false discovery rate (FDR) is a widely used metric of statistical significance for genomic data analyses that involve multiple hypothesis testing. Power and sample size considerations are important in planning studies that perform these types of genomic data analyses. Here, we propose a three-rectangle approximation of a p-value histogram to derive a formula to compute the statistical power and sample size for analyses that involve the FDR. We also introduce the R package FDRsamplesize2, which incorporates these and other power calculation formulas to compute power for a broad variety of studies not covered by other FDR power calculation software. A few illustrative examples are provided. The FDRsamplesize2 package is available on CRAN.

Keywords