Computing Power and Sample Size for the False Discovery Rate in Multiple Applications

Yonghui Ni; Anna Eames Seffernick; Arzu Onar-Thomas; Stanley B. Pounds

doi:10.3390/genes15030344

Genes (Mar 2024)

Computing Power and Sample Size for the False Discovery Rate in Multiple Applications

Yonghui Ni,
Anna Eames Seffernick,
Arzu Onar-Thomas,
Stanley B. Pounds

Affiliations

Yonghui Ni: Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
Anna Eames Seffernick: Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
Arzu Onar-Thomas: Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
Stanley B. Pounds: Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA

DOI: https://doi.org/10.3390/genes15030344
Journal volume & issue: Vol. 15, no. 3
p. 344

Abstract

Read online

The false discovery rate (FDR) is a widely used metric of statistical significance for genomic data analyses that involve multiple hypothesis testing. Power and sample size considerations are important in planning studies that perform these types of genomic data analyses. Here, we propose a three-rectangle approximation of a p-value histogram to derive a formula to compute the statistical power and sample size for analyses that involve the FDR. We also introduce the R package FDRsamplesize2, which incorporates these and other power calculation formulas to compute power for a broad variety of studies not covered by other FDR power calculation software. A few illustrative examples are provided. The FDRsamplesize2 package is available on CRAN.

Published in Genes

ISSN: 2073-4425 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Biology (General): Genetics
Website: http://www.mdpi.com/journal/genes/

About the journal

Abstract

Keywords