Clipper: p-value-free FDR control on high-throughput data from two conditions

Xinzhou Ge; Yiling Elaine Chen; Dongyuan Song; MeiLu McDermott; Kyla Woyshner; Antigoni Manousopoulou; Ning Wang; Wei Li; Leo D. Wang; Jingyi Jessica Li

doi:10.1186/s13059-021-02506-9

Genome Biology (Oct 2021)

Affiliations

Xinzhou Ge: Department of Statistics, University of California
Yiling Elaine Chen: Department of Statistics, University of California
Dongyuan Song: Interdepartmental Program in Bioinformatics, University of California
MeiLu McDermott: Beckman Research Institute, City of Hope National Medical Center
Kyla Woyshner: Beckman Research Institute, City of Hope National Medical Center
Antigoni Manousopoulou: Beckman Research Institute, City of Hope National Medical Center
Ning Wang: Interdepartmental Program in Bioinformatics, University of California
Wei Li: Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California
Leo D. Wang: Beckman Research Institute, City of Hope National Medical Center
Jingyi Jessica Li: Department of Statistics, University of California

DOI: https://doi.org/10.1186/s13059-021-02506-9
Journal volume & issue: Vol. 22, no. 1
pp. 1 – 29

Abstract

High-throughput biological data analysis commonly involves identifying features, such as genes, genomic regions, and proteins, whose values differ between two conditions, from among numerous features measured simultaneously. The most widely used criterion for ensuring the reliability of such analyses is the false discovery rate (FDR), which is primarily controlled using p-values. However, obtaining valid p-values relies on either reasonable assumptions about the data distribution or large numbers of replicates under both conditions. Clipper is a general statistical framework for FDR control that does not rely on p-values or specific data distributions. Clipper outperforms existing methods for a broad range of applications in high-throughput data analysis.
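
To make the idea of FDR control without p-values concrete, the sketch below shows one generic thresholding rule of this kind, in the style of the Barber-Candès procedure. It is an illustration under stated assumptions, not necessarily the procedure the paper uses: each feature is assumed to have a single "contrast score" (for example, the difference between its average measurement under the two conditions), and scores of uninteresting features are assumed to be symmetric around zero. The function names `bc_threshold` and `discoveries` and the example scores are hypothetical.

```python
def bc_threshold(contrast_scores, q):
    """Return the smallest t > 0 such that
    (1 + #{C_j <= -t}) / max(#{C_j >= t}, 1) <= q, or None if no t qualifies.

    Assumption (illustrative): null features have contrast scores symmetric
    around zero, so the count of scores <= -t estimates the number of false
    discoveries among scores >= t.
    """
    # Only the observed absolute values need to be tried as thresholds.
    candidates = sorted({abs(c) for c in contrast_scores if c != 0})
    for t in candidates:
        negatives = sum(1 for c in contrast_scores if c <= -t)
        positives = sum(1 for c in contrast_scores if c >= t)
        if (1 + negatives) / max(positives, 1) <= q:
            return t
    return None


def discoveries(contrast_scores, q):
    """Indices of features whose contrast score reaches the threshold."""
    t = bc_threshold(contrast_scores, q)
    if t is None:
        return []
    return [j for j, c in enumerate(contrast_scores) if c >= t]


# Made-up contrast scores: ten clearly positive features plus small noise.
scores = [5.0, 4.2, 3.9, 3.5, 3.1, 2.8, 2.5, 2.2, 2.0, 1.8,
          0.4, -0.3, 0.2, -0.5, 0.1]
print(bc_threshold(scores, q=0.2))
print(discoveries(scores, q=0.2))
```

The rule needs only the signs and magnitudes of the contrast scores, so no distributional model or p-value computation enters the calculation; the target FDR level `q` is the only tuning input.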

Published in Genome Biology

ISSN: 1474-760X (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General): Genetics
Website: https://genomebiology.biomedcentral.com/
