The balance of reproducibility, sensitivity, and specificity of lists of differentially expressed genes in microarray studies

Su Zhenqiang; Shippy Richard; Puri Raj K; Peterson Ron L; Mei Nan; Ma Yunqing; Luo Yuling; Li Quan-Zhen; Kawasaki Ernest S; Hong Huixiao; Herman Damir; Han Jing; Guo Xu; Fuscoe James C; Frueh Felix W; Fan Xiao-hui; Collins Patrick J; Chu Tzu-Ming; Bertholet Vincent; Cao Xiaoxi; Bao Wenjun; Barbacioru Catalin C; Amur Shashi; Qian Feng; Fang Hong; Boysen Cecilie; Croner Lisa J; Guo Lei; Goodsaid Federico M; Perkins Roger G; Harris Stephen C; Jensen Roderick V; Jones Wendell D; Shi Leming; Sun Yongming; Sun Hongmei; Thorn Brett; Turpaz Yaron; Wang Charles; Wang Sue; Warrington Janet A; Willey James C; Wu Jie; Xie Qian; Zhang Liang; Zhang Lu; Zhong Sheng; Wolfinger Russell D; Tong Weida

doi:10.1186/1471-2105-9-S9-S10

BMC Bioinformatics (Aug 2008)

The balance of reproducibility, sensitivity, and specificity of lists of differentially expressed genes in microarray studies

Su Zhenqiang,
Shippy Richard,
Puri Raj K,
Peterson Ron L,
Mei Nan,
Ma Yunqing,
Luo Yuling,
Li Quan-Zhen,
Kawasaki Ernest S,
Hong Huixiao,
Herman Damir,
Han Jing,
Guo Xu,
Fuscoe James C,
Frueh Felix W,
Fan Xiao-hui,
Collins Patrick J,
Chu Tzu-Ming,
Bertholet Vincent,
Cao Xiaoxi,
Bao Wenjun,
Barbacioru Catalin C,
Amur Shashi,
Qian Feng,
Fang Hong,
Boysen Cecilie,
Croner Lisa J,
Guo Lei,
Goodsaid Federico M,
Perkins Roger G,
Harris Stephen C,
Jensen Roderick V,
Jones Wendell D,
Shi Leming,
Sun Yongming,
Sun Hongmei,
Thorn Brett,
Turpaz Yaron,
Wang Charles,
Wang Sue,
Warrington Janet A,
Willey James C,
Wu Jie,
Xie Qian,
Zhang Liang,
Zhang Lu,
Zhong Sheng,
Wolfinger Russell D,
Tong Weida

Affiliations

Su Zhenqiang
Shippy Richard
Puri Raj K
Peterson Ron L
Mei Nan
Ma Yunqing
Luo Yuling
Li Quan-Zhen
Kawasaki Ernest S
Hong Huixiao
Herman Damir
Han Jing
Guo Xu
Fuscoe James C
Frueh Felix W
Fan Xiao-hui
Collins Patrick J
Chu Tzu-Ming
Bertholet Vincent
Cao Xiaoxi
Bao Wenjun
Barbacioru Catalin C
Amur Shashi
Qian Feng
Fang Hong
Boysen Cecilie
Croner Lisa J
Guo Lei
Goodsaid Federico M
Perkins Roger G
Harris Stephen C
Jensen Roderick V
Jones Wendell D
Shi Leming
Sun Yongming
Sun Hongmei
Thorn Brett
Turpaz Yaron
Wang Charles
Wang Sue
Warrington Janet A
Willey James C
Wu Jie
Xie Qian
Zhang Liang
Zhang Lu
Zhong Sheng
Wolfinger Russell D
Tong Weida

DOI: https://doi.org/10.1186/1471-2105-9-S9-S10
Journal volume & issue: Vol. 9, no. Suppl 9
p. S10

Abstract

Read online

Abstract Background Reproducibility is a fundamental requirement in scientific experiments. Some recent publications have claimed that microarrays are unreliable because lists of differentially expressed genes (DEGs) are not reproducible in similar experiments. Meanwhile, new statistical methods for identifying DEGs continue to appear in the scientific literature. The resultant variety of existing and emerging methods exacerbates confusion and continuing debate in the microarray community on the appropriate choice of methods for identifying reliable DEG lists. Results Using the data sets generated by the MicroArray Quality Control (MAQC) project, we investigated the impact on the reproducibility of DEG lists of a few widely used gene selection procedures. We present comprehensive results from inter-site comparisons using the same microarray platform, cross-platform comparisons using multiple microarray platforms, and comparisons between microarray results and those from TaqMan – the widely regarded "standard" gene expression platform. Our results demonstrate that (1) previously reported discordance between DEG lists could simply result from ranking and selecting DEGs solely by statistical significance (P) derived from widely used simple t-tests; (2) when fold change (FC) is used as the ranking criterion with a non-stringent P-value cutoff filtering, the DEG lists become much more reproducible, especially when fewer genes are selected as differentially expressed, as is the case in most microarray studies; and (3) the instability of short DEG lists solely based on P-value ranking is an expected mathematical consequence of the high variability of the t-values; the more stringent the P-value threshold, the less reproducible the DEG list is. These observations are also consistent with results from extensive simulation calculations. Conclusion We recommend the use of FC-ranking plus a non-stringent P cutoff as a straightforward and baseline practice in order to generate more reproducible DEG lists. Specifically, the P-value cutoff should not be stringent (too small) and FC should be as large as possible. Our results provide practical guidance to choose the appropriate FC and P-value cutoffs when selecting a given number of DEGs. The FC criterion enhances reproducibility, whereas the P criterion balances sensitivity and specificity.

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal