Nature Communications (Jul 2024)

A real-world multi-center RNA-seq benchmarking study using the Quartet and MAQC reference materials

  • Duo Wang,
  • Yaqing Liu,
  • Yuanfeng Zhang,
  • Qingwang Chen,
  • Yanxi Han,
  • Wanwan Hou,
  • Cong Liu,
  • Ying Yu,
  • Ziyang Li,
  • Ziqiang Li,
  • Jiaxin Zhao,
  • Leming Shi,
  • Yuanting Zheng,
  • Jinming Li,
  • Rui Zhang

DOI
https://doi.org/10.1038/s41467-024-50420-y
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 21

Abstract

Read online

Abstract Translating RNA-seq into clinical diagnostics requires ensuring the reliability and cross-laboratory consistency of detecting clinically relevant subtle differential expressions, such as those between different disease subtypes or stages. As part of the Quartet project, we present an RNA-seq benchmarking study across 45 laboratories using the Quartet and MAQC reference samples spiked with ERCC controls. Based on multiple types of ‘ground truth’, we systematically assess the real-world RNA-seq performance and investigate the influencing factors involved in 26 experimental processes and 140 bioinformatics pipelines. Here we show greater inter-laboratory variations in detecting subtle differential expressions among the Quartet samples. Experimental factors including mRNA enrichment and strandedness, and each bioinformatics step, emerge as primary sources of variations in gene expression. We underscore the profound influence of experimental execution, and provide best practice recommendations for experimental designs, strategies for filtering low-expression genes, and the optimal gene annotation and analysis pipelines. In summary, this study lays the foundation for developing and quality control of RNA-seq for clinical diagnostic purposes.