Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data

Sunnie Grace McCalla; Alireza Fotuhi Siahpirani; Jiaxin Li; Saptarshi Pyne; Matthew Stone; Viswesh Periyasamy; Junha Shin; Sushmita Roy

doi:10.1093/g3journal/jkad004

G3: Genes, Genomes, Genetics (Jan 2023)

Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data

Sunnie Grace McCalla,
Alireza Fotuhi Siahpirani,
Jiaxin Li,
Saptarshi Pyne,
Matthew Stone,
Viswesh Periyasamy,
Junha Shin,
Sushmita Roy

Affiliations

Sunnie Grace McCalla: Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA
Alireza Fotuhi Siahpirani: Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA
Jiaxin Li: Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA
Saptarshi Pyne: Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA
Matthew Stone: Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA
Viswesh Periyasamy: Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA
Junha Shin: Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA
Sushmita Roy: Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA

DOI: https://doi.org/10.1093/g3journal/jkad004
Journal volume & issue: Vol. 13, no. 3

Abstract

Read online

AbstractSingle-cell RNA-sequencing (scRNA-seq) offers unparalleled insight into the transcriptional programs of different cellular states by measuring the transcriptome of thousands of individual cells. An emerging problem in the analysis of scRNA-seq is the inference of transcriptional gene regulatory networks and a number of methods with different learning frameworks have been developed to address this problem. Here, we present an expanded benchmarking study of eleven recent network inference methods on seven published scRNA-seq datasets in human, mouse, and yeast considering different types of gold standard networks and evaluation metrics. We evaluate methods based on their computing requirements as well as on their ability to recover the network structure. We find that, while most methods have a modest recovery of experimentally derived interactions based on global metrics such as Area Under the Precision Recall curve, methods are able to capture targets of regulators that are relevant to the system under study. Among the top performing methods that use only expression were SCENIC, PIDC, MERLIN or Correlation. Addition of prior biological knowledge and the estimation of transcription factor activities resulted in the best overall performance with the Inferelator and MERLIN methods that use prior knowledge outperforming methods that use expression alone. We found that imputation for network inference did not improve network inference accuracy and could be detrimental. Comparisons of inferred networks for comparable bulk conditions showed that the networks inferred from scRNA-seq datasets are often better or at par with the networks inferred from bulk datasets. Our analysis should be beneficial in selecting methods for network inference. At the same time, this highlights the need for improved methods and better gold standards for regulatory network inference from scRNAseq datasets.

Published in G3: Genes, Genomes, Genetics

ISSN: 2160-1836 (Online)
Publisher: Oxford University Press
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General): Genetics
Website: https://academic.oup.com/g3journal

About the journal