Applying causal discovery to single-cell analyses using CausalCell
Yujian Wen,
Jielong Huang,
Shuhui Guo,
Yehezqel Elyahu,
Alon Monsonego,
Hai Zhang,
Yanqing Ding,
Hao Zhu
Affiliations
Yujian Wen
Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
Jielong Huang
Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
Shuhui Guo
Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
Yehezqel Elyahu
The Shraga Segal Department of Microbiology, Immunology and Genetics, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
Alon Monsonego
The Shraga Segal Department of Microbiology, Immunology and Genetics, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
Hai Zhang
Network Center, Southern Medical University, Guangzhou, China
Yanqing Ding
Department of Pathology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China; Guangdong-Hong Kong-Macao Greater Bay Area Center for Brain Science and Brain-Inspired Intelligence, Southern Medical University, Guangzhou, China; Guangdong Provincial Key Lab of Single Cell Technology and Application, Southern Medical University, Guangzhou, China
Correlation between objects is prone to occur coincidentally, and exploring correlation or association in most situations does not answer scientific questions rich in causality. Causal discovery (also called causal inference) infers causal interactions between objects from observational data. Reported causal discovery methods and single-cell datasets make applying causal discovery to single cells a promising direction. However, evaluating and choosing causal discovery methods and developing and performing proper workflow remain challenges. We report the workflow and platform CausalCell (http://www.gaemons.net/causalcell/causalDiscovery/) for performing single-cell causal discovery. The workflow/platform is developed upon benchmarking four kinds of causal discovery methods and is examined by analyzing multiple single-cell RNA-sequencing (scRNA-seq) datasets. Our results suggest that different situations need different methods and the constraint-based PC algorithm with kernel-based conditional independence tests work best in most situations. Related issues are discussed and tips for best practices are given. Inferred causal interactions in single cells provide valuable clues for investigating molecular interactions and gene regulations, identifying critical diagnostic and therapeutic targets, and designing experimental and clinical interventions.