CoRe: a robustly benchmarked R package for identifying core-fitness genes in genome-wide pooled CRISPR-Cas9 screens

Alessandro Vinceti; Emre Karakoc; Clare Pacini; Umberto Perron; Riccardo Roberto De Lucia; Mathew J. Garnett; Francesco Iorio

doi:10.1186/s12864-021-08129-5

BMC Genomics (Nov 2021)

CoRe: a robustly benchmarked R package for identifying core-fitness genes in genome-wide pooled CRISPR-Cas9 screens

Alessandro Vinceti,
Emre Karakoc,
Clare Pacini,
Umberto Perron,
Riccardo Roberto De Lucia,
Mathew J. Garnett,
Francesco Iorio

Affiliations

Alessandro Vinceti: Human Technopole
Emre Karakoc: Wellcome Sanger Institute, Wellcome Genome Campus
Clare Pacini: Wellcome Sanger Institute, Wellcome Genome Campus
Umberto Perron: Human Technopole
Riccardo Roberto De Lucia: Human Technopole
Mathew J. Garnett: Wellcome Sanger Institute, Wellcome Genome Campus
Francesco Iorio: Human Technopole

DOI: https://doi.org/10.1186/s12864-021-08129-5
Journal volume & issue: Vol. 22, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Background CRISPR-Cas9 genome-wide screens are being increasingly performed, allowing systematic explorations of cancer dependencies at unprecedented accuracy and scale. One of the major computational challenges when analysing data derived from such screens is to identify genes that are essential for cell survival invariantly across tissues, conditions, and genomic-contexts (core-fitness genes), and to distinguish them from context-specific essential genes. This is of paramount importance to assess the safety profile of candidate therapeutic targets and for elucidating mechanisms involved in tissue-specific genetic diseases. Results We have developed CoRe: an R package implementing existing and novel methods for the identification of core-fitness genes (at two different level of stringency) from joint analyses of multiple CRISPR-Cas9 screens. We demonstrate, through a fully reproducible benchmarking pipeline, that CoRe outperforms state-of-the-art tools, yielding more reliable and biologically relevant sets of core-fitness genes. Conclusions CoRe offers a flexible pipeline, compatible with many pre-processing methods for the analysis of CRISPR data, which can be tailored onto different use-cases. The CoRe package can be used for the identification of high-confidence novel core-fitness genes, as well as a means to filter out potentially cytotoxic hits while analysing cancer dependency datasets for identifying and prioritising novel selective therapeutic targets.

Published in BMC Genomics

ISSN: 1471-2164 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Biology (General): Genetics
Website: http://bmcgenomics.biomedcentral.com

About the journal

Abstract

Keywords