Scientific Data (Feb 2024)

A single cell RNAseq benchmark experiment embedding “controlled” cancer heterogeneity

  • Maddalena Arigoni,
  • Maria Luisa Ratto,
  • Federica Riccardo,
  • Elisa Balmas,
  • Lorenzo Calogero,
  • Francesca Cordero,
  • Marco Beccuti,
  • Raffaele A. Calogero,
  • Luca Alessandri

DOI
https://doi.org/10.1038/s41597-024-03002-y
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Single-cell RNA sequencing (scRNA-seq) has emerged as a vital tool in tumour research, enabling the exploration of molecular complexities at the individual cell level. It offers new technical possibilities for advancing tumour research with the potential to yield significant breakthroughs. However, deciphering meaningful insights from scRNA-seq data poses challenges, particularly in cell annotation and tumour subpopulation identification. Efficient algorithms are therefore needed to unravel the intricate biological processes of cancer. To address these challenges, benchmarking datasets are essential to validate bioinformatics methodologies for analysing single-cell omics in oncology. Here, we present a 10XGenomics scRNA-seq experiment, providing a controlled heterogeneous environment using lung cancer cell lines characterised by the expression of seven different driver genes (EGFR, ALK, MET, ERBB2, KRAS, BRAF, ROS1), leading to partially overlapping functional pathways. Our dataset provides a comprehensive framework for the development and validation of methodologies for analysing cancer heterogeneity by means of scRNA-seq.