Nature Communications (Sep 2024)

Large-scale analysis of whole genome sequencing data from formalin-fixed paraffin-embedded cancer specimens demonstrates preservation of clinical utility

  • Shadi Basyuni,
  • Laura Heskin,
  • Andrea Degasperi,
  • Daniella Black,
  • Gene C. C. Koh,
  • Lucia Chmelova,
  • Giuseppe Rinaldi,
  • Steven Bell,
  • Louise Grybowicz,
  • Greg Elgar,
  • Yasin Memari,
  • Pauline Robbe,
  • Zoya Kingsbury,
  • Carlos Caldas,
  • Jean Abraham,
  • Anna Schuh,
  • Louise Jones,
  • PARTNER Trial Group,
  • Personalised Breast Cancer Program Group,
  • Marc Tischkowitz,
  • Matthew A. Brown,
  • Helen R. Davies,
  • Serena Nik-Zainal

DOI
https://doi.org/10.1038/s41467-024-51577-2
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 12

Abstract

Whole genome sequencing (WGS) provides comprehensive, individualised cancer genomic information. However, routine tumour biopsies are formalin-fixed and paraffin-embedded (FFPE), which damages DNA and has historically limited their use in WGS. Here we analyse FFPE cancer WGS datasets from England’s 100,000 Genomes Project, comparing 578 FFPE samples with 11,014 fresh frozen (FF) samples across multiple tumour types. We use an approach that characterises rather than discards artefacts. We identify three artefactual signatures, including one known (SBS57) and two previously uncharacterised (SBS FFPE, ID FFPE), and develop an “FFPEImpact” score that quantifies sample artefacts. Despite inferior sequencing quality, FFPE-derived data identifies clinically actionable variants and mutational signatures, and permits algorithmic stratification. Matched FF/FFPE validation cohorts show good concordance, while acknowledging SBS, ID and copy-number artefacts. While FF-derived WGS data remains the gold standard, FFPE samples can be used for WGS if required, using the analytical advancements developed here, potentially democratising whole cancer genomics for many.