Cell Genomics (Jun 2022)

A multi-platform reference for somatic structural variation detection

  • Jose Espejo Valle-Inclan,
  • Nicolle J.M. Besselink,
  • Ewart de Bruijn,
  • Daniel L. Cameron,
  • Jana Ebler,
  • Joachim Kutzera,
  • Stef van Lieshout,
  • Tobias Marschall,
  • Marcel Nelen,
  • Peter Priestley,
  • Ivo Renkens,
  • Margaretha G.M. Roemer,
  • Markus J. van Roosmalen,
  • Aaron M. Wenger,
  • Bauke Ylstra,
  • Remond J.A. Fijneman,
  • Wigard P. Kloosterman,
  • Edwin Cuppen

Journal volume & issue
Vol. 2, no. 6
p. 100139

Abstract

Read online

Summary: Accurate detection of somatic structural variation (SV) in cancer genomes remains a challenging problem. This is in part due to the lack of high-quality, gold-standard datasets that enable the benchmarking of experimental approaches and bioinformatic analysis pipelines. Here, we performed somatic SV analysis of the paired melanoma and normal lymphoblastoid COLO829 cell lines using four different sequencing technologies. Based on the evidence from multiple technologies combined with extensive experimental validation, we compiled a comprehensive set of carefully curated and validated somatic SVs, comprising all SV types. We demonstrate the utility of this resource by determining the SV detection performance as a function of tumor purity and sequence depth, highlighting the importance of assessing these parameters in cancer genomics projects. The truth somatic SV dataset as well as the underlying raw multi-platform sequencing data are freely available and are an important resource for community somatic benchmarking efforts.

Keywords