A multi-platform reference for somatic structural variation detection
Jose Espejo Valle-Inclan,
Nicolle J.M. Besselink,
Ewart de Bruijn,
Daniel L. Cameron,
Jana Ebler,
Joachim Kutzera,
Stef van Lieshout,
Tobias Marschall,
Marcel Nelen,
Peter Priestley,
Ivo Renkens,
Margaretha G.M. Roemer,
Markus J. van Roosmalen,
Aaron M. Wenger,
Bauke Ylstra,
Remond J.A. Fijneman,
Wigard P. Kloosterman,
Edwin Cuppen
Affiliations
Jose Espejo Valle-Inclan
Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Nicolle J.M. Besselink
Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Ewart de Bruijn
Hartwig Medical Foundation, Amsterdam, the Netherlands
Daniel L. Cameron
Hartwig Medical Foundation, Amsterdam, the Netherlands; Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC, Australia
Jana Ebler
Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Joachim Kutzera
Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Stef van Lieshout
Hartwig Medical Foundation, Amsterdam, the Netherlands
Tobias Marschall
Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
Marcel Nelen
Department of Human Genetics, Radboud UMC, Nijmegen, the Netherlands
Peter Priestley
Hartwig Medical Foundation, Amsterdam, the Netherlands
Ivo Renkens
Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Margaretha G.M. Roemer
Department of Pathology, Amsterdam UMC, Vrije Universiteit Amsterdam, Cancer Center Amsterdam, Amsterdam, the Netherlands
Markus J. van Roosmalen
Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands
Aaron M. Wenger
Pacific Biosciences, Menlo Park, CA, USA
Bauke Ylstra
Department of Pathology, Amsterdam UMC, Vrije Universiteit Amsterdam, Cancer Center Amsterdam, Amsterdam, the Netherlands
Remond J.A. Fijneman
Department of Pathology, Netherlands Cancer Institute, Amsterdam, the Netherlands
Wigard P. Kloosterman
Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands; Corresponding author
Edwin Cuppen
Center for Molecular Medicine and Oncode Institute, UMC Utrecht, Utrecht, the Netherlands; Hartwig Medical Foundation, Amsterdam, the Netherlands; Corresponding author
Summary: Accurate detection of somatic structural variation (SV) in cancer genomes remains a challenging problem. This is in part due to the lack of high-quality, gold-standard datasets that enable the benchmarking of experimental approaches and bioinformatic analysis pipelines. Here, we performed somatic SV analysis of the paired melanoma and normal lymphoblastoid COLO829 cell lines using four different sequencing technologies. Based on the evidence from multiple technologies combined with extensive experimental validation, we compiled a comprehensive set of carefully curated and validated somatic SVs, comprising all SV types. We demonstrate the utility of this resource by determining the SV detection performance as a function of tumor purity and sequence depth, highlighting the importance of assessing these parameters in cancer genomics projects. The truth somatic SV dataset as well as the underlying raw multi-platform sequencing data are freely available and are an important resource for community somatic benchmarking efforts.