A novel scatterplot-based method to detect copy number variation (CNV)

Jia-Lu Qiao; Rebecca T. Levinson; Bowang Chen; Stefan T. Engelter; Philipp Erhart; Brady J. Gaynor; Patrick F. McArdle; Kristina Schlicht; Michael Krawczak; Martin Stenman; Arne G. Lindgren; John W. Cole; Caspar Grond-Ginsbach

doi:10.3389/fgene.2023.1166972

Frontiers in Genetics (Jul 2023)

Affiliations

Jia-Lu Qiao: Department of Vascular and Endovascular Surgery, University Hospital Heidelberg, Heidelberg, Germany
Rebecca T. Levinson: Institute for Computational Biomedicine, Faculty of Medicine, Heidelberg University Hospital, Heidelberg, Germany
Rebecca T. Levinson: Department of General Internal Medicine and Psychosomatics, University Hospital Heidelberg, Heidelberg, Germany
Bowang Chen: National Center for Cardiovascular Diseases, Beijing, China
Stefan T. Engelter: Neurorehabilitation Unit, University of Basel and University Center for Medicine of Aging Felix Platter Hospital, Basel, Switzerland
Philipp Erhart: Department of Vascular and Endovascular Surgery, University Hospital Heidelberg, Heidelberg, Germany
Brady J. Gaynor: Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, United States
Patrick F. McArdle: Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, United States
Kristina Schlicht: Institute of Diabetes and Clinical Metabolic Research, University Medical Center Schleswig-Holstein, Kiel, Germany
Michael Krawczak: Institute of Medical Informatics and Statistics, Kiel University Medical Center Schleswig-Holstein, Kiel, Germany
Martin Stenman: Department of Clinical Sciences Lund, Lund University, Skåne University Hospital, Lund, Sweden
Martin Stenman: Department of Neurology, Lund University, Skåne University Hospital, Lund, Sweden
Arne G. Lindgren: Department of Clinical Sciences Lund, Lund University, Skåne University Hospital, Lund, Sweden
John W. Cole: Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, United States
John W. Cole: Veterans Affairs Maryland Healthcare System, University of Maryland School of Medicine, Baltimore, MD, United States
Caspar Grond-Ginsbach: Department of Vascular and Endovascular Surgery, University Hospital Heidelberg, Heidelberg, Germany

DOI: https://doi.org/10.3389/fgene.2023.1166972
Journal volume & issue: Vol. 14

Abstract

Objective: Most methods to detect copy number variation (CNV) have high false positive rates, especially for small CNVs and in real-life samples from clinical studies. In this study, we explored a novel scatterplot-based method to detect CNVs in microarray samples.

Methods: Illumina SNP microarray data from 13,254 individuals were analyzed with scatterplots and by PennCNV. The data were analyzed without the prior exclusion of low-quality samples. For CNV scatterplot visualization, the median signal intensity of all SNPs located within a CNV region was plotted against the median signal intensity of the flanking genomic region. Since CNV causes loss or gain of signal intensities, carriers of different CNV alleles pop up in clusters. Moreover, SNPs within a deletion are not heterozygous, whereas heterozygous SNPs within a duplication show a typical 1:2 signal distribution between the alleles. Scatterplot-based CNV calls were compared with standard results of PennCNV analysis. All discordant calls as well as a random selection of 100 concordant calls were individually analyzed by visual inspection after noise reduction.

Results: An algorithm for the automated scatterplot visualization of CNVs was developed and used to analyze six known CNV regions. Use of scatterplots and PennCNV yielded 1,019 concordant and 108 discordant CNV calls. All concordant calls were evaluated as true CNV findings. Among the 108 discordant calls, 7 were false positive findings by the scatterplot method, 80 were PennCNV false positives, and 21 were true CNVs detected by the scatterplot method but missed by PennCNV (i.e., false negative findings).

Conclusion: CNV visualization by scatterplots allows for a reliable and rapid detection of CNVs in large studies. This novel method may thus be used both to confirm the results of genome-wide CNV detection software and to identify known CNVs in hitherto untyped samples.
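The scatterplot coordinates described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the function name `scatter_coordinates`, the fixed `flank_size` window, the toy positions, and the intensity values are all assumptions made for illustration, and the abstract does not state which intensity measure or flank definition was used. For each sample, the sketch takes the median intensity of the SNPs inside a CNV region and the median intensity of the SNPs in the flanking region, giving one point per sample.

```python
from statistics import median


def scatter_coordinates(positions, intensities, cnv_start, cnv_end, flank_size):
    """Return (flank_median, cnv_median) for one sample.

    positions   -- SNP coordinates (hypothetical toy values here)
    intensities -- per-SNP signal intensity, in the same order as positions
    flank_size  -- assumed width of the flanking window on each side
    """
    inside, flank = [], []
    for pos, value in zip(positions, intensities):
        if cnv_start <= pos <= cnv_end:
            inside.append(value)  # SNP lies within the CNV region
        elif cnv_start - flank_size <= pos <= cnv_end + flank_size:
            flank.append(value)   # SNP lies in the flanking region
    if not inside or not flank:
        raise ValueError("no SNPs in the CNV region or in the flanking region")
    return median(flank), median(inside)


# Toy data: nine SNPs, with an assumed CNV region spanning positions 400-600.
positions = [100, 200, 300, 400, 500, 600, 700, 800, 900]
samples = {
    "normal": [0.02, -0.01, 0.00, 0.01, -0.02, 0.03, 0.00, -0.01, 0.01],
    "deletion_carrier": [0.01, -0.02, 0.00, -0.55, -0.60, -0.50, 0.02, 0.00, -0.01],
}

# One (x, y) point per sample; plotting these for all samples gives the scatterplot.
points = {
    name: scatter_coordinates(positions, values, 400, 600, 300)
    for name, values in samples.items()
}
```

In this toy example both samples share the same flank median, while the deletion carrier has a clearly lower median inside the region, so the two points separate along one axis. With many samples, carriers of the same CNV allele would be expected to group together in the way the abstract describes.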

Published in Frontiers in Genetics

ISSN: 1664-8021 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Biology (General): Genetics
Website: http://journal.frontiersin.org/journal/genetics
