Genome Medicine (Sep 2017)

Identification of novel candidate disease genes from de novo exonic copy number variants

  • Tomasz Gambin,
  • Bo Yuan,
  • Weimin Bi,
  • Pengfei Liu,
  • Jill A. Rosenfeld,
  • Zeynep Coban-Akdemir,
  • Amber N. Pursley,
  • Sandesh C. S. Nagamani,
  • Ronit Marom,
  • Sailaja Golla,
  • Lauren Dengle,
  • Heather G. Petrie,
  • Reuben Matalon,
  • Lisa Emrick,
  • Monica B. Proud,
  • Diane Treadwell-Deering,
  • Hsiao-Tuan Chao,
  • Hannele Koillinen,
  • Chester Brown,
  • Nora Urraca,
  • Roya Mostafavi,
  • Saunder Bernes,
  • Elizabeth R. Roeder,
  • Kimberly M. Nugent,
  • Patricia I. Bader,
  • Gary Bellus,
  • Michael Cummings,
  • Hope Northrup,
  • Myla Ashfaq,
  • Rachel Westman,
  • Robert Wildin,
  • Anita E. Beck,
  • LaDonna Immken,
  • Lindsay Elton,
  • Shaun Varghese,
  • Edward Buchanan,
  • Laurence Faivre,
  • Mathilde Lefebvre,
  • Christian P. Schaaf,
  • Magdalena Walkiewicz,
  • Yaping Yang,
  • Sung-Hae L. Kang,
  • Seema R. Lalani,
  • Carlos A. Bacino,
  • Arthur L. Beaudet,
  • Amy M. Breman,
  • Janice L. Smith,
  • Sau Wai Cheung,
  • James R. Lupski,
  • Ankita Patel,
  • Chad A. Shaw,
  • Paweł Stankiewicz

DOI
https://doi.org/10.1186/s13073-017-0472-7
Journal volume & issue
Vol. 9, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Background Exon-targeted microarrays can detect small (<1000 bp) intragenic copy number variants (CNVs), including those that affect only a single exon. This genome-wide high-sensitivity approach increases the molecular diagnosis for conditions with known disease-associated genes, enables better genotype–phenotype correlations, and facilitates variant allele detection allowing novel disease gene discovery. Methods We retrospectively analyzed data from 63,127 patients referred for clinical chromosomal microarray analysis (CMA) at Baylor Genetics laboratories, including 46,755 individuals tested using exon-targeted arrays, from 2007 to 2017. Small CNVs harboring a single gene or two to five non-disease-associated genes were identified; the genes involved were evaluated for a potential disease association. Results In this clinical population, among rare CNVs involving any single gene reported in 7200 patients (11%), we identified 145 de novo autosomal CNVs (117 losses and 28 intragenic gains), 257 X-linked deletion CNVs in males, and 1049 inherited autosomal CNVs (878 losses and 171 intragenic gains); 111 known disease genes were potentially disrupted by de novo autosomal or X-linked (in males) single-gene CNVs. Ninety-one genes, either recently proposed as candidate disease genes or not yet associated with diseases, were disrupted by 147 single-gene CNVs, including 37 de novo deletions and ten de novo intragenic duplications on autosomes and 100 X-linked CNVs in males. Clinical features in individuals with de novo or X-linked CNVs encompassing at most five genes (224 bp to 1.6 Mb in size) were compared to those in individuals with larger-sized deletions (up to 5 Mb in size) in the internal CMA database or loss-of-function single nucleotide variants (SNVs) detected by clinical or research whole-exome sequencing (WES). This enabled the identification of recently published genes (BPTF, NONO, PSMD12, TANGO2, and TRIP12), novel candidate disease genes (ARGLU1 and STK3), and further confirmation of disease association for two recently proposed disease genes (MEIS2 and PTCHD1). Notably, exon-targeted CMA detected several pathogenic single-exon CNVs missed by clinical WES analyses. Conclusions Together, these data document the efficacy of exon-targeted CMA for detection of genic and exonic CNVs, complementing and extending WES in clinical diagnostics, and the potential for discovery of novel disease genes by genome-wide assay.

Keywords