A systematic assessment of the impact of rare canonical splice site variants on splicing using functional and in silico methods
Rachel Y. Oh,
Ali AlMail,
David Cheerie,
George Guirguis,
Huayun Hou,
Kyoko E. Yuki,
Bushra Haque,
Bhooma Thiruvahindrapuram,
Christian R. Marshall,
Roberto Mendoza-Londono,
Adam Shlien,
Lianna G. Kyriakopoulou,
Susan Walker,
James J. Dowling,
Michael D. Wilson,
Gregory Costain
Affiliations
Rachel Y. Oh
Division of Clinical and Metabolic Genetics, Hospital for Sick Children, Toronto, ON, Canada; Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
Ali AlMail
Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada; Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada
David Cheerie
Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
George Guirguis
Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
Huayun Hou
Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada
Kyoko E. Yuki
Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Division of Genome Diagnostics, Hospital for Sick Children, Toronto, ON, Canada
Bushra Haque
Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
Bhooma Thiruvahindrapuram
The Centre for Applied Genomics, SickKids Research Institute, Toronto, ON, Canada
Christian R. Marshall
Division of Genome Diagnostics, Hospital for Sick Children, Toronto, ON, Canada; Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
Roberto Mendoza-Londono
Division of Clinical and Metabolic Genetics, Hospital for Sick Children, Toronto, ON, Canada; Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Department of Paediatrics, University of Toronto, Toronto, ON, Canada
Adam Shlien
Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada; Division of Genome Diagnostics, Hospital for Sick Children, Toronto, ON, Canada; Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
Lianna G. Kyriakopoulou
Division of Genome Diagnostics, Hospital for Sick Children, Toronto, ON, Canada; Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
Susan Walker
The Centre for Applied Genomics, SickKids Research Institute, Toronto, ON, Canada
James J. Dowling
Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada; Department of Paediatrics, University of Toronto, Toronto, ON, Canada; Division of Neurology, Hospital for Sick Children, Toronto, ON, Canada
Michael D. Wilson
Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
Gregory Costain
Division of Clinical and Metabolic Genetics, Hospital for Sick Children, Toronto, ON, Canada; Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada; Department of Paediatrics, University of Toronto, Toronto, ON, Canada; Corresponding author
Summary: Canonical splice site variants (CSSVs) are often presumed to cause loss-of-function (LoF) and are assigned very strong evidence of pathogenicity (according to American College of Medical Genetics/Association for Molecular Pathology criterion PVS1). The exact nature and predictability of splicing effects of unselected rare CSSVs in blood-expressed genes are poorly understood. We identified 168 rare CSSVs in blood-expressed genes in 112 individuals using genome sequencing, and studied their impact on splicing using RNA sequencing (RNA-seq). There was no evidence of a frameshift, nor of reduced expression consistent with nonsense-mediated decay, for 25.6% of CSSVs: 17.9% had wildtype splicing only and normal junction depths, 3.6% resulted in cryptic splice site usage and in-frame insertions or deletions, 3.6% resulted in full exon skipping (in frame), and 0.6% resulted in full intron inclusion (in frame). Blind to these RNA-seq data, we attempted to predict the precise impact of CSSVs by applying in silico tools and the ClinGen Sequence Variant Interpretation Working Group 2018 guidelines for applying PVS1 criterion. The predicted impact on splicing using (1) SpliceAI, (2) MaxEntScan, and (3) AutoPVS1, an automatic classification tool for PVS1 interpretation of null variants that utilizes Ensembl Variant Effect Predictor and MaxEntScan, was concordant with RNA-seq analyses for 65%, 63%, and 61% of CSSVs, respectively. In summary, approximately one in four rare CSSVs did not show evidence for LoF based on analysis of RNA-seq data. Predictions from in silico methods were often discordant with findings from RNA-seq. More caution may be warranted in applying PVS1-level evidence to CSSVs in the absence of functional data.