Genome Biology (Nov 2019)

RegSNPs-intron: a computational framework for predicting pathogenic impact of intronic single nucleotide variants

  • Hai Lin,
  • Katherine A. Hargreaves,
  • Rudong Li,
  • Jill L. Reiter,
  • Yue Wang,
  • Matthew Mort,
  • David N. Cooper,
  • Yaoqi Zhou,
  • Chi Zhang,
  • Michael T. Eadon,
  • M. Eileen Dolan,
  • Joseph Ipe,
  • Todd C. Skaar,
  • Yunlong Liu

DOI
https://doi.org/10.1186/s13059-019-1847-4
Journal volume & issue
Vol. 20, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Single nucleotide variants (SNVs) in intronic regions have yet to be systematically investigated for their disease-causing potential. Using known pathogenic and neutral intronic SNVs (iSNVs) as training data, we develop the RegSNPs-intron algorithm based on a random forest classifier that integrates RNA splicing, protein structure, and evolutionary conservation features. RegSNPs-intron showed excellent performance in evaluating the pathogenic impacts of iSNVs. Using a high-throughput functional reporter assay called ASSET-seq (ASsay for Splicing using ExonTrap and sequencing), we evaluate the impact of RegSNPs-intron predictions on splicing outcome. Together, RegSNPs-intron and ASSET-seq enable effective prioritization of iSNVs for disease pathogenesis.

Keywords