BMC Genomic Data (Nov 2024)
In silico splicing analysis of the PMS2 gene: exploring alternative molecular mechanisms in PMS2-associated Lynch syndrome
Abstract
Abstract Lynch syndrome (LS) is one of the most common hereditary cancer syndrome in human populations, associated with germline variants in MLH1, MSH2/EPCAM, MSH6 and PMS2 genes. The advent of next generation sequencing has proven a significant impact in germline variant detection in the causative genes; however, a large proportion of patients with clinical criteria still receive uncertain or negative results. PMS2 is the least frequent reported gene, associated with up to 15% of LS cases with late-onset disease and low penetrance phenotype; however, the proportion of PMS2-LS cases is considered to be highly underestimated. In this context, our analysis aimed to improve the current diagnostic yield by focusing on missense and intronic PMS2 variants available in public clinical databases (ClinVar, LOVD). We performed an in silico assessment of the wild-type DNA sequence and the reported genetic variants, employing splicing bioinformatics tools known for their effectiveness in other genes. Splicing variants were predicted in silico and using GTEx short-read RNA expression data. Out of the 2384 missense variants discovered, 90% were classified with uncertain significance (VUS). 4.9% of missense variants were shown to have a potential splicing consequence (DS > 0.2) using SpliceAI. As described in the original publication, SpliceAI-visual was proven effective in annotation of short intronic variants ( 0.1). Exons 2, 3, 4, 5, 6, 7, 8, 11, 12 and 14 were consistently predicted in at least three out of eight software with weak canonical splice sites. Additionally, we noted that both Exonic Splicing Enhancers (ESEs) and Exonic Splicing Silencers (ESSs) contribute significantly to alternative splicing and exonic selection in PMS2 gene. Specifically, ESE motifs were consistently more abundant in highly expressed exons 5, 11 and 14, while ESS motifs played a fundamental role in exons 6, 7 and 10. Computational analysis performed in our study serves as a valuable filtering step for guiding further RNA experiments. Additional functional data is necessary to validate our findings.
Keywords