Computational and Structural Biotechnology Journal (Jan 2023)
3plex enables deep computational investigation of triplex forming lncRNAs
Abstract
Long non-coding RNAs (lncRNAs) regulate gene expression through different molecular mechanisms, including DNA binding via the formation of RNA:DNA:DNA triple helices (TPXs). Despite the increasing amount of experimental evidence, TPXs investigation remains challenging. Here we present 3plex, a software able to predict TPX interactions in silico. Given an RNA sequence and a set of DNA sequences, 3plex integrates 1) Hoogsteen pairing rules that describe the biochemical interactions between RNA and DNA nucleotides, 2) RNA secondary structure prediction and 3) determination of the TPX thermal stability derived from a collection of TPX experimental evidences. We systematically collected and uniformly re-analysed published experimental lncRNA binding sites on human and mouse genomes. We used these data to evaluate 3plex performance and showed that its specific features allow a reliable identification of TPX interactions. We compared 3plex with the other available software and obtained comparable or even better accuracy at a fraction of the computation time. Interestingly, by inspecting collected data with 3plex we found that TPXs tend to be shorter and more degenerated than previously expected and that the majority of analysed lncRNAs can directly bind to the genome by TPX formation. Those results suggest that an important fraction of lncRNAs can exert its biological function through this mechanism. The software is available at https://github.com/molinerisLab/3plex.