Applying Length-Dependent Stochastic Context-Free Grammars to RNA Secondary Structure Prediction

Frank Weinberg; Markus E. Nebel

doi:10.3390/a4040223

Algorithms (Oct 2011)

Applying Length-Dependent Stochastic Context-Free Grammars to RNA Secondary Structure Prediction

Frank Weinberg,
Markus E. Nebel

Affiliations

Frank Weinberg
Markus E. Nebel

DOI: https://doi.org/10.3390/a4040223
Journal volume & issue: Vol. 4, no. 4
pp. 223 – 238

Abstract

Read online

In order to be able to capture effects from co-transcriptional folding, we extend stochastic context-free grammars such that the probability of applying a rule can depend on the length of the subword that is eventually generated from the symbols introduced by the rule, and we show that existing algorithms for training and for determining the most probable parse tree can easily be adapted to the extended model without losses in performance. Furthermore, we show that the extended model is suited to improve the quality of predictions of RNA secondary structures. The extended model may also be applied to other fields where stochastic context-free grammars are used like natural language processing. Additionally some interesting questions in the field of formal languages arise from it.

Published in Algorithms

ISSN: 1999-4893 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/algorithms

About the journal

Abstract

Keywords