PLoS ONE (Jan 2014)

A generalized topological entropy for analyzing the complexity of DNA sequences.

  • Shuilin Jin,
  • Renjie Tan,
  • Qinghua Jiang,
  • Li Xu,
  • Jiajie Peng,
  • Yong Wang,
  • Yadong Wang

DOI
https://doi.org/10.1371/journal.pone.0088519
Journal volume & issue
Vol. 9, no. 2
p. e88519

Abstract

Topological entropy is among the most difficult entropy measures to apply to DNA sequences, owing to finite-sample and high-dimensionality problems. To overcome these problems, a generalized topological entropy is introduced. A comparison of the two shows that topological entropy is a special case of the generalized topological entropy. As an application, the generalized topological entropy of introns, exons, and promoter regions was computed. The results indicate that, for each chromosome, the entropy of introns is higher than that of exons, and the entropy of exons is higher than that of promoter regions, which suggests that the DNA sequences of promoter regions are more regular than those of exons and introns.