PLoS ONE (Jan 2014)

DFA7, a new method to distinguish between intron-containing and intronless genes.

  • Chenglong Yu,
  • Mo Deng,
  • Lu Zheng,
  • Rong Lucy He,
  • Jie Yang,
  • Stephen S-T Yau

DOI
https://doi.org/10.1371/journal.pone.0101363
Journal volume & issue
Vol. 9, no. 7
p. e101363

Abstract

Intron-containing and intronless genes have different biological properties and statistical characteristics. Here we propose a new computational method to distinguish between intron-containing and intronless gene sequences. The method uses seven feature parameters, α, β, γ, λ, θ, φ and σ, derived from detrended fluctuation analysis (DFA), so that a 7-dimensional feature vector can be computed for any gene sequence to be classified. A support vector machine (SVM) classifier with a Gaussian radial basis function kernel is then applied to this feature space to classify genes as intron-containing or intronless. We compare the performance of the proposed method with that of other state-of-the-art algorithms on biological datasets. The experimental results show that the new method significantly improves accuracy over these existing techniques.
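
The abstract does not define the seven DFA-based parameters, so the following is only a minimal sketch of generic first-order DFA on a DNA sequence, not the authors' DFA7 feature extraction. It estimates the standard DFA scaling exponent (the slope of log F(n) against log n). The purine/pyrimidine encoding, the window sizes, and all function names are assumptions made for illustration.

```python
import math
import random


def linear_fit(xs, ys):
    """Ordinary least-squares line through (xs, ys); returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def encode_purine_pyrimidine(seq):
    """Assumed encoding (not from the paper): A/G -> +1, any other base -> -1."""
    return [1.0 if base in "AG" else -1.0 for base in seq.upper()]


def dfa_fluctuations(series, window_sizes):
    """Return the DFA fluctuation F(n) for each window size n (linear detrending)."""
    mean = sum(series) / len(series)
    # Integrated profile: cumulative sum of the mean-centred series.
    profile, total = [], 0.0
    for value in series:
        total += value - mean
        profile.append(total)

    fluctuations = []
    for n in window_sizes:
        xs = list(range(n))
        squared_residuals, points = 0.0, 0
        # Non-overlapping windows; a trailing partial window is ignored.
        for start in range(0, len(profile) - n + 1, n):
            window = profile[start:start + n]
            slope, intercept = linear_fit(xs, window)
            squared_residuals += sum(
                (y - (slope * x + intercept)) ** 2 for x, y in zip(xs, window)
            )
            points += n
        fluctuations.append(math.sqrt(squared_residuals / points))
    return fluctuations


def dfa_exponent(series, window_sizes):
    """Slope of log F(n) versus log n, i.e. the usual DFA scaling exponent."""
    fluctuations = dfa_fluctuations(series, window_sizes)
    log_n = [math.log(n) for n in window_sizes]
    log_f = [math.log(f) for f in fluctuations]
    slope, _ = linear_fit(log_n, log_f)
    return slope


if __name__ == "__main__":
    random.seed(0)
    # Synthetic random sequence, used only to exercise the code.
    sequence = "".join(random.choice("ACGT") for _ in range(4096))
    series = encode_purine_pyrimidine(sequence)
    print(dfa_exponent(series, [8, 16, 32, 64, 128]))
```

For an uncorrelated random sequence the exponent should come out near 0.5, while long-range correlated sequences give larger values. In a pipeline like the one the abstract describes, a vector of such DFA-derived features per gene would then be passed to an SVM with an RBF kernel. That classification step is omitted here because the abstract does not specify the features or the kernel settings.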