Iranian Journal of Chemistry & Chemical Engineering (Dec 2010)

Mining Biological Repetitive Sequences Using Support Vector Machines and Fuzzy SVM

  • Hesam Torabi Dashti,
  • Ali Masoudi-Nejad

Journal volume & issue
Vol. 29, no. 4
pp. 1 – 17

Abstract

Structural repetitive subsequences are among the most important portions of biological sequences and play crucial roles in the folding and function of the sequences that contain them. The largest class of repetitive subsequences is the “Transposable Elements”, which are divided into sub-classes according to their structure. Much research has been carried out to determine the structure and function of repetitive subsequences. Sequencing noise and the substitution probabilities of sequences are obstacles to this research. Several statistical and approximation algorithms have been introduced to tackle these obstacles. With the introduction of statistical machine learning methods based on Support Vector Machines, machine learning approaches have become potent methods for solving the pattern-finding problem. Support Vector Machine methods are time-efficient and, depending on their parameters, can be precise and accurate. In this review, a mathematical definition of structural repetitive subsequences is introduced; thereafter, algorithms proposed for the simple pattern-finding problem, which can also be applied to structural patterns, are reviewed. Theoretical aspects of Support Vector Machines in computational biology are considered. Finally, a novel evolutionary Fuzzy SVM is introduced, which is applicable to a wide range of bioinformatics problems, especially the problem of structural repetitive subsequences.

Keywords