Complexity (Jan 2019)

LipoFNT: Lipoylation Sites Identification with Flexible Neural Tree

  • Wenzheng Bao,
  • Bin Yang,
  • Rong Bao,
  • Yuehui Chen

DOI
https://doi.org/10.1155/2019/1603867
Journal volume & issue
Vol. 2019

Abstract

Read online

Lysine lipoylation is a special type of posttranslational modification in both prokaryotes’ and eukaryotes’ proteomics researches. Such a modification takes part in several significant biological processions and plays a key role in the cellular level. In order to construct and design an accurate classification algorithm for identifying lipoylation sites in the protein level, the computational approaches should be taken into account in this field. Meanwhile, several factors plays different role in the identification of modification sites. Considering such a situation, the foundational elements of the effective identification of modification sites are the available feature description and the high effective classification. With these two elements, the distinguishing between the lipoylation samples and the nonlipoylation samples can be treated as a typical classification issue in the field of machine learning. In this work, we have proposed a method named LipoFNT, which employed the two featuring sets, including the Position-Specific Scoring Matrix and bi-profile Bayesian, as the classification features. And then, the flexible neural tree algorithm is utilized to deal with the imbalance classification issue in lipoylation modification sample dataset. The proposed method can achieve 81.07% in sn%, 80.29% in sp, 80.68% in Acc, 0.8076 in F1, and 0.6136 in MCC, respectively. Meanwhile, we have demonstrated the relationship between the lengths of peptide and identification of modification sites.