MATEC Web of Conferences (Jan 2016)

Prediction of Bacterial Virulent Proteins with Composition Moment Vector Feature Encoding Method

  • Gök Murat,
  • Herand Deniz

DOI
https://doi.org/10.1051/matecconf/20164907001
Journal volume & issue
Vol. 49
p. 07001

Abstract

Prediction of bacterial virulent proteins is critical for vaccine development and for understanding virulence mechanisms in pathogens. To this end, a number of feature encoding methods based on the sequence and evolutionary information of a given protein have been proposed and combined with various classifier algorithms. In this paper, we apply the composition moment vector (CMV) feature encoding method, which captures both the composition and the positions of amino acids in the protein sequence, to predict bacterial virulent proteins. The method was validated on three independent datasets. Experimental results show that the CMV feature encoding method leads to better classification performance in terms of accuracy, sensitivity, F-measure and Matthews correlation coefficient (MCC) scores across diverse classifiers.
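
The abstract does not give the CMV formula, so the following is only a minimal sketch of one formulation commonly used in the protein sequence literature, not necessarily the exact encoding used in this paper. It assumes that the order-k moment for amino acid i is the sum of the k-th powers of its 1-based positions, divided by N(N-1)...(N-k) for a sequence of length N, so that order 0 reduces to plain amino acid composition. The function name, the alphabet ordering and the handling of non-standard residues are illustrative assumptions.

```python
from math import prod

# The 20 standard amino acids; this ordering is an arbitrary choice for the sketch.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_moment_vector(sequence, order=1):
    """Return a 20-dimensional composition moment vector (assumed formulation).

    Each entry is sum(position ** order) over the occurrences of one amino
    acid, divided by N * (N - 1) * ... * (N - order), with 1-based positions.
    """
    n = len(sequence)
    if n <= order:
        raise ValueError("sequence must be longer than the moment order")

    # Normalisation term: product of (N - d) for d = 0..order.
    denom = prod(n - d for d in range(order + 1))

    sums = {aa: 0 for aa in AMINO_ACIDS}
    for pos, residue in enumerate(sequence.upper(), start=1):
        # Assumption: non-standard residues (e.g. X) are skipped but still
        # count toward the sequence length and the position index.
        if residue in sums:
            sums[residue] += pos ** order

    return [sums[aa] / denom for aa in AMINO_ACIDS]


# Order 0 gives composition only; order 1 also reflects where residues occur.
composition = composition_moment_vector("MKVLAAGIVGL", order=0)
first_moment = composition_moment_vector("MKVLAAGIVGL", order=1)
```

Under this formulation, two sequences with the same amino acid composition but a different residue order share the same order-0 vector and differ at order 1, which is the positional information the abstract refers to. Vectors of several orders could be concatenated before being passed to a classifier.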