PLoS ONE (Jan 2017)

An information-based network approach for protein classification.

  • Xiaogeng Wan,
  • Xin Zhao,
  • Stephen S T Yau

DOI
https://doi.org/10.1371/journal.pone.0174386
Journal volume & issue
Vol. 12, no. 3
p. e0174386

Abstract

Read online

Protein classification is one of the critical problems in bioinformatics. Early studies used geometric distances and polygenetic-tree to classify proteins. These methods use binary trees to present protein classification. In this paper, we propose a new protein classification method, whereby theories of information and networks are used to classify the multivariate relationships of proteins. In this study, protein universe is modeled as an undirected network, where proteins are classified according to their connections. Our method is unsupervised, multivariate, and alignment-free. It can be applied to the classification of both protein sequences and structures. Nine examples are used to demonstrate the efficiency of our new method.