PLoS ONE (Jan 2015)

Prediction of protein structural features from sequence data based on Shannon entropy and Kolmogorov complexity.

  • Robert Paul Bywater

DOI
https://doi.org/10.1371/journal.pone.0119306
Journal volume & issue
Vol. 10, no. 4
p. e0119306

Abstract

Read online

While the genome for a given organism stores the information necessary for the organism to function and flourish it is the proteins that are encoded by the genome that perhaps more than anything else characterize the phenotype for that organism. It is therefore not surprising that one of the many approaches to understanding and predicting protein folding and properties has come from genomics and more specifically from multiple sequence alignments. In this work I explore ways in which data derived from sequence alignment data can be used to investigate in a predictive way three different aspects of protein structure: secondary structures, inter-residue contacts and the dynamics of switching between different states of the protein. In particular the use of Kolmogorov complexity has identified a novel pathway towards achieving these goals.