A three-state prediction of single point mutations on protein stability changes

Rossi Ivan; Fariselli Piero; Capriotti Emidio; Casadio Rita

doi:10.1186/1471-2105-9-S2-S6

BMC Bioinformatics (Mar 2008)

A three-state prediction of single point mutations on protein stability changes

Rossi Ivan,
Fariselli Piero,
Capriotti Emidio,
Casadio Rita

Affiliations

Rossi Ivan
Fariselli Piero
Capriotti Emidio
Casadio Rita

DOI: https://doi.org/10.1186/1471-2105-9-S2-S6
Journal volume & issue: Vol. 9, no. Suppl 2
p. S6

Abstract

Read online

Abstract Background A basic question of protein structural studies is to which extent mutations affect the stability. This question may be addressed starting from sequence and/or from structure. In proteomics and genomics studies prediction of protein stability free energy change (ΔΔG) upon single point mutation may also help the annotation process. The experimental ΔΔG values are affected by uncertainty as measured by standard deviations. Most of the ΔΔG values are nearly zero (about 32% of the ΔΔG data set ranges from −0.5 to 0.5 kcal/mole) and both the value and sign of ΔΔG may be either positive or negative for the same mutation blurring the relationship among mutations and expected ΔΔG value. In order to overcome this problem we describe a new predictor that discriminates between 3 mutation classes: destabilizing mutations (ΔΔG1.0 kcal/mole) and neutral mutations (−1.0≤ΔΔG≤1.0 kcal/mole). Results In this paper a support vector machine starting from the protein sequence or structure discriminates between stabilizing, destabilizing and neutral mutations. We rank all the possible substitutions according to a three state classification system and show that the overall accuracy of our predictor is as high as 56% when performed starting from sequence information and 61% when the protein structure is available, with a mean value correlation coefficient of 0.27 and 0.35, respectively. These values are about 20 points per cent higher than those of a random predictor. Conclusions Our method improves the quality of the prediction of the free energy change due to single point protein mutations by adopting a hypothesis of thermodynamic reversibility of the existing experimental data. By this we both recast the thermodynamic symmetry of the problem and balance the distribution of the available experimental measurements of free energy changes. This eliminates possible overestimations of the previously described methods trained on an unbalanced data set comprising a number of destabilizing mutations higher than stabilizing ones.

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal