Computational and Structural Biotechnology Journal (Jan 2021)
A systematic structural comparison of all solved small proteins deposited in PDB. The effect of disulfide bonds in protein fold
Abstract
Defensins are small (usually 3–6 kDa), amphipathic, disulfide-rich proteins with a small or even absent hydrophobic core. Since a hydrophobic core is generally found in globular proteins that fold in an aqueous solvent, the peculiar fold of defensins can challenge tertiary protein structure predictors. We performed a Protein Data Bank survey of small proteins (3–6 kDa) to understand how defensins resemble other small disulfide-rich proteins. We found no differences between defensins and non-defensins in the proportion of apolar, polar and charged residues or in the exposure of these residues to the solvent. We then divided all small proteins (3–6 kDa) in the Protein Data Bank into two groups: one with at least one disulfide bond (bonded, defensins included) and one without any disulfide bond (unbonded). Apolar residues were more exposed to the solvent in the bonded group than in the unbonded group. Robetta, an ab initio algorithm for tertiary protein structure prediction, predicted unbonded proteins more accurately than bonded ones. In contrast, trRosetta, which uses artificial intelligence, improved the prediction of most bonded proteins but gave no improvement for the unbonded group. Our work highlights one more layer of complexity for the prediction of protein tertiary structure: the ability of small disulfide-rich proteins to fold even with a weakly hydrophobic core.
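The grouping and composition comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the residue classes below follow one common convention (the abstract does not state which scheme was used), the function names `residue_class_fractions` and `split_by_disulfide` are hypothetical, and the number of disulfide bonds per entry is assumed to be already known, for example from the structure file. Solvent exposure is not covered here.

```python
from collections import Counter

# Assumed residue classes; the source does not specify its classification scheme.
APOLAR = set("AVLIMFWPG")
POLAR = set("STNQYC")
CHARGED = set("DEKRH")


def residue_class_fractions(sequence):
    """Return the fraction of apolar, polar and charged residues in a sequence."""
    counts = Counter()
    for aa in sequence.upper():
        if aa in APOLAR:
            counts["apolar"] += 1
        elif aa in POLAR:
            counts["polar"] += 1
        elif aa in CHARGED:
            counts["charged"] += 1
        # Non-standard or unknown letters are ignored.
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard residues")
    return {k: counts[k] / total for k in ("apolar", "polar", "charged")}


def split_by_disulfide(entries):
    """Split (name, sequence, n_disulfides) tuples into bonded and unbonded groups.

    An entry is 'bonded' if it has at least one disulfide bond.
    """
    groups = {"bonded": [], "unbonded": []}
    for name, _sequence, n_disulfides in entries:
        key = "bonded" if n_disulfides >= 1 else "unbonded"
        groups[key].append(name)
    return groups
```

Calling `residue_class_fractions` on each sequence in the two groups returned by `split_by_disulfide` gives per-protein compositions that could then be compared between the bonded and unbonded sets.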