The Origin of Discrepancies between Predictions and Annotations in Intrinsically Disordered Proteins

Mátyás Pajkos; Gábor Erdős; Zsuzsanna Dosztányi

doi:10.3390/biom13101442

Biomolecules (Sep 2023)

The Origin of Discrepancies between Predictions and Annotations in Intrinsically Disordered Proteins

Mátyás Pajkos,
Gábor Erdős,
Zsuzsanna Dosztányi

Affiliations

Mátyás Pajkos: Department of Biochemistry, ELTE Eötvös Loránd University, Pázmány Péter Stny 1/c, H-1117 Budapest, Hungary
Gábor Erdős: Department of Biochemistry, ELTE Eötvös Loránd University, Pázmány Péter Stny 1/c, H-1117 Budapest, Hungary
Zsuzsanna Dosztányi: Department of Biochemistry, ELTE Eötvös Loránd University, Pázmány Péter Stny 1/c, H-1117 Budapest, Hungary

DOI: https://doi.org/10.3390/biom13101442
Journal volume & issue: Vol. 13, no. 10
p. 1442

Abstract

Read online

Disorder prediction methods that can discriminate between ordered and disordered regions have contributed fundamentally to our understanding of the properties and prevalence of intrinsically disordered proteins (IDPs) in proteomes as well as their functional roles. However, a recent large-scale assessment of the performance of these methods indicated that there is still room for further improvements, necessitating novel approaches to understand the strengths and weaknesses of individual methods. In this study, we compared two methods, IUPred and disorder prediction, based on the pLDDT scores derived from AlphaFold2 (AF2) models. We evaluated these methods using a dataset from the DisProt database, consisting of experimentally characterized disordered regions and subsets associated with diverse experimental methods and functions. IUPred and AF2 provided consistent predictions in 79% of cases for long disordered regions; however, for 15% of these cases, they both suggested order in disagreement with annotations. These discrepancies arose primarily due to weak experimental support, the presence of intermediate states, or context-dependent behavior, such as binding-induced transitions. Furthermore, AF2 tended to predict helical regions with high pLDDT scores within disordered segments, while IUPred had limitations in identifying linker regions. These results provide valuable insights into the inherent limitations and potential biases of disorder prediction methods.

Published in Biomolecules

ISSN: 2218-273X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Microbiology
Website: https://www.mdpi.com/journal/biomolecules

About the journal

Abstract

Keywords