Frontiers in Molecular Biosciences (Aug 2022)

Challenges in describing the conformation and dynamics of proteins with ambiguous behavior

  • Joel Roca-Martinez,
  • Tamas Lazar,
  • Jose Gavalda-Garcia,
  • David Bickel,
  • Rita Pancsa,
  • Bhawna Dixit,
  • Konstantina Tzavella,
  • Pathmanaban Ramasamy,
  • Maite Sanchez-Fornaris,
  • Isel Grau,
  • Wim F. Vranken

DOI
https://doi.org/10.3389/fmolb.2022.959956
Journal volume & issue
Vol. 9

Abstract

Traditionally, our understanding of how proteins operate and how evolution shapes them is based on two main data sources: the overall protein fold and the protein amino acid sequence. However, a significant part of the proteome shows highly dynamic and/or structurally ambiguous behavior, which cannot be correctly represented by the traditional fixed set of static coordinates. Representing such protein behaviors remains challenging and necessarily involves a complex interpretation of conformational states, including probabilistic descriptions. Relating protein dynamics and multiple conformations to protein function and physiological context (e.g., post-translational modifications and subcellular localization) therefore remains elusive for much of the proteome, and studies of the effect of protein dynamics rely heavily on computational models. Here, we investigate the possibility of delineating three classes of protein conformational behavior: order, disorder, and ambiguity. We explore these definitions on three different datasets, applying interpretable machine learning to a set of features that ranges from AlphaFold2 to sequence-based predictions, to understand the overlap and differences between the datasets. This forms the basis for a discussion of the current limitations in describing the behavior of dynamic and ambiguous proteins.

Keywords