Molecules (Jun 2024)
Scrutinising the Conformational Ensemble of the Intrinsically Mixed-Folded Protein Galectin-3
Abstract
Galectin-3 is a protein involved in many intra- and extra-cellular processes. It has been identified as a diagnostic or prognostic biomarker for certain types of heart disease, kidney disease and cancer. Galectin-3 comprises a carbohydrate recognition domain (CRD) and an N-terminal domain (NTD), which is unstructured and contains eight collagen-like Pro-Gly-rich tandem repeats. While the structure of the CRD has been solved using protein crystallography, current knowledge about the conformations of full-length galectin-3 is limited. To fill this knowledge gap, we performed molecular dynamics (MD) simulations of full-length galectin-3. We systematically re-scaled the solute–solvent interactions in the Martini 3 force field to obtain the best possible agreement between available data from small-angle X-ray scattering (SAXS) experiments and the ensemble of conformations generated in the MD simulations. The simulated conformations were found to be very diverse, as reflected, e.g., by (i) large fluctuations in the radius of gyration, ranging from about 2 to 5 nm, and (ii) multiple transient contacts made by amino acid residues in the NTD. Consistent with evidence from NMR experiments, contacts between the CRD and NTD were observed not to involve the carbohydrate-binding site on the CRD surface. Contacts within the NTD were found to be made most frequently by aromatic residues. Formation of fuzzy complexes with unspecific stoichiometry was observed to be mediated mostly by the NTD. Taken together, we offer a detailed picture of the conformational ensemble of full-length galectin-3, which will be important for explaining the biological functions of this protein at the molecular level.
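The radius of gyration quoted above is a standard measure of how compact a conformation is. As an illustration only, the following minimal sketch shows how the mass-weighted radius of gyration of each simulation frame, and its range over an ensemble, could be computed. It is not the analysis code used in this work; the function names `radius_of_gyration` and `rg_range` and the input layout (lists of bead coordinates in nm) are assumptions made for the example.

```python
import math


def radius_of_gyration(coords, masses):
    """Mass-weighted radius of gyration of a single conformation.

    coords: sequence of (x, y, z) bead positions, e.g. in nm (assumed layout)
    masses: sequence of bead masses, same length as coords
    """
    if not coords or len(coords) != len(masses):
        raise ValueError("coords and masses must be non-empty and of equal length")
    total_mass = sum(masses)
    # Centre of mass, computed one Cartesian component at a time.
    com = [
        sum(m * r[k] for m, r in zip(masses, coords)) / total_mass
        for k in range(3)
    ]
    # Mass-weighted mean squared distance of the beads from the centre of mass.
    msd = sum(
        m * sum((r[k] - com[k]) ** 2 for k in range(3))
        for m, r in zip(masses, coords)
    ) / total_mass
    return math.sqrt(msd)


def rg_range(frames, masses):
    """Smallest and largest radius of gyration over a list of frames."""
    values = [radius_of_gyration(frame, masses) for frame in frames]
    return min(values), max(values)
```

Applied to an ensemble as diverse as the one described here, `rg_range` would be expected to return a wide interval; in practice the coordinates would be read from the trajectory with a dedicated MD analysis library rather than supplied as plain lists.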
Keywords