Nature Communications (Jul 2024)

Context-aware geometric deep learning for protein sequence design

  • Lucien F. Krapp,
  • Fernando A. Meireles,
  • Luciano A. Abriata,
  • Jean Devillard,
  • Sarah Vacle,
  • Maria J. Marcaida,
  • Matteo Dal Peraro

DOI
https://doi.org/10.1038/s41467-024-50571-y
Journal volume & issue
Vol. 15, no. 1
pp. 1 – 10

Abstract

Protein design and engineering are evolving at an unprecedented pace, leveraging advances in deep learning. Current models nonetheless cannot natively consider non-protein entities within the design process. Here, we introduce a deep learning approach based solely on a geometric transformer of atomic coordinates and element names that predicts protein sequences from backbone scaffolds while accounting for the restraints imposed by diverse molecular environments. To validate the method, we show that it can produce highly thermostable, catalytically active enzymes with high success rates. This concept is anticipated to improve the versatility of protein design pipelines for crafting desired functions.