Research (Jan 2023)
SARS-CoV-2 Spike Protein Post-Translational Modification Landscape and Its Impact on Protein Structure and Function via Computational Prediction
Abstract
To elucidate the role of post-translational modifications (PTMs) in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike protein’s structure and virulence, we generated a high-resolution map of 87 PTMs using liquid chromatography with tandem mass spectrometry data on the extracted spike protein from SARS-CoV-2 virions and then reconstituted its structure heterogeneity caused by PTMs. Nonetheless, Alphafold2, a high-accuracy artificial intelligence tool to perform protein structure prediction, relies solely on primary amino acid sequence, whereas the impact of PTM, which often modulates critical protein structure and function, is much ignored. To overcome this challenge, we proposed the mutagenesis approach—an in silico, site-directed amino acid substitution to mimic the influence of PTMs on protein structure due to altered physicochemical properties in the post-translationally modified amino acids—and then reconstituted the spike protein’s structure from the substituted sequences by Alphafold2. For the first time, the proposed method revealed predicted protein structures resulting from PTMs, a problem that Alphafold2 has yet to address. As an example, we performed computational analyses of the interaction of the post-translationally modified spike protein with its host factors such as angiotensin-converting enzyme 2 to illuminate binding affinity. Mechanistically, this study suggested the structural analysis of post-translationally modified protein via mutagenesis and deep learning. To summarize, the reconstructed spike protein structures showed that specific PTMs can be used to modulate host factor binding, guide antibody design, and pave the way for new therapeutic targets. The code and Supplementary Materials are freely available at https://github.com/LTZHKUSTGZ/SARS-CoV-2-spike-protein-PTM.