Integrating structure-based machine learning and co-evolution to investigate specificity in plant sesquiterpene synthases.

Janani Durairaj; Elena Melillo; Harro J Bouwmeester; Jules Beekwilder; Dick de Ridder; Aalt D J van Dijk

doi:10.1371/journal.pcbi.1008197

PLoS Computational Biology (Mar 2021)

Integrating structure-based machine learning and co-evolution to investigate specificity in plant sesquiterpene synthases.

Janani Durairaj,
Elena Melillo,
Harro J Bouwmeester,
Jules Beekwilder,
Dick de Ridder,
Aalt D J van Dijk

Affiliations

Janani Durairaj
Elena Melillo
Harro J Bouwmeester
Jules Beekwilder
Dick de Ridder
Aalt D J van Dijk

DOI: https://doi.org/10.1371/journal.pcbi.1008197
Journal volume & issue: Vol. 17, no. 3
p. e1008197

Abstract

Read online

Sesquiterpene synthases (STSs) catalyze the formation of a large class of plant volatiles called sesquiterpenes. While thousands of putative STS sequences from diverse plant species are available, only a small number of them have been functionally characterized. Sequence identity-based screening for desired enzymes, often used in biotechnological applications, is difficult to apply here as STS sequence similarity is strongly affected by species. This calls for more sophisticated computational methods for functionality prediction. We investigate the specificity of precursor cation formation in these elusive enzymes. By inspecting multi-product STSs, we demonstrate that STSs have a strong selectivity towards one precursor cation. We use a machine learning approach combining sequence and structure information to accurately predict precursor cation specificity for STSs across all plant species. We combine this with a co-evolutionary analysis on the wealth of uncharacterized putative STS sequences, to pinpoint residues and distant functional contacts influencing cation formation and reaction pathway selection. These structural factors can be used to predict and engineer enzymes with specific functions, as we demonstrate by predicting and characterizing two novel STSs from Citrus bergamia.

Published in PLoS Computational Biology

ISSN: 1553-734X (Print); 1553-7358 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Science: Biology (General)
Website: https://journals.plos.org/ploscompbiol/

About the journal