Feature sequence-based genome mining uncovers the hidden diversity of bacterial siderophore pathways

Shaohua Gu; Yuanzhe Shao; Karoline Rehm; Laurent Bigler; Di Zhang; Ruolin He; Ruichen Xu; Jiqi Shao; Alexandre Jousset; Ville-Petri Friman; Xiaoying Bian; Zhong Wei; Rolf Kümmerli; Zhiyuan Li

doi:10.7554/eLife.96719

eLife (Oct 2024)

Affiliations

Shaohua Gu: Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
Yuanzhe Shao: Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
Karoline Rehm: University of Zurich, Department of Chemistry, Zurich, Switzerland
Laurent Bigler: University of Zurich, Department of Chemistry, Zurich, Switzerland
Di Zhang: Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
Ruolin He: Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
Ruichen Xu: School of Life Science, Shandong University, Qingdao, China
Jiqi Shao: Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
Alexandre Jousset: Jiangsu Provincial Key Lab for Organic Solid Waste Utilization, Key Lab of Organic-based Fertilizers of China, Nanjing Agricultural University, Nanjing, China
Ville-Petri Friman: University of Helsinki, Department of Microbiology, Helsinki, Finland
Xiaoying Bian: Helmholtz International Lab for Anti-infectives, State Key Laboratory of Microbial Technology, Shandong University, Qingdao, China
Zhong Wei: Jiangsu Provincial Key Lab for Organic Solid Waste Utilization, Key Lab of Organic-based Fertilizers of China, Nanjing Agricultural University, Nanjing, China
Rolf Kümmerli: University of Zurich, Department of Quantitative Biomedicine, Zurich, Switzerland
Zhiyuan Li: Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China

Journal volume & issue: Vol. 13

Abstract

Microbial secondary metabolites are a rich source for pharmaceutical discovery and serve crucial ecological functions. While tools exist to identify secondary metabolite clusters in genomes, precise sequence-to-function mapping remains challenging because neither the function nor the substrate specificity of biosynthetic enzymes can be predicted accurately. Here, we developed a knowledge-guided bioinformatic pipeline to address these issues. We analyzed 1928 genomes of Pseudomonas bacteria and focused on iron-scavenging pyoverdines as model metabolites. Our pipeline predicted 188 chemically different pyoverdines with nearly 100% structural accuracy, as well as 94 distinct receptor groups required for the uptake of iron-loaded pyoverdines. It unveils an enormous yet overlooked diversity of siderophores (151 new structures) and receptors (91 new groups). Our approach, which combines feature sequences with phylogenetic methods, is extendable to other metabolites and microbial genera and thus emerges as a powerful tool for reconstructing bacterial secondary metabolism pathways from sequence data.

Published in eLife

ISSN: 2050-084X (Online)
Publisher: eLife Sciences Publications Ltd
Country of publisher: United Kingdom
LCC subjects: Medicine; Science: Biology (General)
Website: https://elifesciences.org
