From DNA Sequences to Chemical Structures – Methods for Mining Microbial Genomic and Metagenomic Data Sets for New Natural Products

Jurica Zucko; Antonio Starcevic; Janko Diminic; Mouhsine Elbekali; Mohamed Lisfi; Paul F. Long; John Cullum; Daslav Hranueli

Food Technology and Biotechnology (Jan 2010)

Affiliations

Jurica Zucko: Faculty of Food Technology and Biotechnology, University of Zagreb, Pierottijeva 6, HR-10000 Zagreb, Croatia
Antonio Starcevic: Faculty of Food Technology and Biotechnology, University of Zagreb, Pierottijeva 6, HR-10000 Zagreb, Croatia
Janko Diminic: Faculty of Food Technology and Biotechnology, University of Zagreb, Pierottijeva 6, HR-10000 Zagreb, Croatia
Mouhsine Elbekali: Department of Genetics, University of Kaiserslautern, Postfach 3049, DE-67653 Kaiserslautern, Germany
Mohamed Lisfi: Department of Genetics, University of Kaiserslautern, Postfach 3049, DE-67653 Kaiserslautern, Germany
Paul F. Long: School of Pharmacy, University of London, 29/39 Brunswick Square, London WC1N 1AX, United Kingdom
John Cullum: Department of Genetics, University of Kaiserslautern, Postfach 3049, DE-67653 Kaiserslautern, Germany
Daslav Hranueli: Faculty of Food Technology and Biotechnology, University of Zagreb, Pierottijeva 6, HR-10000 Zagreb, Croatia

Journal volume & issue: Vol. 48, no. 2, pp. 234–242

Abstract

Large genomic and metagenomic data sets have been mined rapidly for biosynthetic gene clusters encoding modular polyketide synthases, non-ribosomal peptide synthetases and hybrid polyketide synthase/non-ribosomal peptide synthetase systems, using the generic computer program packages ClustScan and CompGen. These packages annotate the clusters, structure the annotation hierarchically into polypeptides, modules and domains, and store and graphically present the data. The goal is to make the most accurate predictions of the activities and specificities of catalytically active domains that present knowledge allows, leading to a prediction of the most likely chemical structures produced by these enzymes. The packages also allow novel clusters to be generated by in silico homologous recombination of the annotated genes. ClustScan and CompGen were used to construct a custom database of known compounds (CSDB) and one of predicted, entirely novel recombinant products (r-CSDB), both of which can be used for in silico screening with computer-aided drug design technology. The use of these programs is illustrated by the analysis of genomic sequences from terrestrial prokaryotes and eukaryotic microorganisms, a marine metagenomic data set and a newly discovered example of a 'shared metabolic pathway' in marine-microbial endosymbiosis.

Published in Food Technology and Biotechnology

ISSN: 1330-9862 (Print); 1334-2606 (Online)
Publisher: University of Zagreb Faculty of Food Technology and Biotechnology
Country of publisher: Croatia
LCC subjects: Technology: Chemical technology: Biotechnology; Technology: Chemical technology: Food processing and manufacture
Website: https://www.ftb.com.hr/
