EBioMedicine (Aug 2018)

Harmonized Genome Wide Typing of Tubercle Bacilli Using a Web-Based Gene-By-Gene Nomenclature SystemResearch in context

  • Thomas A. Kohl,
  • Dag Harmsen,
  • Jörg Rothgänger,
  • Timothy Walker,
  • Roland Diel,
  • Stefan Niemann

Journal volume & issue
Vol. 34
pp. 131 – 138

Abstract

Read online

Background: Global tuberculosis (TB) control is challenged by uncontrolled transmission of Mycobacterium tuberculosis complex (Mtbc) strains, esp. of multidrug (MDR) or extensively resistant (XDR) variants. Precise analysis of transmission networks is the basis to trace outbreak M/XDR clones and improve TB control. However, classical genotyping tools lack discriminatory power due to the high similarity of strains of particular successful lineages, e.g. Beijing or outbreak strains. This can be overcome by whole genome sequencing (WGS) approaches, but these are not yet standardized to facilitate larger investigations encompassing different laboratories or outbreak tracing across borders. Methods: We established and improved a whole genome gene-by-gene multi locus sequence typing approach encompassing a stable set of core genome genes (cgMLST) and linked it to a web-based nomenclature server (cgMLST.org) facilitating assignment and storage of allele numbers. Findings: We evaluated and refined a previously suggested cgMLST schema by using a reference strain set (n = 251) reflecting the global diversity of the Mtbc. A set of 2891 genes showed excellent performance with at least 97% of the genes reliably identified in strains of all Mtbc lineages and in discriminating outbreak strains. cgMLST allele numbers were automatically retrieved from and stored at cgMLST.org. Interpretation: The refined cgMLST schema provides high resolution genome-based typing of clinical strains of all Mtbc lineages. Combined with a web-based nomenclature server, it facilitates rapid, high-resolution, and harmonized tracing of clinical Mtbc strains needed for prospective local and global surveillance. Keywords: Whole genome sequencing, Mycobacterium tuberculosis, Core genome MLST, Molecular epidemiology, Genotyping