Frontiers in Microbiology (Nov 2020)

Proficiency Testing of Metagenomics-Based Detection of Food-Borne Pathogens Using a Complex Artificial Sequencing Dataset

  • Dirk Höper,
  • Josephine Grützke,
  • Annika Brinkmann,
  • Joël Mossong,
  • Sébastien Matamoros,
  • Richard J. Ellis,
  • Carlus Deneke,
  • Simon H. Tausch,
  • Isabel Cuesta,
  • Sara Monzón,
  • Miguel Juliá,
  • Thomas Nordahl Petersen,
  • Rene S. Hendriksen,
  • Sünje J. Pamp,
  • Mikael Leijon,
  • Mikhayil Hakhverdyan,
  • Aaron M. Walsh,
  • Paul D. Cotter,
  • Lakshmi Chandrasekaran,
  • Moon Y. F. Tay,
  • Joergen Schlundt,
  • Claudia Sala,
  • Alessandra De Cesare,
  • Andreas Nitsche,
  • Martin Beer,
  • Claudia Wylezich

DOI
https://doi.org/10.3389/fmicb.2020.575377
Journal volume & issue
Vol. 11

Abstract

Read online

Metagenomics-based high-throughput sequencing (HTS) enables comprehensive detection of all species comprised in a sample with a single assay and is becoming a standard method for outbreak investigation. However, unlike real-time PCR or serological assays, HTS datasets generated for pathogen detection do not easily provide yes/no answers. Rather, results of the taxonomic read assignment need to be assessed by trained personnel to gain information thereof. Proficiency tests are important instruments of validation, harmonization, and standardization. Within the European Union funded project COMPARE [COllaborative Management Platform for detection and Analyses of (Re-) emerging and foodborne outbreaks in Europe], we conducted a proficiency test to scrutinize the ability to assess diagnostic metagenomics data. An artificial dataset resembling shotgun sequencing of RNA from a sample of contaminated trout was provided to 12 participants with the request to provide a table with per-read taxonomic assignments at species level and a report with a summary and assessment of their findings, considering different categories like pathogen, background, or contaminations. Analysis of the read assignment tables showed that the software used reliably classified the reads taxonomically overall. However, usage of incomplete reference databases or inappropriate data pre-processing caused difficulties. From the combination of the participants’ reports with their read assignments, we conclude that, although most species were detected, a number of important taxa were not or not correctly categorized. This implies that knowledge of and awareness for potentially dangerous species and contaminations need to be improved, hence, capacity building for the interpretation of diagnostic metagenomics datasets is necessary.

Keywords