mSystems (Sep 2024)
Whole-genome sequencing for antimicrobial surveillance: species-specific quality thresholds and data evaluation from the network of the European Union Reference Laboratory for Antimicrobial Resistance genomic proficiency tests of 2021 and 2022
Abstract
ABSTRACT As antimicrobial resistance (AMR) surveillance shifts to genomics, ensuring the quality of whole-genome sequencing (WGS) data produced across laboratories is critical. Participation in genomic proficiency tests (GPTs) not only increases individual laboratories' WGS capacity but also provides a unique opportunity to improve species-specific thresholds for WGS quality control (QC) by repeated resequencing of distinct isolates. Here, we present the results of the EU Reference Laboratory for Antimicrobial Resistance (EURL-AR) network GPTs of 2021 and 2022, which included 25 EU national reference laboratories (NLRs). A total of 392 genomes from 12 AMR-bacteria were evaluated based on WGS QC metrics. Two percent (n = 9) of the data were excluded, due to contamination, and 11% (n = 41) of the remaining genomes were identified as outliers in at least one QC metric and excluded from computation of the adjusted QC thresholds (AQT). Two QC metric correlation groups were identified through linear regression. Eight percent (n = 28) of the submitted genomes, from 11 laboratories, failed one or more of the AQTs. However, only three laboratories (12%) were identified as underperformers, failing across AQTs for uncorrelated QC metrics in at least two genomes. Finally, new species-specific thresholds for “N50” and “number of contigs > 200 bp” are presented for guidance in routine laboratory QC. The continued participation of NRLs in GPTs will reveal WGS workflow flaws and improve AMR surveillance data. GPT data will continue to contribute to the development of reliable species-specific thresholds for routine WGS QC, standardizing sequencing data QC and ensure inter- and intranational laboratory comparability.IMPORTANCEIllumina next-generation sequencing is an integral part of antimicrobial resistance (AMR) surveillance and the most widely used whole-genome sequencing (WGS) platform. The high-throughput, relative low-cost, high discriminatory power, and rapid turnaround time of WGS compared to classical biochemical methods means the technology will likely remain a fundamental tool in AMR surveillance and public health. In this study, we present the current level of WGS capacity among national reference laboratories in the EU Reference Laboratory for AMR network, summarizing applied methodology and statistically evaluating the quality of the obtained sequence data. These findings provide the basis for setting new and revised thresholds for quality metrics used in routine WGS, which have previously been arbitrarily defined. In addition, underperforming participants are identified and encouraged to evaluate their workflows to produce reliable results.
Keywords