Variability and bias in microbiome metagenomic sequencing: an interlaboratory study comparing experimental protocols

Samuel P. Forry; Stephanie L. Servetas; Jason G. Kralj; Keng Soh; Michalis Hadjithomas; Raul Cano; Martha Carlin; Maria G. de Amorim; Benjamin Auch; Matthew G. Bakker; Thais F. Bartelli; Juan P. Bustamante; Ignacio Cassol; Mauricio Chalita; Emmanuel Dias-Neto; Aaron Del Duca; Daryl M. Gohl; Jekaterina Kazantseva; Muyideen T. Haruna; Peter Menzel; Bruno S. Moda; Lorieza Neuberger-Castillo; Diana N. Nunes; Isha R. Patel; Rodrigo D. Peralta; Adrien Saliou; Rolf Schwarzer; Samantha Sevilla; Isabella K. T. M. Takenaka; Jeremy R. Wang; Rob Knight; Dirk Gevers; Scott A. Jackson

doi:10.1038/s41598-024-57981-4

Scientific Reports (Apr 2024)

Variability and bias in microbiome metagenomic sequencing: an interlaboratory study comparing experimental protocols

Samuel P. Forry,
Stephanie L. Servetas,
Jason G. Kralj,
Keng Soh,
Michalis Hadjithomas,
Raul Cano,
Martha Carlin,
Maria G. de Amorim,
Benjamin Auch,
Matthew G. Bakker,
Thais F. Bartelli,
Juan P. Bustamante,
Ignacio Cassol,
Mauricio Chalita,
Emmanuel Dias-Neto,
Aaron Del Duca,
Daryl M. Gohl,
Jekaterina Kazantseva,
Muyideen T. Haruna,
Peter Menzel,
Bruno S. Moda,
Lorieza Neuberger-Castillo,
Diana N. Nunes,
Isha R. Patel,
Rodrigo D. Peralta,
Adrien Saliou,
Rolf Schwarzer,
Samantha Sevilla,
Isabella K. T. M. Takenaka,
Jeremy R. Wang,
Rob Knight,
Dirk Gevers,
Scott A. Jackson

Affiliations

Samuel P. Forry: Complex Microbial Systems Group, National Institute of Standards and Technology (NIST)
Stephanie L. Servetas: Complex Microbial Systems Group, National Institute of Standards and Technology (NIST)
Jason G. Kralj: Complex Microbial Systems Group, National Institute of Standards and Technology (NIST)
Keng Soh: Novo Nordisk
Michalis Hadjithomas: LifeMine Therapeutics, Cambridge Discovery Park
Raul Cano: The BioCollective, LLC
Martha Carlin: The BioCollective, LLC
Maria G. de Amorim: Laboratory of Medical Genomics, A. C. Camargo Cancer Center
Benjamin Auch: University of Minnesota Genomics Center
Matthew G. Bakker: Department of Microbiology, University of Manitoba
Thais F. Bartelli: Laboratory of Medical Genomics, A. C. Camargo Cancer Center
Juan P. Bustamante: Facultad de Ingeniería, Universidad Nacional de Entre Ríos
Ignacio Cassol: Laboratorio de Investigación, Desarrollo y Transferencia de la Facultad de Ingeniería de la Universidad Austral (LIDTUA), CIC-Austral
Mauricio Chalita: CJ Bioscience
Emmanuel Dias-Neto: Laboratory of Medical Genomics, A. C. Camargo Cancer Center
Aaron Del Duca: OMX Advisors, Inc.
Daryl M. Gohl: Department of Genetics, Cell Biology, and Development, University of Minnesota
Jekaterina Kazantseva: Center of Food and Fermentation Technologies (TFTAK)
Muyideen T. Haruna: Bioenvironmental Program, Morgan State University
Peter Menzel: Labor Berlin Charité Vivantes GmbH
Bruno S. Moda: Laboratory of Computational Biology and Bioinformatics, A.C. Camargo Cancer Center
Lorieza Neuberger-Castillo: Integrated Biobank of Luxembourg (IBBL), Luxembourg Institute of Health (LIH)
Diana N. Nunes: Laboratory of Medical Genomics, A. C. Camargo Cancer Center
Isha R. Patel: Center for Food Safety and Applied Nutrition, Office of Applied Research and Safety Assessment, U. S. Food and Drug Administration
Rodrigo D. Peralta: Facultad de Ingeniería, Universidad Nacional de Entre Ríos
Adrien Saliou: OMICS Hub, BIOASTER, Microbiology Research Institute
Rolf Schwarzer: Labor Berlin Charité Vivantes GmbH
Samantha Sevilla: Center for Cancer Research, CCR Collaborative Bioinformatics Resource, National Cancer Institute, National Institutes of Health
Isabella K. T. M. Takenaka: Laboratory of Medical Genomics, A. C. Camargo Cancer Center
Jeremy R. Wang: Department of Genetics, University of North Carolina at Chapel Hill
Rob Knight: Departments of Pediatrics, Bioengineering and Computer Science & Engineering, and Center for Microbiome Innovation, University of California at San Diego
Dirk Gevers: Seed Health
Scott A. Jackson: Complex Microbial Systems Group, National Institute of Standards and Technology (NIST)

DOI: https://doi.org/10.1038/s41598-024-57981-4
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Several studies have documented the significant impact of methodological choices in microbiome analyses. The myriad of methodological options available complicate the replication of results and generally limit the comparability of findings between independent studies that use differing techniques and measurement pipelines. Here we describe the Mosaic Standards Challenge (MSC), an international interlaboratory study designed to assess the impact of methodological variables on the results. The MSC did not prescribe methods but rather asked participating labs to analyze 7 shared reference samples (5 × human stool samples and 2 × mock communities) using their standard laboratory methods. To capture the array of methodological variables, each participating lab completed a metadata reporting sheet that included 100 different questions regarding the details of their protocol. The goal of this study was to survey the methodological landscape for microbiome metagenomic sequencing (MGS) analyses and the impact of methodological decisions on metagenomic sequencing results. A total of 44 labs participated in the MSC by submitting results (16S or WGS) along with accompanying metadata; thirty 16S rRNA gene amplicon datasets and 14 WGS datasets were collected. The inclusion of two types of reference materials (human stool and mock communities) enabled analysis of both MGS measurement variability between different protocols using the biologically-relevant stool samples, and MGS bias with respect to ground truth values using the DNA mixtures. Owing to the compositional nature of MGS measurements, analyses were conducted on the ratio of Firmicutes: Bacteroidetes allowing us to directly apply common statistical methods. The resulting analysis demonstrated that protocol choices have significant effects, including both bias of the MGS measurement associated with a particular methodological choices, as well as effects on measurement robustness as observed through the spread of results between labs making similar methodological choices. In the analysis of the DNA mock communities, MGS measurement bias was observed even when there was general consensus among the participating laboratories. This study was the result of a collaborative effort that included academic, commercial, and government labs. In addition to highlighting the impact of different methodological decisions on MGS result comparability, this work also provides insights for consideration in future microbiome measurement study design.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal