Neural network based integration of assays to assess pathogenic potential

Mohammed Eslami; Yi-Pei Chen; Ainsley C. Nicholson; Mark Weston; Melissa Bell; John R. McQuiston; James Samuel; Erin J. van Schaik; Paul de Figueiredo

doi:10.1038/s41598-023-32950-5

Scientific Reports (Apr 2023)

Neural network based integration of assays to assess pathogenic potential

Mohammed Eslami,
Yi-Pei Chen,
Ainsley C. Nicholson,
Mark Weston,
Melissa Bell,
John R. McQuiston,
James Samuel,
Erin J. van Schaik,
Paul de Figueiredo

Affiliations

Mohammed Eslami: Netrias, LLC
Yi-Pei Chen: Netrias, LLC
Ainsley C. Nicholson: Special Bacteriology Reference Laboratory, Bacterial Special Pathogens Branch, Division of High-Consequence Pathogens and Pathology, Centers for Disease Control and Prevention
Mark Weston: Netrias, LLC
Melissa Bell: Special Bacteriology Reference Laboratory, Bacterial Special Pathogens Branch, Division of High-Consequence Pathogens and Pathology, Centers for Disease Control and Prevention
John R. McQuiston: Special Bacteriology Reference Laboratory, Bacterial Special Pathogens Branch, Division of High-Consequence Pathogens and Pathology, Centers for Disease Control and Prevention
James Samuel: Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center
Erin J. van Schaik: Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center
Paul de Figueiredo: Department of Microbial Pathogenesis and Immunology, Texas A&M Health Science Center

DOI: https://doi.org/10.1038/s41598-023-32950-5
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Limited data significantly hinders our capability of biothreat assessment of novel bacterial strains. Integration of data from additional sources that can provide context about the strain can address this challenge. Datasets from different sources, however, are generated with a specific objective and which makes integration challenging. Here, we developed a deep learning-based approach called the neural network embedding model (NNEM) that integrates data from conventional assays designed to classify species with new assays that interrogate hallmarks of pathogenicity for biothreat assessment. We used a dataset of metabolic characteristics from a de-identified set of known bacterial strains that the Special Bacteriology Reference Laboratory (SBRL) of the Centers for Disease Control and Prevention (CDC) has curated for use in species identification. The NNEM transformed results from SBRL assays into vectors to supplement unrelated pathogenicity assays from de-identified microbes. The enrichment resulted in a significant improvement in accuracy of 9% for biothreat. Importantly, the dataset used in our analysis is large, but noisy. Therefore, the performance of our system is expected to improve as additional types of pathogenicity assays are developed and deployed. The proposed NNEM strategy thus provides a generalizable framework for enrichment of datasets with previously collected assays indicative of species.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal