Network embedding unveils the hidden interactions in the mammalian virome

Timothée Poisot; Marie-Andrée Ouellet; Nardus Mollentze; Maxwell J. Farrell; Daniel J. Becker; Liam Brierley; Gregory F. Albery; Rory J. Gibb; Stephanie N. Seifert; Colin J. Carlson

Patterns (Jun 2023)

Network embedding unveils the hidden interactions in the mammalian virome

Timothée Poisot,
Marie-Andrée Ouellet,
Nardus Mollentze,
Maxwell J. Farrell,
Daniel J. Becker,
Liam Brierley,
Gregory F. Albery,
Rory J. Gibb,
Stephanie N. Seifert,
Colin J. Carlson

Affiliations

Timothée Poisot: Département de Sciences Biologiques, Université de Montréal, Montréal, QC, Canada; Corresponding author
Marie-Andrée Ouellet: Département de Sciences Biologiques, Université de Montréal, Montréal, QC, Canada
Nardus Mollentze: School of Biodiversity, One Health and Veterinary Medicine, University of Glasgow, Glasgow, UK; MRC – University of Glasgow Centre for Virus Research, Glasgow, UK
Maxwell J. Farrell: Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
Daniel J. Becker: Department of Biology, University of Oklahoma, Norman, OK, USA
Liam Brierley: Department of Health Data Science, University of Liverpool, Liverpool, UK
Gregory F. Albery: Department of Biology, Georgetown University, Washington, DC, USA
Rory J. Gibb: Center for Biodiversity & Environment Research, University College, London, UK
Stephanie N. Seifert: Paul G. Allen School for Global Health, Washington State University, Pullman, WA, USA
Colin J. Carlson: Center for Global Health Science and Security, Georgetown University, Washington, DC, USA

Journal volume & issue: Vol. 4, no. 6
p. 100738

Abstract

Read online

Summary: Predicting host-virus interactions is fundamentally a network science problem. We develop a method for bipartite network prediction that combines a recommender system (linear filtering) with an imputation algorithm based on low-rank graph embedding. We test this method by applying it to a global database of mammal-virus interactions and thus show that it makes biologically plausible predictions that are robust to data biases. We find that the mammalian virome is under-characterized anywhere in the world. We suggest that future virus discovery efforts could prioritize the Amazon Basin (for its unique coevolutionary assemblages) and sub-Saharan Africa (for its poorly characterized zoonotic reservoirs). Graph embedding of the imputed network improves predictions of human infection from viral genome features, providing a shortlist of priorities for laboratory studies and surveillance. Overall, our study indicates that the global structure of the mammal-virus network contains a large amount of information that is recoverable, and this provides new insights into fundamental biology and disease emergence. The bigger picture: Documenting all interactions between viruses and mammals is not feasible; viruses are too small, the world is too big, and viruses and mammals are too diverse. As a consequence, we think we only know about 1% or 2% of the interactions between mammals and viruses. This is a critical gap in our knowledge because it can lead us to missing reservoirs of possible zoonotic viruses. In this article, we develop a process to leverage the information we have about interactions between hosts and viruses to do three things: First, we predict missing interactions in this network and give them a score based on how likely the model guesses they are. Second, we map these predicted interactions in space to provide guidance about where to go and what to look for to collect data that would maximize our knowledge of host-virus interactions. Finally, based on the predicted interactions, we use information about the genome of viruses to identify possible zoonotic viruses.

DSML 3: Development/pre-production: Data science output has been rolled out/validated across multiple domains/problems

Published in Patterns

ISSN: 2666-3899 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://www.cell.com/patterns

About the journal

Abstract

Keywords