Scientific Data (Oct 2023)

A comprehensive dataset of animal-associated sarbecoviruses

  • Bo Liu,
  • Peng Zhao,
  • Panpan Xu,
  • Yelin Han,
  • Yuyang Wang,
  • Lihong Chen,
  • Zhiqiang Wu,
  • Jian Yang

DOI
https://doi.org/10.1038/s41597-023-02558-5
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Zoonotic spillover of sarbecoviruses (SarbeCoVs) from non-human animals to humans under natural conditions has led to two large-scale pandemics, the severe acute respiratory syndrome (SARS) pandemic in 2003 and the ongoing COVID-19 pandemic. Knowledge of the genetic diversity, geographical distribution, and host specificity of SarbeCoVs is therefore of interest for pandemic surveillance and origin tracing of SARS-CoV and SARS-CoV-2. This study presents a comprehensive repository of publicly available animal-associated SarbeCoVs, covering 1,535 viruses identified from 63 animal species distributed in 43 countries worldwide (as of February 14,2023). Relevant meta-information, such as host species, sampling time and location, was manually curated and included in the dataset to facilitate further research on the potential patterns of viral diversity and ecological characteristics. In addition, the dataset also provides well-annotated sequence sets of receptor-binding domains (RBDs) and receptor-binding motifs (RBMs) for the scientific community to highlight the potential determinants of successful cross-species transmission that could be aid in risk estimation and strategic design for future emerging infectious disease control and prevention.