Scientific Reports (Sep 2023)

SoyDBean: a database for SNPs reconciliation by multiple versions of soybean reference genomes

  • Yejin Lee,
  • Dong U Woo,
  • Yang Jae Kang

DOI
https://doi.org/10.1038/s41598-023-42898-1
Journal volume & issue
Vol. 13, no. 1
pp. 1 – 10

Abstract

Advances in sequencing technology and falling costs have made many whole genome sequences available. As a result, extensive genetic variation has been discovered across many populations and germplasms, helping to characterize the genetic diversity of soybean (Glycine max [L.] Merr.). However, assessing the quality of these variants is essential because the published variants were obtained using different bioinformatic methods and parameters. Furthermore, although the new reference genome offers improved contiguity and fewer unresolved “N” stretches, few efforts have been made to verify the quality of the variations identified with it. The primary goal of this research was to identify a reliable set of SNPs that remain consistent when reconciled across multiple reference genomes. In addition, we aimed to reconfirm these variations using numerous whole genome sequencing datasets obtained from publicly available databases. Based on the results, we created datasets comprising the thoroughly verified SNP coordinates between the reference assemblies. The resulting “SoyDBean” database is publicly accessible at http://soydbean.plantprofile.net/.