Plants (Aug 2023)

Genome Survey and Chromosome-Level Draft Genome Assembly of <i>Glycine max</i> var. Dongfudou 3: Insights into Genome Characteristics and Protein Deficiencies

  • Yajuan Duan,
  • Yue Li,
  • Jing Zhang,
  • Yongze Song,
  • Yan Jiang,
  • Xiaohong Tong,
  • Yingdong Bi,
  • Shaodong Wang,
  • Sui Wang

DOI
https://doi.org/10.3390/plants12162994
Journal volume & issue
Vol. 12, no. 16
p. 2994

Abstract

Dongfudou 3 is a highly sought-after soybean variety because it lacks beany flavor. To support molecular breeding efforts, we conducted a genome survey using next-generation sequencing and determined the genome size, complexity, and characteristics of Dongfudou 3. We also constructed a chromosome-level draft genome and examined the likely molecular basis of the deficiency in the GmLOX1, GmLOX2, and GmLOX3 proteins. These findings set the stage for high-quality genome analysis using third-generation sequencing. The estimated genome size is approximately 1.07 Gb, with repetitive sequences accounting for 72.50%. The genome is homozygous and free of microbial contamination. The draft genome consists of 916.00 Mb anchored onto 20 chromosomes, with 46,446 annotated genes and 77,391 transcripts; Benchmarking Universal Single-Copy Orthologs (BUSCO) completeness is 99.5% for the genome assembly and 99.1% for the annotation. Deletions and substitutions were identified in the three GmLox genes, and the corresponding active proteins are absent. Our proposed approach, k-mer analysis after filtering out organellar DNA sequences, is applicable to genome surveys of all plant species and allows accurate assessment of genome size and complexity. Moreover, constructing chromosome-level draft genomes from closely related reference genomes offers cost-effective access to valuable information and maximizes data utilization.
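
To illustrate the kind of k-mer-based size estimate mentioned in the abstract, the following is a minimal sketch of the common textbook calculation (total k-mers divided by the depth of the main peak). It is not the authors' pipeline: the function name, the `min_depth` error cutoff, and the assumption that the histogram was computed from reads already filtered of organellar sequences are all illustrative choices.

```python
from typing import Dict, Tuple


def estimate_genome_size(histogram: Dict[int, int], min_depth: int = 5) -> Tuple[int, float]:
    """Estimate genome size from a k-mer depth histogram.

    histogram maps k-mer depth -> number of distinct k-mers seen at that depth
    (the two-column output of a k-mer counter, for example).
    min_depth is a hypothetical cutoff: depths below it are treated as
    sequencing-error k-mers and ignored.
    Returns (peak_depth, estimated_size), where estimated_size counts
    k-mer positions and therefore approximates the genome length in bases.
    """
    # Drop the low-depth bins, which are dominated by sequencing errors.
    usable = {depth: count for depth, count in histogram.items() if depth >= min_depth}
    if not usable:
        raise ValueError("no histogram bins at or above min_depth")

    # The depth with the most distinct k-mers approximates the depth at which
    # single-copy sequence was sampled (valid for a homozygous genome).
    peak_depth = max(usable, key=usable.get)

    # Total k-mer occurrences; repeats contribute through their higher depth.
    total_kmers = sum(depth * count for depth, count in usable.items())

    return peak_depth, total_kmers / peak_depth
```

Because organellar DNA is present at very high copy number, leaving it in the reads would inflate `total_kmers` and therefore the estimate, which is why this sketch assumes filtered input.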

Keywords