Journal of Cotton Research (Aug 2021)

Improved Gossypium raimondii genome using a Hi-C-based proximity-guided assembly

  • Qiuhong YANG,
  • Dongyun ZUO,
  • Hailiang CHENG,
  • Youping ZHANG,
  • Qiaolian WANG,
  • Ashraf JAVARIA,
  • Xiaoxu FENG,
  • Simin LI,
  • Xiaoqin CHEN,
  • Shang LIU,
  • Guoli SONG

DOI
https://doi.org/10.1186/s42397-021-00096-2
Journal volume & issue
Vol. 4, no. 1
pp. 1 – 7

Abstract

Read online

Abstract Introduction Genome sequence plays an important role in both basic and applied studies. Gossypium raimondii, the putative contributor of the D subgenome of upland cotton (G. hirsutum), highlights the need to improve the genome quality rapidly and efficiently. Methods We performed Hi-C sequencing of G. raimondii and reassembled its genome based on a set of new Hi-C data and previously published scaffolds. We also compared the reassembled genome sequence with the previously published G. raimondii genomes for gene and genome sequence collinearity. Result A total of 98.42% of scaffold sequences were clustered successfully, among which 99.72% of the clustered sequences were ordered and 99.92% of the ordered sequences were oriented with high-quality. Further evaluation of results by heat-map and collinearity analysis revealed that the current reassembled genome is significantly improved than the previous one (Nat Genet 44:98–1103, 2012). Conclusion This improvement in G. raimondii genome not only provides a better reference to increase study efficiency but also offers a new way to assemble cotton genomes. Furthermore, Hi-C data of G. raimondii may be used for 3D structure research or regulating analysis.

Keywords