BMC Biology (Jun 2021)

Cotton D genome assemblies built with long-read data unveil mechanisms of centromere evolution and stress tolerance divergence

  • Zhaoen Yang,
  • Xiaoyang Ge,
  • Weinan Li,
  • Yuying Jin,
  • Lisen Liu,
  • Wei Hu,
  • Fuyan Liu,
  • Yanli Chen,
  • Shaoliang Peng,
  • Fuguang Li

DOI
https://doi.org/10.1186/s12915-021-01041-0
Journal volume & issue
Vol. 19, no. 1
pp. 1 – 22

Abstract

Read online

Abstract Background Many of genome features which could help unravel the often complex post-speciation evolution of closely related species are obscured because of their location in chromosomal regions difficult to accurately characterize using standard genome analysis methods, including centromeres and repeat regions. Results Here, we analyze the genome evolution and diversification of two recently diverged sister cotton species based on nanopore long-read sequence assemblies and Hi-C 3D genome data. Although D genomes are conserved in gene content, they have diversified in gene order, gene structure, gene family diversification, 3D chromatin structure, long-range regulation, and stress-related traits. Inversions predominate among D genome rearrangements. Our results support roles for 5mC and 6mA in gene activation, and 3D chromatin analysis showed that diversification in proximal-vs-distal regulatory-region interactions shape the regulation of defense-related-gene expression. Using a newly developed method, we accurately positioned cotton centromeres and found that these regions have undergone obviously more rapid evolution relative to chromosome arms. We also discovered a cotton-specific LTR class that clarifies evolutionary trajectories among diverse cotton species and identified genetic networks underlying the Verticillium tolerance of Gossypium thurberi (e.g., SA signaling) and salt-stress tolerance of Gossypium davidsonii (e.g., ethylene biosynthesis). Finally, overexpression of G. thurberi genes in upland cotton demonstrated how wild cottons can be exploited for crop improvement. Conclusions Our study substantially deepens understanding about how centromeres have developed and evolutionarily impacted the divergence among closely related cotton species and reveals genes and 3D genome structures which can guide basic investigations and applied efforts to improve crops.

Keywords