Frontiers in Plant Science (Nov 2021)

Centromere-Specific Retrotransposons and Very-Long-Chain Fatty Acid Biosynthesis in the Genome of Yellowhorn (Xanthoceras sorbifolium, Sapindaceae), an Oil-Producing Tree With Significant Drought Resistance

  • Hui Liu,
  • Xue-Mei Yan,
  • Xin-rui Wang,
  • Dong-Xu Zhang,
  • Qingyuan Zhou,
  • Tian-Le Shi,
  • Kai-Hua Jia,
  • Xue-Chan Tian,
  • Shan-Shan Zhou,
  • Ren-Gang Zhang,
  • Quan-Zheng Yun,
  • Qing Wang,
  • Qiuhong Xiang,
  • Chanaka Mannapperuma,
  • Elena Van Zalen,
  • Nathaniel R. Street,
  • Ilga Porth,
  • Yousry A. El-Kassaby,
  • Wei Zhao,
  • Wei Zhao,
  • Xiao-Ru Wang,
  • Xiao-Ru Wang,
  • Wenbin Guan,
  • Jian-Feng Mao

DOI
https://doi.org/10.3389/fpls.2021.766389
Journal volume & issue
Vol. 12

Abstract

Read online

In-depth genome characterization is still lacking for most of biofuel crops, especially for centromeres, which play a fundamental role during nuclear division and in the maintenance of genome stability. This study applied long-read sequencing technologies to assemble a highly contiguous genome for yellowhorn (Xanthoceras sorbifolium), an oil-producing tree, and conducted extensive comparative analyses to understand centromere structure and evolution, and fatty acid biosynthesis. We produced a reference-level genome of yellowhorn, ∼470 Mb in length with ∼95% of contigs anchored onto 15 chromosomes. Genome annotation identified 22,049 protein-coding genes and 65.7% of the genome sequence as repetitive elements. Long terminal repeat retrotransposons (LTR-RTs) account for ∼30% of the yellowhorn genome, which is maintained by a moderate birth rate and a low removal rate. We identified the centromeric regions on each chromosome and found enrichment of centromere-specific retrotransposons of LINE1 and Gypsy in these regions, which have evolved recently (∼0.7 MYA). We compared the genomes of three cultivars and found frequent inversions. We analyzed the transcriptomes from different tissues and identified the candidate genes involved in very-long-chain fatty acid biosynthesis and their expression profiles. Collinear block analysis showed that yellowhorn shared the gamma (γ) hexaploidy event with Vitis vinifera but did not undergo any further whole-genome duplication. This study provides excellent genomic resources for understanding centromere structure and evolution and for functional studies in this important oil-producing plant.

Keywords