Genes (Oct 2022)

Genome-Wide Survey and Analysis of Microsatellites in Waterlily, and Potential for Polymorphic Marker Development

  • Xiang Huang,
  • Meihua Yang,
  • Jiaxing Guo,
  • Jiachen Liu,
  • Guangming Chu,
  • Yingchun Xu

DOI
https://doi.org/10.3390/genes13101782
Journal volume & issue
Vol. 13, no. 10
p. 1782

Abstract

Waterlily (Nymphaeaceae), a diploid dicotyledon, is an ornamental aquatic plant. In 2020, the complete draft genome for the blue-petal waterlily (Nymphaea colorata) was made available in GenBank. To date, genome-wide mining of microsatellites, or simple sequence repeats (SSRs), has not been reported in waterlily. In the present study, we investigated the characteristics of genome-wide microsatellites in N. colorata and developed polymorphic SSR markers across tropical and hardy waterlilies. A total of 238,816 SSRs were identified across the 14 N. colorata chromosomes, with an average density of 662.60 SSRs per Mb; the largest number was found on chromosome 1 (n = 30,426, 705.94 SSRs per Mb). Dinucleotide repeats were the most common type, and AT-rich repeats predominated in the N. colorata genome. SSR occurrence frequencies decreased as the number of motif repeats increased. Among the 2442 SSRs located in protein-coding regions, trinucleotide repeats were the most abundant, accounting for 63.84%. Gene ontology terms for signal transduction (e.g., GO:0045859 and GO:0019887) and the lipoic acid metabolism pathway (ko00785) were overrepresented in the GO and KEGG enrichment analyses, respectively. In addition, 107,152 primer pairs were identified, and 13 novel polymorphic SSR markers were used to distinguish among nine waterlily cultivars, of which Ny-5.2 and Ny-10.1 were the most informative loci. This study provides the first detailed characterization of SSRs in the N. colorata genome and delivers 13 novel polymorphic markers that are useful for molecular breeding, genetic diversity studies, and population structure analysis of waterlily.
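
To make the counting and density figures above concrete, the following is a minimal Python sketch of regex-based SSR detection and a per-Mb density calculation. It is not the pipeline used in the study: the abstract does not name the mining tool or its thresholds, so the minimum repeat counts in `MIN_REPEATS` and the function names `find_ssrs` and `ssr_density_per_mb` are illustrative assumptions.

```python
import re

# Assumed minimum repeat counts per motif length (mono- to hexanucleotide).
# These thresholds are illustrative only; the abstract does not state them.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # One motif of length k followed by at least n - 1 further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (e.g. "AA" or "ATAT"); the shorter motif length covers them.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


def ssr_density_per_mb(n_ssrs, length_bp):
    """SSR density expressed as SSRs per megabase of sequence."""
    return n_ssrs / (length_bp / 1_000_000)


if __name__ == "__main__":
    # Toy sequence containing (AT)8, (A)12 and (GCA)5.
    toy = "GGC" + "AT" * 8 + "CCG" + "A" * 12 + "T" + "GCA" * 5 + "TT"
    for start, motif, count in find_ssrs(toy):
        print(start, motif, count)
```

Dividing the reported total by the reported density (238,816 / 662.60) suggests that roughly 360 Mb of chromosomal sequence was scanned; the same `ssr_density_per_mb` helper would reproduce the per-chromosome figures given the chromosome lengths, which the abstract does not list.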

Keywords