Frontiers in Genetics (Sep 2022)

Comparative analysis of the chloroplast genomes of eight Piper species and insights into the utilization of structural variation in phylogenetic analysis

  • Jing Li,
  • Jing Li,
  • Jing Li,
  • Jing Li,
  • Rui Fan,
  • Rui Fan,
  • Rui Fan,
  • Rui Fan,
  • Jintao Xu,
  • Lisong Hu,
  • Lisong Hu,
  • Lisong Hu,
  • Lisong Hu,
  • Fan Su,
  • Fan Su,
  • Fan Su,
  • Fan Su,
  • Chaoyun Hao,
  • Chaoyun Hao,
  • Chaoyun Hao,
  • Chaoyun Hao

DOI
https://doi.org/10.3389/fgene.2022.925252
Journal volume & issue
Vol. 13

Abstract

Read online

With more than 2000 species, Piper is regarded as having high medicinal, cosmetic, and edible value. There also remain some taxonomic and evolutionary uncertainties about the genus. This study performed chloroplast genome sequencing of eight poorly studied Piper species and a comparative analysis with black pepper (Piper nigrum). All examined species were highly similar in gene content, with 79 protein-coding genes, 24 tRNAs, and four rRNAs. They also harbored significant structural differences: The number of SSRs ranged from 63 to 87, over 10,000 SNPs were detected, and over 1,000 indels were found. The spatial distribution of structural differences was uneven, with the IR and LSC being relatively more conserved and the SSC region highly variable. Such structural variations of the chloroplast genome can help in evaluating the phylogenetic relationships between species, deciding some hard-to-distinguish evolutionary relationships, or eliminating improper markers. The SSC region may be evolving at high speed, and some species showed a high degree of sequence variation in the SSC region, which seriously affected marker sequence detection. Conversely, CDS sequences tended to lack variation, and some CDSs can serve as ideal markers for phylogenetic reconstruction. All told, this study provides an effective strategy for selecting chloroplast markers, analyzing difficult-to-distinguish phylogenetic relationships and avoiding the taxonomic errors caused by high degree of sequence variations.

Keywords