Frontiers in Plant Science (Jul 2023)

Comparative analysis of repeat content in plant genomes, large and small

  • Joris Argentin,
  • Dan Bolser,
  • Paul J. Kersey,
  • Paul J. Kersey,
  • Paul Flicek

DOI
https://doi.org/10.3389/fpls.2023.1103035
Journal volume & issue
Vol. 14

Abstract

Read online

The DNA Features pipeline is the analysis pipeline at EMBL-EBI that annotates repeat elements, including transposable elements. With Ensembl’s goal to stay at the cutting edge of genome annotation, we proved that this pipeline needed an update. We then created a new analysis that allowed the Ensembl database to store the repeat classification from the PGSB repeat classification (Recat). This new dataset was then fetched using Perl scripts and used to prove that the pipeline modification induced a gain in sensitivity. Finally, we performed a comparative analysis of transposable element distribution in all plant species available, raising new questions about transposable elements in certain branches of the taxonomic tree.

Keywords