PLoS ONE (Jan 2015)

The Prediction and Validation of Small CDSs Expand the Gene Repertoire of the Smallest Known Eukaryotic Genomes.

  • Abdel Belkorchia,
  • Cyrielle Gasc,
  • Valérie Polonais,
  • Nicolas Parisot,
  • Nicolas Gallois,
  • Céline Ribière,
  • Emmanuelle Lerat,
  • Christine Gaspin,
  • Jean-François Pombert,
  • Pierre Peyret,
  • Eric Peyretaillade

DOI
https://doi.org/10.1371/journal.pone.0139075
Journal volume & issue
Vol. 10, no. 9
p. e0139075

Abstract

Read online

The proper prediction of the gene catalogue of an organism is essential to obtain a representative snapshot of its overall lifestyle, especially when it is not amenable to culturing. Microsporidia are obligate intracellular, sometimes hard to culture, eukaryotic parasites known to infect members of every animal phylum. To date, sequencing and annotation of microsporidian genomes have revealed a poor gene complement with highly reduced gene sizes. In the present paper, we investigated whether such gene sizes may have induced biases for the methodologies used for genome annotation, with an emphasis on small coding sequence (CDS) gene prediction. Using better delineated intergenic regions from four Encephalitozoon genomes, we predicted de novo new small CDSs with sizes ranging from 78 to 255 bp (median 168) and corroborated these predictions by RACE-PCR experiments in Encephalitozoon cuniculi. Most of the newly found genes are present in other distantly related microsporidian species, suggesting their biological relevance. The present study provides a better framework for annotating microsporidian genomes and to train and evaluate new computational methods dedicated at detecting ultra-small genes in various organisms.