PLoS ONE (Jan 2011)

Novel tools for conservation genomics: comparing two high-throughput approaches for SNP discovery in the transcriptome of the European hake.

  • Ilaria Milano,
  • Massimiliano Babbucci,
  • Frank Panitz,
  • Rob Ogden,
  • Rasmus O Nielsen,
  • Martin I Taylor,
  • Sarah J Helyar,
  • Gary R Carvalho,
  • Montserrat Espiñeira,
  • Miroslava Atanassova,
  • Fausto Tinti,
  • Gregory E Maes,
  • Tomaso Patarnello,
  • FishPopTrace Consortium,
  • Luca Bargelloni

DOI
https://doi.org/10.1371/journal.pone.0028008
Journal volume & issue
Vol. 6, no. 11
p. e28008

Abstract

Read online

The growing accessibility to genomic resources using next-generation sequencing (NGS) technologies has revolutionized the application of molecular genetic tools to ecology and evolutionary studies in non-model organisms. Here we present the case study of the European hake (Merluccius merluccius), one of the most important demersal resources of European fisheries. Two sequencing platforms, the Roche 454 FLX (454) and the Illumina Genome Analyzer (GAII), were used for Single Nucleotide Polymorphisms (SNPs) discovery in the hake muscle transcriptome. De novo transcriptome assembly into unique contigs, annotation, and in silico SNP detection were carried out in parallel for 454 and GAII sequence data. High-throughput genotyping using the Illumina GoldenGate assay was performed for validating 1,536 putative SNPs. Validation results were analysed to compare the performances of 454 and GAII methods and to evaluate the role of several variables (e.g. sequencing depth, intron-exon structure, sequence quality and annotation). Despite well-known differences in sequence length and throughput, the two approaches showed similar assay conversion rates (approximately 43%) and percentages of polymorphic loci (67.5% and 63.3% for GAII and 454, respectively). Both NGS platforms therefore demonstrated to be suitable for large scale identification of SNPs in transcribed regions of non-model species, although the lack of a reference genome profoundly affects the genotyping success rate. The overall efficiency, however, can be improved using strict quality and filtering criteria for SNP selection (sequence quality, intron-exon structure, target region score).