Genomics Data (Sep 2015)

Ameliorated de novo transcriptome assembly using Illumina paired end sequence data with Trinity Assembler

  • Kiran Gopinath Bankar,
  • Vivek Nagaraj Todur,
  • Rohit Nandan Shukla,
  • Madavan Vasudevan

DOI
https://doi.org/10.1016/j.gdata.2015.07.012
Journal volume & issue
Vol. 5, no. C
pp. 352 – 359

Abstract

Read online

Advent of Next Generation Sequencing has led to possibilities of de novo transcriptome assembly of organisms without availability of complete genome sequence. Among various sequencing platforms available, Illumina is the most widely used platform based on data quality, quantity and cost. Various de novo transcriptome assemblers are also available today for construction of de novo transcriptome. In this study, we aimed at obtaining an ameliorated de novo transcriptome assembly with sequence reads obtained from Illumina platform and assembled using Trinity Assembler. We found that, primary transcriptome assembly obtained as a result of Trinity can be ameliorated on the basis of transcript length, coverage, and depth and protein homology. Our approach to ameliorate is reproducible and could enhance the sensitivity and specificity of the assembled transcriptome which could be critical for validation of the assembled transcripts and for planning various downstream biological assays.