PLoS ONE (Jan 2018)

ImproveAssembly - Tool for identifying new gene products and improving genome assembly.

  • Adonney Allan de Oliveira Veras,
  • Bruno Merlin,
  • Pablo Henrique Caracciolo Gomes de Sá

DOI
https://doi.org/10.1371/journal.pone.0206000
Journal volume & issue
Vol. 13, no. 10
p. e0206000

Abstract

The availability of biological information in public databases has increased exponentially. To ensure the accuracy of this information, researchers have adopted several methods and refinements to avoid disseminating incorrect information; for example, several automated tools are available for annotation. However, manual curation remains important for verifying and enriching biological information. Additionally, the genome finishing process is complex, which has led to increased deposition of draft genomes. This introduces bias into other omics analyses because incomplete genomic content is used. The same problem is observed for complete genomes: for example, genomes generated by reference-based assembly may lack products that are new to the sequenced genome, and errors or bias can arise during the assembly process. Thus, we developed ImproveAssembly, a tool capable of identifying new products missing from genomic sequences, which can be used for both complete and draft genomes. The identified products can improve the annotation of complete and draft genomes while significantly reducing the bias introduced when the information is used in other omics analyses.