PGMiner reloaded, fully automated proteogenomic annotation tool linking genomes to proteomes

Has Canan; Lashin Sergey A.; Kochetov Alexey; Allmer Jens

doi:10.1515/jib-2016-293

Journal of Integrative Bioinformatics (Oct 2016)

PGMiner reloaded, fully automated proteogenomic annotation tool linking genomes to proteomes

Has Canan,
Lashin Sergey A.,
Kochetov Alexey,
Allmer Jens

Affiliations

Has Canan: Molecular Biology and Genetics, Izmir Institute of Technology, Urla, Izmir, Turkey
Lashin Sergey A.: Institute of Cytology & Genetics, SB RAS, Novosibirsk, Russian Federation
Kochetov Alexey: Institute of Cytology & Genetics, SB RAS, Novosibirsk, Russian Federation
Allmer Jens: Molecular Biology and Genetics, Izmir Institute of Technology, Urla, Izmir, Turkey

DOI: https://doi.org/10.1515/jib-2016-293
Journal volume & issue: Vol. 13, no. 4
pp. 16 – 23

Abstract

Read online

Improvements in genome sequencing technology increased the availability of full genomes and transcriptomes of many organisms. However, the major benefit of massive parallel sequencing is to better understand the organization and function of genes which then lead to understanding of phenotypes. In order to interpret genomic data with automated gene annotation studies, several tools are currently available. Even though the accuracy of computational gene annotation is increasing, a combination of multiple lines of experimental evidences should be gathered. Mass spectrometry allows the identification and sequencing of proteins as major gene products; and it is only these proteins that conclusively show whether a part of a genome is a coding region or not to result in phenotypes. Therefore, in the field of proteogenomics, the validation of computational methods is done by exploiting mass spectrometric data. As a result, identification of novel protein coding regions, validation of current gene models, and determination of upstream and downstream regions of genes can be achieved. In this paper, we present new functionality for our proteogenomic tool, PGMiner which performs all proteogenomic steps like acquisition of mass spectrometric data, peptide identification against preprocessed sequence databases, assignment of statistical confidence to identified peptides, mapping confident peptides to gene models, and result visualization. The extensions cover determining proteotypic peptides and thus unambiguous protein identification. Furthermore, peptides conflicting with gene models can now automatically assessed within the context of predicted alternative open reading frames.

Published in Journal of Integrative Bioinformatics

ISSN: 1613-4516 (Online)
Publisher: De Gruyter
Country of publisher: Germany
LCC subjects: Technology: Chemical technology: Biotechnology
Website: https://www.degruyter.com/view/j/jib

About the journal