ProtRank: bypassing the imputation of missing values in differential expression analysis of proteomic data

Matúš Medo; Daniel M. Aebersold; Michaela Medová

doi:10.1186/s12859-019-3144-3

BMC Bioinformatics (Nov 2019)

Affiliations

Matúš Medo: Department of Radiation Oncology, Inselspital, Bern University Hospital and University of Bern
Daniel M. Aebersold: Department of Radiation Oncology, Inselspital, Bern University Hospital and University of Bern
Michaela Medová: Department of Radiation Oncology, Inselspital, Bern University Hospital and University of Bern

DOI: https://doi.org/10.1186/s12859-019-3144-3
Journal volume & issue: Vol. 20, no. 1, pp. 1–12

Abstract

Background: Data from discovery proteomic and phosphoproteomic experiments typically include missing values that correspond to proteins that have not been identified in the analyzed sample. Replacing the missing values with random numbers, a process known as "imputation", avoids apparent infinite fold-change values. However, the procedure comes at a cost: imputing a large number of missing values has the potential to significantly impact the results of the subsequent differential expression analysis.

Results: We propose a method that identifies differentially expressed proteins by ranking their observed changes with respect to the changes observed for other proteins. This method takes missing values into account directly, without the need to impute them. We illustrate the performance of the new method on two distinct datasets and show that it is robust to missing values and, at the same time, provides results that are otherwise similar to those obtained with edgeR, a state-of-the-art differential expression analysis method.

Conclusions: The new method for the differential expression analysis of proteomic data is available as an easy-to-use Python package.
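The problem described in the Background can be illustrated with a minimal sketch. It is not the ProtRank package or its algorithm. It only shows why a missing value (encoded here as 0, an assumption) produces an infinite fold change, and how replacing it with a random number makes the result depend on the random draw. The function names, the intensity values, and the imputation range are all hypothetical.

```python
import math
import random


def log2_fold_change(before, after):
    """Return log2(after / before); infinite when exactly one value is missing (0)."""
    if before == 0 and after == 0:
        return float("nan")  # no information in either sample
    if before == 0:
        return math.inf
    if after == 0:
        return -math.inf
    return math.log2(after / before)


def impute(value, rng, low=1.0, high=10.0):
    """Replace a missing value (0) with a random number from [low, high].

    The range is an arbitrary choice made for this illustration only.
    """
    return rng.uniform(low, high) if value == 0 else value


# Hypothetical intensities for one protein: missing before, observed after.
before, after = 0.0, 800.0

# Without imputation the fold change is infinite.
raw_lfc = log2_fold_change(before, after)

# With imputation the fold change is finite but varies with the random seed.
imputed_lfcs = [
    log2_fold_change(impute(before, random.Random(seed)), after)
    for seed in range(5)
]
spread = max(imputed_lfcs) - min(imputed_lfcs)

print("raw log2 fold change:", raw_lfc)
print("imputed log2 fold changes:", [round(x, 2) for x in imputed_lfcs])
print("spread across seeds:", round(spread, 2))
```

The spread across seeds is the kind of imputation-driven variability that the abstract says can affect the subsequent differential expression analysis when many values are missing.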

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/
