Bayesian methods for proteomic biomarker development

Belinda Hernández; Stephen R Pennington; Andrew C Parnell

EuPA Open Proteomics (Dec 2015)

Bayesian methods for proteomic biomarker development

Belinda Hernández,
Stephen R Pennington,
Andrew C Parnell

Affiliations

Belinda Hernández: School of Mathematical Sciences (Statistics), University College Dublin, Belfield Campus, Dublin 4, Ireland; School of Medicine and Medical Science, UCD Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Belfield Campus, Dublin 4, Ireland; Corresponding author.
Stephen R Pennington: School of Mathematical Sciences (Statistics), University College Dublin, Belfield Campus, Dublin 4, Ireland; School of Medicine and Medical Science, UCD Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Belfield Campus, Dublin 4, Ireland
Andrew C Parnell: School of Mathematical Sciences (Statistics), University College Dublin, Belfield Campus, Dublin 4, Ireland; Insight: The National Centre for Data Analytics, University College Dublin, Belfield Campus, Dublin 4, Ireland

Journal volume & issue: Vol. 9
pp. 54 – 64

Abstract

Read online

The advent of liquid chromatography mass spectrometry has seen a dramatic increase in the amount of data derived from proteomic biomarker discovery. These experiments have seemingly identified many potential candidate biomarkers. Frustratingly, very few of these candidates have been evaluated and validated sufficiently such that that they have progressed to the stage of routine clinical use. It is becoming apparent that the statistical methods used to evaluate the performance of new candidate biomarkers are a major limitation in their development. Bayesian methods offer some advantages over traditional statistical and machine learning methods. In particular they can incorporate external information into current experiments so as to guide biomarker selection. Further, they can be more robust to over-fitting than other approaches, especially when the number of samples used for discovery is relatively small.In this review we provide an introduction to Bayesian inference and demonstrate some of the advantages of using a Bayesian framework. We summarize how Bayesian methods have been used previously in proteomics and other areas of bioinformatics. Finally, we describe some popular and emerging Bayesian models from the statistical literature and provide a worked tutorial including code snippets to show how these methods may be applied for the evaluation of proteomic biomarkers. Keywords: Bayesian statistics, R, proteomics biomarker discovery, LC–MS

Published in EuPA Open Proteomics

ISSN: 2212-9685 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Science: Biology (General): Genetics
Website: http://www.journals.elsevier.com/eupa-open-proteomics/

About the journal