Scientific Reports (Oct 2019)

An algorithm-based meta-analysis of genome- and proteome-wide data identifies a combination of potential plasma biomarkers for colorectal cancer

  • Danuta R. Gawel,
  • Eun Jung Lee,
  • Xinxiu Li,
  • Sandra Lilja,
  • Andreas Matussek,
  • Samuel Schäfer,
  • Renate Slind Olsen,
  • Margaretha Stenmarker,
  • Huan Zhang,
  • Mikael Benson

DOI
https://doi.org/10.1038/s41598-019-51999-9
Journal volume & issue
Vol. 9, no. 1
pp. 1 – 12

Abstract

Screening programs for colorectal cancer (CRC) often rely on detection of blood in stools, which is nonspecific and leads to a large number of colonoscopies in healthy subjects. Painstaking research has identified many different types of biomarkers, few of which are in general clinical use. Here, we searched for highly accurate combinations of biomarkers by meta-analyses of genome- and proteome-wide data from CRC tumors. We focused on secreted proteins identified by the Human Protein Atlas and used our recently described algorithms to find optimal combinations of proteins. We identified nine proteins, three of which had previously been identified as potential biomarkers for CRC, namely CEACAM5, LCN2 and TRIM28. The remaining proteins were PLOD1, MAD1L1, P4HA1, GNS, C12orf10 and P3H1. We analyzed these proteins in plasma from 80 patients with newly diagnosed CRC and 80 healthy controls. A combination of four of these proteins, TRIM28, PLOD1, CEACAM5 and P4HA1, separated a training set consisting of 90% of the patients and 90% of the controls with high accuracy, which was verified in a test set consisting of the remaining 10% of each group. Further studies are warranted to test our algorithms and proteins for early CRC diagnosis.
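
The 90%/10% division of patients and controls can be illustrated with a minimal sketch. It assumes the split was made separately within each group (a stratified split), which the abstract suggests but does not spell out. The function name `stratified_split`, the random seed, and the label strings are hypothetical, and the sketch covers only the sample partitioning, not the authors' algorithm for selecting protein combinations.

```python
import random


def stratified_split(labels, train_fraction=0.9, seed=0):
    """Split sample indices into train/test sets, class by class.

    Hypothetical helper for illustration; not the authors' code.
    """
    rng = random.Random(seed)
    train, test = [], []
    for cls in sorted(set(labels)):
        # Collect and shuffle the indices belonging to this class only.
        idx = [i for i, lab in enumerate(labels) if lab == cls]
        rng.shuffle(idx)
        cut = round(len(idx) * train_fraction)
        train.extend(idx[:cut])
        test.extend(idx[cut:])
    return sorted(train), sorted(test)


def count(indices, labels, cls):
    """Number of samples of class `cls` among the given indices."""
    return sum(1 for i in indices if labels[i] == cls)


# Group sizes taken from the abstract: 80 CRC patients, 80 healthy controls.
labels = ["CRC"] * 80 + ["control"] * 80
train_idx, test_idx = stratified_split(labels)

print(count(train_idx, labels, "CRC"), count(train_idx, labels, "control"))  # 72 72
print(count(test_idx, labels, "CRC"), count(test_idx, labels, "control"))    # 8 8
```

Under this assumption, the training set holds 72 patients and 72 controls, and the test set holds 8 of each.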