BMC Cancer (Apr 2009)

Prediction of breast cancer by profiling of urinary RNA metabolites using Support Vector Machine-based feature selection

  • Schwab Matthias,
  • Gleiter Christoph H,
  • Laufer Stefan,
  • Neubauer Hans,
  • Seeger Harald,
  • Friese Natascha,
  • Fux Richard,
  • Bullinger Dino,
  • Henneges Carsten,
  • Zell Andreas,
  • Kammerer Bernd

DOI
https://doi.org/10.1186/1471-2407-9-104
Journal volume & issue
Vol. 9, no. 1
p. 104

Abstract

Read online

Abstract Background Breast cancer belongs to the most frequent and severe cancer types in human. Since excretion of modified nucleosides from increased RNA metabolism has been proposed as a potential target in pathogenesis of breast cancer, the aim of the present study was to elucidate the predictability of breast cancer by means of urinary excreted nucleosides. Methods We analyzed urine samples from 85 breast cancer women and respective healthy controls to assess the metabolic profiles of nucleosides by a comprehensive bioinformatic approach. All included nucleosides/ribosylated metabolites were isolated by cis-diol specific affinity chromatography and measured with liquid chromatography ion trap mass spectrometry (LC-ITMS). A valid set of urinary metabolites was selected by exclusion of all candidates with poor linearity and/or reproducibility in the analytical setting. The bioinformatic tool of Oscillating Search Algorithm for Feature Selection (OSAF) was applied to iteratively improve features for training of Support Vector Machines (SVM) to better predict breast cancer. Results After identification of 51 nucleosides/ribosylated metabolites in the urine of breast cancer women and/or controls by LC- ITMS coupling, a valid set of 35 candidates was selected for subsequent computational analyses. OSAF resulted in 44 pairwise ratios of metabolite features by iterative optimization. Based on this approach ultimately estimates for sensitivity and specificity of 83.5% and 90.6% were obtained for best prediction of breast cancer. The classification performance was dominated by metabolite pairs with SAH which highlights its importance for RNA methylation in cancer pathogenesis. Conclusion Extensive RNA-pathway analysis based on mass spectrometric analysis of metabolites and subsequent bioinformatic feature selection allowed for the identification of significant metabolic features related to breast cancer pathogenesis. The combination of mass spectrometric analysis and subsequent SVM-based feature selection represents a promising tool for the development of a non-invasive prediction system.