iScience (Mar 2023)

Using biological constraints to improve prediction in precision oncology

  • Mohamed Omar,
  • Wikum Dinalankara,
  • Lotte Mulder,
  • Tendai Coady,
  • Claudio Zanettini,
  • Eddie Luidy Imada,
  • Laurent Younes,
  • Donald Geman,
  • Luigi Marchionni

Journal volume & issue
Vol. 26, no. 3
p. 106108

Abstract

Read online

Summary: Many gene signatures have been developed by applying machine learning (ML) on omics profiles, however, their clinical utility is often hindered by limited interpretability and unstable performance. Here, we show the importance of embedding prior biological knowledge in the decision rules yielded by ML approaches to build robust classifiers. We tested this by applying different ML algorithms on gene expression data to predict three difficult cancer phenotypes: bladder cancer progression to muscle-invasive disease, response to neoadjuvant chemotherapy in triple-negative breast cancer, and prostate cancer metastatic progression. We developed two sets of classifiers: mechanistic, by restricting the training to features capturing specific biological mechanisms; and agnostic, in which the training did not use any a priori biological information. Mechanistic models had a similar or better testing performance than their agnostic counterparts, with enhanced interpretability. Our findings support the use of biological constraints to develop robust gene signatures with high translational potential.

Keywords