PLoS ONE (Jan 2021)

A transfer learning approach to facilitate ComBat-based harmonization of multicentre radiomic features in new datasets.

  • Ronrick Da-Ano,
  • François Lucia,
  • Ingrid Masson,
  • Ronan Abgral,
  • Joanne Alfieri,
  • Caroline Rousseau,
  • Augustin Mervoyer,
  • Caroline Reinhold,
  • Olivier Pradier,
  • Ulrike Schick,
  • Dimitris Visvikis,
  • Mathieu Hatt

DOI
https://doi.org/10.1371/journal.pone.0253653
Journal volume & issue
Vol. 16, no. 7
p. e0253653

Abstract

Read online

PurposeTo facilitate the demonstration of the prognostic value of radiomics, multicenter radiomics studies are needed. Pooling radiomic features of such data in a statistical analysis is however challenging, as they are sensitive to the variability in scanner models, acquisition protocols and reconstruction settings, which is often unavoidable in a multicentre retrospective analysis. A statistical harmonization strategy called ComBat was utilized in radiomics studies to deal with the "center-effect". The goal of the present work was to integrate a transfer learning (TL) technique within ComBat-and recently developed alternate versions of ComBat with improved flexibility (M-ComBat) and robustness (B-ComBat)-to allow the use of a previously determined harmonization transform to the radiomic feature values of new patients from an already known center.Material and methodsThe proposed TL approach were incorporated in the four versions of ComBat (standard, B, M, and B-M ComBat). The proposed approach was evaluated using a dataset of 189 locally advanced cervical cancer patients from 3 centers, with magnetic resonance imaging (MRI) and positron emission tomography (PET) images, with the clinical endpoint of predicting local failure. The impact performance of the TL approach was evaluated by comparing the harmonization achieved using only parts of the data to the reference (harmonization achieved using all the available data). It was performed through three different machine learning pipelines.ResultsThe proposed TL technique was successful in harmonizing features of new patients from a known center in all versions of ComBat, leading to predictive models reaching similar performance as the ones developed using the features harmonized with all the data available.ConclusionThe proposed TL approach enables applying a previously determined ComBat transform to new, previously unseen data.