Radiation Oncology (May 2021)

Quality assurance of radiotherapy in the ongoing EORTC 1420 “Best of” trial for early stage oropharyngeal, supraglottic and hypopharyngeal carcinoma: results of the benchmark case procedure

  • J-J Stelmes,
  • E. Vu,
  • V. Grégoire,
  • C. Simon,
  • E. Clementel,
  • J. Kazmierska,
  • W. Grant,
  • M. Ozsahin,
  • M. Tomsej,
  • L. Vieillevigne,
  • C. Fortpied,
  • E. C. Hurkmans,
  • A. Branquinho,
  • N. Andratschke,
  • F. Zimmermann,
  • D.-C. Weber

DOI
https://doi.org/10.1186/s13014-021-01809-2
Journal volume & issue
Vol. 16, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Introduction The current phase III EORTC 1420 Best-of trial (NCT02984410) compares the swallowing function after transoral surgery versus intensity modulated radiotherapy (RT) in patients with early-stage carcinoma of the oropharynx, supraglottis and hypopharynx. We report the analysis of the Benchmark Case (BC) procedures before patient recruitment with special attention to dysphagia/aspiration related structures (DARS). Materials and methods Submitted RT volumes and plans from participating centers were analyzed and compared against the gold-standard expert delineations and dose distributions. Descriptive analysis of protocol deviations was conducted. Mean Sorensen-Dice similarity index (mDSI) and Hausdorff distance (mHD) were applied to evaluate the inter-observer variability (IOV). Results 65% (23/35) of the institutions needed more than one submission to achieve Quality assurance (RTQA) clearance. OAR volume delineations were the cause for rejection in 53% (40/76) of cases. IOV could be improved in 5 out of 12 OARs by more than 10 mm after resubmission (mHD). Despite this, final IOV for critical OARs in delineation remained significant among DARS by choosing an aleatory threshold of 0.7 (mDSI) and 15 mm (mHD). Conclusions This is to our knowledge the largest BC analysis among Head and neck RTQA programs performed in the framework of a prospective trial. Benchmarking identified non-common OARs and target delineations errors as the main source of deviations and IOV could be reduced in a significant number of cases after this process. Due to the substantial resources involved with benchmarking, future benchmark analyses should assess fully the impact on patients’ clinical outcome.

Keywords