Frontiers in Medicine (Jul 2022)

Machine learning-based improvement of an online rheumatology referral and triage system

  • Johannes Knitza,
  • Johannes Knitza,
  • Johannes Knitza,
  • Lena Janousek,
  • Felix Kluge,
  • Cay Benedikt von der Decken,
  • Cay Benedikt von der Decken,
  • Cay Benedikt von der Decken,
  • Stefan Kleinert,
  • Stefan Kleinert,
  • Stefan Kleinert,
  • Wolfgang Vorbrüggen,
  • Wolfgang Vorbrüggen,
  • Arnd Kleyer,
  • Arnd Kleyer,
  • David Simon,
  • David Simon,
  • Axel J. Hueber,
  • Axel J. Hueber,
  • Felix Muehlensiepen,
  • Felix Muehlensiepen,
  • Nicolas Vuillerme,
  • Nicolas Vuillerme,
  • Nicolas Vuillerme,
  • Georg Schett,
  • Georg Schett,
  • Bjoern M. Eskofier,
  • Martin Welcker,
  • Martin Welcker,
  • Peter Bartz-Bazzanella,
  • Peter Bartz-Bazzanella

DOI
https://doi.org/10.3389/fmed.2022.954056
Journal volume & issue
Vol. 9

Abstract

Read online

IntroductionRheport is an online rheumatology referral system allowing automatic appointment triaging of new rheumatology patient referrals according to the respective probability of an inflammatory rheumatic disease (IRD). Previous research reported that Rheport was well accepted among IRD patients. Its accuracy was, however, limited, currently being based on an expert-based weighted sum score. This study aimed to evaluate whether machine learning (ML) models could improve this limited accuracy.Materials and methodsData from a national rheumatology registry (RHADAR) was used to train and test nine different ML models to correctly classify IRD patients. Diagnostic performance was compared of ML models and the current algorithm was compared using the area under the receiver operating curve (AUROC). Feature importance was investigated using shapley additive explanation (SHAP).ResultsA complete data set of 2265 patients was used to train and test ML models. 30.5% of patients were diagnosed with an IRD, 69.3% were female. The diagnostic accuracy of the current Rheport algorithm (AUROC of 0.534) could be improved with all ML models, (AUROC ranging between 0.630 and 0.737). Targeting a sensitivity of 90%, the logistic regression model could double current specificity (17% vs. 33%). Finger joint pain, inflammatory marker levels, psoriasis, symptom duration and female sex were the five most important features of the best performing logistic regression model for IRD classification.ConclusionIn summary, ML could improve the accuracy of a currently used rheumatology online referral system. Including further laboratory parameters and enabling individual feature importance adaption could increase accuracy and lead to broader usage.

Keywords