BMC Women's Health (Nov 2023)
Comparative diagnostic accuracy of the IOTA SRR and LR2 scoring systems for discriminating between malignant and Benign Adnexal masses by junior physicians in Chinese patients: a retrospective observational study
Abstract
Abstract Background The accuracy of ultrasound in distinguishing benign from malignant adnexal masses is highly correlated with the experience of ultrasound physicians. In China, most of ultrasound differentiation is done by junior physicians. Purpose To compare the diagnostic performance of the International Ovarian Tumour Analysis (IOTA) Simple Rules Risk (SRR) and IOTA Logistic Regression Model 2 (LR2) scoring systems in Chinese patients with adnexal masses. Methods Retrospective analysis of ovarian cancer tumor patients who underwent surgery at a hospital in China from January 2016 to December 2021. Screening patients with at least one adnexal mass on inclusion and exclusion criteria. Two trained junior physicians evaluated each mass using the two scoring systems. A receiver operating characteristic curve was used to test the diagnostic performance of each system. Results A total of 144 adnexal masses were retrospectively collected. Forty masses were histologically diagnosed as malignant. Compared with premenopausal women, postmenopausal women had a much higher rate of malignant masses. The sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) of the SRR was 97.5% (95% CI: 86.8 -99.9%), 82.7% (95% CI: 74.0 -89.4%), 68.4% (95% CI: 58.7 -76.8%) and 98.9% (95% CI: 92.5 -99.8%). The sensitivity, specificity, PPV, NPV of the LR2 were 90.0% (95% CI: 76.5 -97.2%), 89.4% (95% CI: 81.9 -94.6%), 76.6% (95% CI: 65.0 -85.2%), and 95.9% (95% CI: 90.2 -98.3%). There was good agreement between two scoring systems, with 84.03% total agreement and a kappa value of 0.783 (95% CI: 0.70-0.864). The areas under the curve for predicting malignant tumours using SRR and LR2 were similar for all patients (P > 0.05 ). Conclusion The two scoring systems can effectively distinguish benign from malignant adnexal masses. Both scoring systems have high diagnostic efficacy, and diagnostic efficacy is stable, which can provide an important reference for clinical decision making.
Keywords