Comparison of six machine learning methods for differentiating benign and malignant thyroid nodules using ultrasonographic characteristics

Jianguang Liang; Tiantian Pang; Weixiang Liu; Xiaogang Li; Leidan Huang; Xuehao Gong; Xianfen Diao

doi:10.1186/s12880-023-01117-z

BMC Medical Imaging (Oct 2023)

Affiliations

Jianguang Liang: School of Pharmacy & School of Biological and Food Engineering, Changzhou University
Tiantian Pang: Health Science Center, Shenzhen University
Weixiang Liu: Health Science Center, Shenzhen University
Xiaogang Li: Health Science Center, Shenzhen University
Leidan Huang: Guangzhou Medical University
Xuehao Gong: Department of Ultrasound, First Affiliated Hospital of Shenzhen University, Second People’s Hospital of Shenzhen
Xianfen Diao: Health Science Center, Shenzhen University

DOI: https://doi.org/10.1186/s12880-023-01117-z
Journal volume & issue: Vol. 23, no. 1
pp. 1 – 6

Abstract

Background: Several machine learning (ML) classifiers for thyroid nodule diagnosis have been compared in terms of their accuracy, sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), and area under the receiver operating characteristic curve (AUC).

Methods: A total of 525 patients with thyroid nodules (malignant, n = 228; benign, n = 297) underwent conventional ultrasonography, strain elastography, and contrast-enhanced ultrasound. Six algorithms were compared: support vector machine (SVM), linear discriminant analysis (LDA), random forest (RF), logistic regression (LG), GlmNet, and K-nearest neighbors (K-NN). The diagnostic performance of 13 suspicious sonographic features for discriminating benign from malignant thyroid nodules was assessed with each ML algorithm. To compare the algorithms, a 10-fold cross-validation paired t-test was applied to the differences in algorithm performance.

Results: The logistic regression algorithm had the best diagnostic performance among the ML algorithms, although its performance was only slightly higher than that of GlmNet, LDA, and RF. The accuracy, sensitivity, specificity, NPV, PPV, and AUC obtained with logistic regression were 86.48%, 83.33%, 88.89%, 87.42%, 85.20%, and 92.84%, respectively.

Conclusions: The experimental results indicate that GlmNet, SVM, LDA, LG, K-NN, and RF differ only slightly in classification performance.
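As an illustration of how the threshold-based metrics in the abstract relate to one another, the following minimal Python sketch computes them from a 2x2 confusion matrix and also shows the form of a paired t-statistic over per-fold scores. The counts used (TP = 190, FN = 38, TN = 264, FP = 33) are an assumption: they are back-calculated from the reported percentages and the 228 malignant / 297 benign split, and are not stated in the source. The function names are hypothetical, and this is not the authors' code. AUC is omitted because it depends on the ranked prediction scores rather than on a single confusion matrix.

```python
import math
from statistics import mean, stdev


def diagnostic_metrics(tp, fn, tn, fp):
    """Return threshold-based diagnostic metrics as fractions in [0, 1]."""
    return {
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        "sensitivity": tp / (tp + fn),   # malignant nodules correctly flagged
        "specificity": tn / (tn + fp),   # benign nodules correctly cleared
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
    }


def paired_t_statistic(scores_a, scores_b):
    """Paired t-statistic for two classifiers scored on the same k folds.

    The statistic has k - 1 degrees of freedom. It is undefined when all
    per-fold differences are identical (zero standard deviation).
    """
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    return mean(diffs) / (stdev(diffs) / math.sqrt(len(diffs)))


# ASSUMPTION: counts inferred from the reported percentages, not given in the source.
metrics = diagnostic_metrics(tp=190, fn=38, tn=264, fp=33)
for name, value in metrics.items():
    print(f"{name}: {100 * value:.2f}%")
```

With these inferred counts, the computed values round to the accuracy, sensitivity, specificity, PPV, and NPV reported for logistic regression. To apply `paired_t_statistic`, one would pass two equal-length lists of per-fold scores (for example, ten accuracies per classifier from the same 10 folds); the per-fold values themselves are not available in this record.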

Published in BMC Medical Imaging

ISSN: 1471-2342 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Medical technology
Website: http://bmcmedimaging.biomedcentral.com
