Diagnostics (Sep 2021)

The TVGH-NYCU Thal-Classifier: Development of a Machine-Learning Classifier for Differentiating Thalassemia and Non-Thalassemia Patients

  • Yi-Kai Fu,
  • Hsueng-Mei Liu,
  • Li-Hsuan Lee,
  • Ying-Ju Chen,
  • Sheng-Hsuan Chien,
  • Jeong-Shi Lin,
  • Wen-Chun Chen,
  • Ming-Hsuan Cheng,
  • Po-Heng Lin,
  • Jheng-You Lai,
  • Chyong-Mei Chen,
  • Chun-Yu Liu

DOI
https://doi.org/10.3390/diagnostics11091725
Journal volume & issue
Vol. 11, no. 9
p. 1725

Abstract

Thalassemia and iron deficiency are the most common etiologies of microcytic anemia, and several indices derived from routine automated blood cell counters have been proposed to discriminate between the two. In this study, a new classifier for discriminating thalassemia from non-thalassemia microcytic anemia was generated by combining existing indices with machine-learning techniques. The anemia diagnoses, complete blood cell counts, and hemoglobin gene profiles of 350 Taiwanese adult patients were retrospectively reviewed. Thirteen previously established indices were applied to the current cohort, and their sensitivity, specificity, and positive and negative predictive values were calculated. A support vector machine (SVM) with a Monte-Carlo cross-validation procedure was adopted to generate the classifier. The performance of our classifier was compared with that of the original indices by calculating the average classification error rate and area under the curve (AUC) over the sampled datasets. The SVM model achieved an average AUC of 0.76 and an average error rate of 0.26, surpassing all other indices. In conclusion, we developed a convenient tool for primary-care physicians to use when the differential diagnosis includes thalassemia in the Taiwanese adult population. This approach needs to be validated in other studies or larger databases.
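
The evaluation described in the abstract (per-index sensitivity, specificity, and predictive values, then an average error rate and AUC over repeated random splits) can be illustrated with a minimal sketch. This is not the authors' implementation. Several things here are assumptions made for illustration only: the Mentzer index (MCV divided by RBC count) is used as the example index, although the abstract does not say whether it is among the thirteen; a single learned cutoff on that index stands in for the SVM; the split count, test fraction, and all function names are hypothetical; and the cohort is synthetic, not patient data.

```python
import math
import random


def mentzer_index(mcv_fl, rbc_millions_per_ul):
    """Mentzer index: MCV (fL) divided by RBC count (10^6 cells/uL)."""
    return mcv_fl / rbc_millions_per_ul


def diagnostic_metrics(y_true, y_pred):
    """Sensitivity, specificity, PPV, NPV and error rate from binary labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    tn = sum(1 for t, p in zip(y_true, y_pred) if not t and not p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)

    def ratio(num, den):
        return num / den if den else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
        "error_rate": ratio(fp + fn, len(y_true)),
    }


def auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for t, s in zip(y_true, scores) if t]
    neg = [s for t, s in zip(y_true, scores) if not t]
    if not pos or not neg:
        return float("nan")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def fit_cutoff(train):
    """Pick the index cutoff with the lowest training error (a stand-in for the SVM)."""
    def training_errors(cutoff):
        return sum((idx < cutoff) != bool(label) for idx, label in train)

    return min(sorted({idx for idx, _ in train}), key=training_errors)


def monte_carlo_cv(samples, n_splits=100, test_fraction=0.3, seed=0):
    """Repeated random train/test splits; returns (mean error rate, mean AUC)."""
    rng = random.Random(seed)
    errors, aucs = [], []
    for _ in range(n_splits):
        shuffled = list(samples)
        rng.shuffle(shuffled)
        n_test = max(1, int(len(shuffled) * test_fraction))
        test, train = shuffled[:n_test], shuffled[n_test:]
        cutoff = fit_cutoff(train)
        y_true = [label for _, label in test]
        # A lower index is treated as more thalassemia-like, so predict below the cutoff.
        y_pred = [idx < cutoff for idx, _ in test]
        errors.append(diagnostic_metrics(y_true, y_pred)["error_rate"])
        split_auc = auc(y_true, [-idx for idx, _ in test])
        if not math.isnan(split_auc):
            aucs.append(split_auc)
    mean_auc = sum(aucs) / len(aucs) if aucs else float("nan")
    return sum(errors) / len(errors), mean_auc


def make_synthetic_cohort(n=350, seed=1):
    """SYNTHETIC data with made-up distributions; not the study cohort."""
    rng = random.Random(seed)
    cohort = []
    for _ in range(n):
        is_thal = rng.random() < 0.5
        mcv = rng.gauss(68, 5) if is_thal else rng.gauss(74, 6)
        rbc = max(1.0, rng.gauss(5.8, 0.5) if is_thal else rng.gauss(4.5, 0.5))
        cohort.append((mentzer_index(mcv, rbc), is_thal))
    return cohort


if __name__ == "__main__":
    mean_error, mean_auc = monte_carlo_cv(make_synthetic_cohort())
    print(f"mean error rate: {mean_error:.2f}, mean AUC: {mean_auc:.2f}")
```

Because the cohort and the classifier are both placeholders, the numbers this sketch prints say nothing about the reported AUC of 0.76 or error rate of 0.26; it only shows how averaging over random splits yields a single summary figure per method.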

Keywords