Machine learning-based detection of medical service anomalies: Kazakhstan’s health insurance data

Maksut Kulzhanov; Alexander Wagner; Abylkair Skakov; Iliyas Mukhamejan; Saya Zhorabek; Ainur B. Qumar

doi:10.47316/cajmhe.2025.6.2.07

Central Asian Journal of Medical Hypotheses and Ethics (Jun 2025)

Affiliations

Maksut Kulzhanov: Department of Health Policy and Management, Asfendiyarov Kazakh National Medical University, Almaty, Kazakhstan
Alexander Wagner: Department of Health Policy and Management, Asfendiyarov Kazakh National Medical University, Almaty, Kazakhstan
Abylkair Skakov: JSC “Social Health Insurance Fund”, Almaty, Kazakhstan
Iliyas Mukhamejan: Department of Health Policy and Management, Asfendiyarov Kazakh National Medical University, Almaty, Kazakhstan; JSC “Social Health Insurance Fund”, Almaty, Kazakhstan
Saya Zhorabek: Department of Health Policy and Management, Asfendiyarov Kazakh National Medical University, Almaty, Kazakhstan
Ainur B. Qumar: Department of Health Policy and Management, Asfendiyarov Kazakh National Medical University, Almaty, Kazakhstan

DOI: https://doi.org/10.47316/cajmhe.2025.6.2.07
Journal volume & issue: Vol. 6, no. 2
pp. 133–141

Abstract

Background. With the exponential growth of medical data and limited analytical resources, healthcare systems are increasingly adopting Artificial Intelligence (AI) and Machine Learning (ML) technologies to support decision-making. This study applies ML algorithms to data from the Republic of Kazakhstan’s Obligatory Health Insurance Fund (OHIF) to automatically detect anomalies in the structure of delivered medical services.

Methods. An automated AI system was developed and tested using nine ML models, including XGBoost, Random Forest, Decision Tree, and Gradient Boosting. The dataset comprised 329,584 real records with demographic and socio-economic parameters. Model performance was evaluated using accuracy, precision, recall, F1-score, and the area under the curve (AUC). ROC and PR curves were used for visual validation.

Results. Among the tested models, XGBoost and XGB_grid_model achieved the highest performance, with accuracy of 93.2% and 93.6%, precision of 91.8% and 92.1%, recall of 90.4% and 91.3%, F1-score of 91.1% and 91.7%, and AUC of 0.874 and 0.882, respectively. These models reliably detected irregularities such as billing duplications, out-of-pattern service provision, and inconsistencies with demographic profiles.

Conclusion. The results demonstrate the feasibility of using ML for automated medical billing control. This approach can significantly enhance the transparency, accuracy, and accountability of healthcare financing in Kazakhstan, laying the groundwork for broader AI integration in national health systems.
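For readers less familiar with the evaluation metrics named in the abstract, the following is a minimal sketch of how accuracy, precision, recall, F1-score, and ROC AUC are computed for a binary anomaly label. It is not the authors' pipeline: the function names are hypothetical, the toy labels and scores are invented for illustration and are not drawn from the OHIF dataset, and the 0.5 decision threshold is an assumption. The last helper only checks that the reported F1-scores follow from the reported precision and recall values.

```python
from typing import Dict, Sequence


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 for binary labels (1 = anomaly)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1_from(precision, recall),
    }


def f1_from(precision: float, recall: float) -> float:
    """F1 is the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    Ties count as half a win. This is the pairwise (Mann-Whitney) form,
    which is quadratic in sample size and meant only for small examples.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data, invented for illustration only (not from the study).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6, 0.1, 0.7]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)

# Consistency check using the precision and recall reported in the abstract.
f1_xgboost = round(f1_from(0.918, 0.904), 3)
f1_xgb_grid = round(f1_from(0.921, 0.913), 3)
```

Applied to the reported precision and recall, the harmonic-mean formula reproduces the reported F1-scores of 91.1% and 91.7% to one decimal place.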

Published in Central Asian Journal of Medical Hypotheses and Ethics

ISSN: 2708-9800 (Online)
Publisher: South Kazakhstan Medical Academy
Country of publisher: Kazakhstan
LCC subjects: Medicine: Medicine (General): Medical philosophy. Medical ethics
Website: https://cajmhe.com/index.php/journal
