International Journal of Computational Intelligence Systems (Jul 2024)

Optimizing Patient Stratification in Healthcare: A Comparative Analysis of Clustering Algorithms for EHR Data

  • Abeer Aljohani

DOI
https://doi.org/10.1007/s44196-024-00568-8
Journal volume & issue
Vol. 17, no. 1
pp. 1 – 23

Abstract

Read online

Abstract Advanced data analytics are increasingly being employed in healthcare research to improve patient classification and personalize medicinal therapies. In this paper, we focus on the critical problem of clustering electronic health record (EHR) data to enable appropriate patient categorization. In the era of personalized medicine, optimizing patient classification is critical to healthcare analytics. This research presents a comparative assessment of different clustering algorithms for Electronic Health Record (EHR) data, with the goal of improving the efficacy and productivity of patient clustering methods. Our study focuses on Fuzzy Technique for Order of Preference by Similarity to Ideal Solution (Fuzzy TOPSIS) as a Multi-Criteria Decision-Making (MCDM) strategy, includes an in-depth assessment of eight clustering algorithms: K-Means, DBSCAN, Hierarchical Clustering, Mean Shift, Affinity Propagation, Spectral Clustering, Gaussian Mixture Models (GMM), as well as Self-Organizing Maps. The evaluation factors used for evaluation in this research are Cluster Quality Metrics, Scalability, Robustness to Noise, Cluster Shape and Density, Interpretability, Cluster Number, Dimensionality, and Consistency and Stability. These criteria and alternatives were chosen after conducting a thorough assessment of the literature and consulting with domain experts. All participated specialists actively engaged in the decision-making process, bringing unique insights into the best clustering algorithms for healthcare data. The results of this study illustrate each algorithm’s strengths and weaknesses in the setting of patient stratification, providing insight into their performance across multiple dimensions. The fuzzy TOPSIS MCDM strategy is a reliable instrument for synthesizing expert opinions and methodically evaluating the found clustering alternatives. This study advances healthcare analytics by giving practitioners and researchers with informative perspectives on the selection of clustering algorithms designed to address the unique problems of patient stratification utilizing EHR data.

Keywords