Development of a patients’ satisfaction analysis system using machine learning and lexicon-based methods

Shiva Khaleghparast; Majid Maleki; Ghasem Hajianfar; Esmaeil Soumari; Mehrdad Oveisi; Hassan Maleki Golandouz; Feridoun Noohi; Maziar Gholampour dehaki; Reza Golpira; Saeideh Mazloomzadeh; Maedeh Arabian; Samira Kalayinia

doi:10.1186/s12913-023-09260-7

BMC Health Services Research (Mar 2023)

Development of a patients’ satisfaction analysis system using machine learning and lexicon-based methods

Shiva Khaleghparast,
Majid Maleki,
Ghasem Hajianfar,
Esmaeil Soumari,
Mehrdad Oveisi,
Hassan Maleki Golandouz,
Feridoun Noohi,
Maziar Gholampour dehaki,
Reza Golpira,
Saeideh Mazloomzadeh,
Maedeh Arabian,
Samira Kalayinia

Affiliations

Shiva Khaleghparast: Cardiovascular Nursing Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Majid Maleki: Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Ghasem Hajianfar: Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Esmaeil Soumari: Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Mehrdad Oveisi: Department of Computer Science, University of British Columbia
Hassan Maleki Golandouz: Department of Community Health Sciences, University of Manitoba
Feridoun Noohi: Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Maziar Gholampour dehaki: Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Reza Golpira: Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Saeideh Mazloomzadeh: Cardiovascular Nursing Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Maedeh Arabian: Cardiovascular Nursing Research Center, Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences
Samira Kalayinia: Cardiogenetic Research Center, Medical and Research Center, Rajaie Cardiovascular, University of Medical Sciences

DOI: https://doi.org/10.1186/s12913-023-09260-7
Journal volume & issue: Vol. 23, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Patients’ rights are integral to medical ethics. This study aimed to perform sentiment analysis and opinion mining on patients’ messages by a combination of lexicon-based and machine learning methods to identify positive or negative comments and to determine the different ward and staff names mentioned in patients’ messages. Methods The level of satisfaction and observance of the rights of 250 service recipients of the hospital was evaluated through the related checklists by the evaluator. In total, 822 Persian messages, composed of 540 negative and 282 positive comments, were collected and labeled by the evaluator. Pre-processing was performed on the messages and followed by 2 feature vectors which were extracted from the messages, including the term frequency–inverse document frequency (TFIDF) vector and a combination of the multifeature (MF) (a lexicon-based method) and TFIDF (MF + TFIDF) vectors. Six feature selectors and 5 classifiers were used in this study. For the evaluations, 5-fold cross-validation with different metrics including area under the receiver operating characteristic curve (AUC), accuracy (ACC), F1 score, sensitivity (SEN), specificity (SPE) and Precision-Recall Curves (PRC) were reported. Message tag detection, which featured different hospital wards and identified staff names mentioned in the study patients’ messages, was implemented by the lexicon-based method. Results The best classifier was Multinomial Naïve Bayes in combination with MF + TFIDF feature vector and SelectFromModel (SFM) feature selection (ACC = 0.89 ± 0.03, AUC = 0.87 ± 0.03, F1 = 0.92 ± 0.03, SEN = 0.93 ± 0.04, and SPE = 0.82 ± 0.02, PRC-AUC = 0.97). Two methods of assessment by the evaluator and artificial intelligence as well as survey systems were compared. Conclusion Our results demonstrated that the lexicon-based method, in combination with machine learning classifiers, could extract sentiments in patients’ comments and classify them into positive and negative categories. We also developed an online survey system to analyze patients’ satisfaction in different wards and to remove conventional assessments by the evaluator.

Published in BMC Health Services Research

ISSN: 1472-6963 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Public aspects of medicine
Website: http://bmchealthservres.biomedcentral.com

About the journal

Abstract

Keywords