Applied Computer Systems (Jun 2023)

PhoBERT: Application in Disease Classification based on Vietnamese Symptom Analysis

  • Nguyen Hai Thanh,
  • Huynh Tuyet Ngoc,
  • Mai Nhi Thien Ngoc,
  • Le Khoa Dang Dang,
  • Thi-Ngoc-Diem Pham

DOI
https://doi.org/10.2478/acss-2023-0004
Journal volume & issue
Vol. 28, no. 1
pp. 35 – 43

Abstract

Read online

Besides the successful use of support software in cutting-edge medical procedures, the significance of determining a disease early signs and symptoms before its detection is a growing pressing requirement to raise the standard of medical examination and treatment. This creates favourable conditions, reduces patient inconvenience and hospital overcrowding. Before transferring patients to an appropriate doctor, healthcare staff must have the patient’s symptoms. This study leverages the PhoBERT model to assist in classifying patients with text classification tasks based on symptoms they provided in the first stages of Vietnamese hospital admission. The outcomes of PhoBERT on more than 200 000 text-based symptoms collected from Vietnamese hospitals can improve the classification performance compared to Bag of Words (BOW) with classic machine learning algorithms, and some considered deep learning architectures such as 1D-Convolutional Neural Networks and Long Short-Term Memory. The proposed method can achieve promising results to be deployed in automatic hospital admission procedures in Vietnam.

Keywords