MediGPT: Exploring Potentials of Conventional and Large Language Models on Medical Data

Mohammad Abu Tareq Rony; Mohammad Shariful Islam; Tipu Sultan; Samah Alshathri; Walid El-Shafai

doi:10.1109/ACCESS.2024.3428918

IEEE Access (Jan 2024)

Affiliations

Mohammad Abu Tareq Rony: Department of Statistics, Noakhali Science & Technology University, Noakhali, Bangladesh
Mohammad Shariful Islam: Department of Computer Science and Telecommunication Engineering, Noakhali Science & Technology University, Noakhali, Bangladesh
Tipu Sultan: Department of Computer Science, College of Computer and Information Sciences, Fordham University, Bronx, NY, USA
Samah Alshathri: Department of Information Technology, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh, Saudi Arabia
Walid El-Shafai: Computer Science Department, Security Engineering Laboratory, Prince Sultan University, Riyadh, Saudi Arabia

DOI: https://doi.org/10.1109/ACCESS.2024.3428918
Journal volume & issue: Vol. 12, pp. 103473–103487

Abstract

Medical text classification organizes medical documents into categories to streamline information retrieval and support clinical decision-making. Traditional machine learning techniques, including pre-trained language models, are effective but require extensive domain-specific training data, often underperform across languages, and are costly and complex to deploy at scale. In this study, we employ four datasets: the Clinical Trials on Cancer dataset, comprising 6 million statements from interventional cancer clinical trial protocols; the Illness-dataset, consisting of 22,660 categorized tweets from 2018 and 2019; the Multi-View Active Learning for Short Medical Text Classification in User-Generated Data dataset, an extended version of the Illness-dataset that includes 22,660 documents from the same period; and the Symptom2Disease dataset, containing 1,200 data points used to predict diseases from symptom descriptions. We evaluate ChatGPT, specifically its ChatGPT-3.5 and ChatGPT-4 versions, as a viable alternative for classifying medical texts. We investigate essential aspects of this approach, including the construction of prompts, the parsing of responses, and the strategic use of different GPT models to optimize outcomes. In a comparative analysis against established methods such as pre-trained language model fine-tuning and prompt-tuning, our findings indicate that ChatGPT addresses these challenges efficiently and matches the performance of traditional methods. Furthermore, the proposed MediGPT (Medical Generative Pre-Trained Transformers) yields performance improvements of 14.3%, 22.3%, 13.6%, and 13.7% across the datasets, highlighting its adaptability and robustness in diverse medical text scenarios without the need for specialized domain adjustments. This research underscores the capability of ChatGPT to support a versatile AI framework for medical text processing, which could revolutionize medical informatics practices.
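
The prompt construction and response parsing mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the label set, the prompt wording, and the helper names `build_prompt` and `parse_label` are all hypothetical, and no model is called. The code only shows how a classification prompt might be assembled and how a free-form reply might be mapped back to a known category.

```python
import re

# Hypothetical label set; the abstract does not list the actual categories.
LABELS = ["allergy", "cold", "diabetes"]


def build_prompt(text, labels):
    """Compose a zero-shot classification prompt (the wording is an assumption)."""
    options = ", ".join(labels)
    return (
        "Classify the following medical text into exactly one of these "
        f"categories: {options}.\n"
        f"Text: {text}\n"
        "Answer with the category name only."
    )


def parse_label(response, labels, default=None):
    """Map a free-form model reply to one of the known labels.

    Returns the label that appears earliest in the reply as a whole word,
    or `default` when no known label is found.
    """
    cleaned = response.strip().lower()
    best = None  # (position, label) of the earliest match so far
    for label in labels:
        match = re.search(r"\b" + re.escape(label.lower()) + r"\b", cleaned)
        if match and (best is None or match.start() < best[0]):
            best = (match.start(), label)
    return best[1] if best else default
```

Returning a default when no label is found keeps unparseable replies separate from genuine predictions, so they can be counted or retried rather than silently misclassified.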

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
