A pilot feasibility study comparing large language models in extracting key information from ICU patient text records from an Irish population

Emma Urquhart; John Ryan; Sean Hartigan; Ciprian Nita; Ciara Hanley; Peter Moran; John Bates; Rachel Jooste; Conor Judge; John G. Laffey; Michael G. Madden; Bairbre A. McNicholas

doi:10.1186/s40635-024-00656-1

Intensive Care Medicine Experimental (Aug 2024)

A pilot feasibility study comparing large language models in extracting key information from ICU patient text records from an Irish population

Emma Urquhart,
John Ryan,
Sean Hartigan,
Ciprian Nita,
Ciara Hanley,
Peter Moran,
John Bates,
Rachel Jooste,
Conor Judge,
John G. Laffey,
Michael G. Madden,
Bairbre A. McNicholas

Affiliations

Emma Urquhart: School of Computer Science, University of Galway
John Ryan: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway
Sean Hartigan: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway
Ciprian Nita: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway
Ciara Hanley: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway
Peter Moran: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway
John Bates: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway
Rachel Jooste: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway
Conor Judge: School of Medicine, University of Galway
John G. Laffey: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway
Michael G. Madden: School of Computer Science, University of Galway
Bairbre A. McNicholas: Department of Anaesthesia and Intensive Care Medicine, University Hospital Galway

DOI: https://doi.org/10.1186/s40635-024-00656-1
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 9

Abstract

Read online

Abstract Background Artificial intelligence, through improved data management and automated summarisation, has the potential to enhance intensive care unit (ICU) care. Large language models (LLMs) can interrogate and summarise large volumes of medical notes to create succinct discharge summaries. In this study, we aim to investigate the potential of LLMs to accurately and concisely synthesise ICU discharge summaries. Methods Anonymised clinical notes from ICU admissions were used to train and validate a prompting structure in three separate LLMs (ChatGPT, GPT-4 API and Llama 2) to generate concise clinical summaries. Summaries were adjudicated by staff intensivists on ability to identify and appropriately order a pre-defined list of important clinical events as well as readability, organisation, succinctness, and overall rank. Results In the development phase, text from five ICU episodes was used to develop a series of prompts to best capture clinical summaries. In the testing phase, a summary produced by each LLM from an additional six ICU episodes was utilised for evaluation. Overall ability to identify a pre-defined list of important clinical events in the summary was 41.5 ± 15.2% for GPT-4 API, 19.2 ± 20.9% for ChatGPT and 16.5 ± 14.1% for Llama2 (p = 0.002). GPT-4 API followed by ChatGPT had the highest score to appropriately order a pre-defined list of important clinical events in the summary as well as readability, organisation, succinctness, and overall rank, whilst Llama2 scored lowest for all. GPT-4 API produced minor hallucinations, which were not present in the other models. Conclusion Differences exist in large language model performance in readability, organisation, succinctness, and sequencing of clinical events compared to others. All encountered issues with narrative coherence and omitted key clinical data and only moderately captured all clinically meaningful data in the correct order. However, these technologies suggest future potential for creating succinct discharge summaries.

Published in Intensive Care Medicine Experimental

ISSN: 2197-425X (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Medical emergencies. Critical care. Intensive care. First aid
Website: https://icm-experimental.springeropen.com/

About the journal

Abstract

Keywords