Journal of Medical Internet Research (May 2024)

Redefining Health Care Data Interoperability: Empirical Exploration of Large Language Models in Information Exchange

  • Dukyong Yoon,
  • Changho Han,
  • Dong Won Kim,
  • Songsoo Kim,
  • SungA Bae,
  • Jee An Ryu,
  • Yujin Choi

DOI
https://doi.org/10.2196/56614
Journal volume & issue
Vol. 26
p. e56614

Abstract

Read online

BackgroundEfficient data exchange and health care interoperability are impeded by medical records often being in nonstandardized or unstructured natural language format. Advanced language models, such as large language models (LLMs), may help overcome current challenges in information exchange. ObjectiveThis study aims to evaluate the capability of LLMs in transforming and transferring health care data to support interoperability. MethodsUsing data from the Medical Information Mart for Intensive Care III and UK Biobank, the study conducted 3 experiments. Experiment 1 assessed the accuracy of transforming structured laboratory results into unstructured format. Experiment 2 explored the conversion of diagnostic codes between the coding frameworks of the ICD-9-CM (International Classification of Diseases, Ninth Revision, Clinical Modification), and Systematized Nomenclature of Medicine Clinical Terms (SNOMED-CT) using a traditional mapping table and a text-based approach facilitated by the LLM ChatGPT. Experiment 3 focused on extracting targeted information from unstructured records that included comprehensive clinical information (discharge notes). ResultsThe text-based approach showed a high conversion accuracy in transforming laboratory results (experiment 1) and an enhanced consistency in diagnostic code conversion, particularly for frequently used diagnostic names, compared with the traditional mapping approach (experiment 2). In experiment 3, the LLM showed a positive predictive value of 87.2% in extracting generic drug names. ConclusionsThis study highlighted the potential role of LLMs in significantly improving health care data interoperability, demonstrated by their high accuracy and efficiency in data transformation and exchange. The LLMs hold vast potential for enhancing medical data exchange without complex standardization for medical terms and data structure.