Leveraging large language models for the deidentification and temporal normalization of sensitive health information in electronic health records

Hong-Jie Dai; Tatheer Hussain Mir; Ching-Tai Chen; Chien-Chang Chen; Hao-Ping Yang; Chung-Hong Lee; Yi-Yun Chou; Yu-Chin Teng; Shalini Gupta; Omkar Panchal; Divyabharathy Ramesh Nadar; Wei-Hsiang Liao; Yu-Chuan Lin; Zi-Rui Zhao; Richard Tzong-Han Tsai; Yung-Chun Chang; Jitendra Jonnagaddala

doi:10.1038/s41746-025-01921-7

npj Digital Medicine (Aug 2025)

Leveraging large language models for the deidentification and temporal normalization of sensitive health information in electronic health records

Hong-Jie Dai,
Tatheer Hussain Mir,
Ching-Tai Chen,
Chien-Chang Chen,
Hao-Ping Yang,
Chung-Hong Lee,
Yi-Yun Chou,
Yu-Chin Teng,
Shalini Gupta,
Omkar Panchal,
Divyabharathy Ramesh Nadar,
Wei-Hsiang Liao,
Yu-Chuan Lin,
Zi-Rui Zhao,
Richard Tzong-Han Tsai,
Yung-Chun Chang,
Jitendra Jonnagaddala

Affiliations

Hong-Jie Dai: Intelligent System Laboratory, Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Kaohsiung University of Science and Technology
Tatheer Hussain Mir: Intelligent System Laboratory, Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Kaohsiung University of Science and Technology
Ching-Tai Chen: Department of Bioinformatics and Medical Engineering, Asia University
Chien-Chang Chen: Electromagnetic Sensing Control and AI Computing System Laboratory, Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Kaohsiung University of Science and Technology
Hao-Ping Yang: Intelligent System Laboratory, Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Kaohsiung University of Science and Technology
Chung-Hong Lee: Knowledge Discovery and Data Mining Lab, Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Kaohsiung University of Science and Technology
Yi-Yun Chou: Department of Bioinformatics and Medical Engineering, Asia University
Yu-Chin Teng: Center for Precision Health Research, Asia University
Shalini Gupta: CGD Health Pvt. Ltd
Omkar Panchal: CGD Health Pvt. Ltd
Divyabharathy Ramesh Nadar: Knowledge Discovery and Data Mining Lab, Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Kaohsiung University of Science and Technology
Wei-Hsiang Liao: Department of Bioinformatics and Medical Engineering, Asia University
Yu-Chuan Lin: Department of Bioinformatics and Medical Engineering, Asia University
Zi-Rui Zhao: Intelligent System Laboratory, Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Kaohsiung University of Science and Technology
Richard Tzong-Han Tsai: Department of Computer Science and Information Engineering, National Central University
Yung-Chun Chang: Graduate Institute of Data Science, College of Management, Taipei Medical University
Jitendra Jonnagaddala: School of Population health, University of New South Wales

DOI: https://doi.org/10.1038/s41746-025-01921-7
Journal volume & issue: Vol. 8, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Secondary use of electronic health record notes enhances clinical outcomes and personalized medicine, but risks sensitive health information (SHI) exposure. Inconsistent time formats hinder interpretation, necessitating deidentification and temporal normalization. The SREDH/AI CUP 2023 competition explored large language models (LLMs) for these tasks using 3,244 pathology reports with surrogated SHIs and normalized dates. The competition drew 291 teams; the top teams achieved macro-F1 scores >0.8. Results were presented at the IW-DMRN workshop in 2024. Notably, 77.2% used LLMs, highlighting their growing role in healthcare. This study compares competition results with in-context learning and fine-tuned LLMs. Findings show that fine-tuning, especially with lower-rank adaptation, boosts performance but plateaus or degrades in models over 6 B parameters due to overfitting. Our findings highlight the value of data augmentation, training strategies, and hybrid approaches. Effective LLM-based deidentification requires balancing performance with legal and ethical demands, ensuring privacy and interpretability in regulated healthcare settings.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/

About the journal