An in-depth evaluation of federated learning on biomedical natural language processing for information extraction

Le Peng; Gaoxiang Luo; Sicheng Zhou; Jiandong Chen; Ziyue Xu; Ju Sun; Rui Zhang

doi:10.1038/s41746-024-01126-4

npj Digital Medicine (May 2024)

An in-depth evaluation of federated learning on biomedical natural language processing for information extraction

Le Peng,
Gaoxiang Luo,
Sicheng Zhou,
Jiandong Chen,
Ziyue Xu,
Ju Sun,
Rui Zhang

Affiliations

Le Peng: Department of Computer Science and Engineering, University of Minnesota
Gaoxiang Luo: Department of Computer and Information Science, University of Pennsylvania
Sicheng Zhou: Institute for Health Informatics, University of Minnesota
Jiandong Chen: Institute for Health Informatics, University of Minnesota
Ziyue Xu: Nvidia Corporation
Ju Sun: Department of Computer Science and Engineering, University of Minnesota
Rui Zhang: Division of Computational Health Sciences, Department of Surgery, University of Minnesota

DOI: https://doi.org/10.1038/s41746-024-01126-4
Journal volume & issue: Vol. 7, no. 1
pp. 1 – 9

Abstract

Read online

Abstract Language models (LMs) such as BERT and GPT have revolutionized natural language processing (NLP). However, the medical field faces challenges in training LMs due to limited data access and privacy constraints imposed by regulations like the Health Insurance Portability and Accountability Act (HIPPA) and the General Data Protection Regulation (GDPR). Federated learning (FL) offers a decentralized solution that enables collaborative learning while ensuring data privacy. In this study, we evaluated FL on 2 biomedical NLP tasks encompassing 8 corpora using 6 LMs. Our results show that: (1) FL models consistently outperformed models trained on individual clients’ data and sometimes performed comparably with models trained with polled data; (2) with the fixed number of total data, FL models training with more clients produced inferior performance but pre-trained transformer-based models exhibited great resilience. (3) FL models significantly outperformed pre-trained LLMs with few-shot prompting.

Published in npj Digital Medicine

ISSN: 2398-6352 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.nature.com/npjdigitalmed/

About the journal