A Systematic Review of Natural Language Processing Methods and Applications in Thyroidology

Ricardo Loor-Torres, MD; Mayra Duran, MD; David Toro-Tobon, MD; Maria Mateo Chavez, MD; Oscar Ponce, MD; Cristian Soto Jacome, MD; Danny Segura Torres, MD; Sandra Algarin Perneth, MD; Victor Montori, BA; Elizabeth Golembiewski, PhD, MPH; Mariana Borras Osorio, MD; Jungwei W. Fan, PhD; Naykky Singh Ospina, MD; Yonghui Wu, PhD; Juan P. Brito, MD, MS

Mayo Clinic Proceedings: Digital Health (Jun 2024)

Affiliations

Ricardo Loor-Torres, MD: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN
Mayra Duran, MD: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN
David Toro-Tobon, MD: Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Mayo Clinic, Rochester, MN
Maria Mateo Chavez, MD: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN
Oscar Ponce, MD: University of Edinburgh, Edinburgh, Scotland, United Kingdom
Cristian Soto Jacome, MD: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN
Danny Segura Torres, MD: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN; University of Edinburgh, Edinburgh, Scotland, United Kingdom; Respiratory, Cardiovascular, and Renal Pathobiology and Bioengineering, Universitat de Barcelona, Spain
Sandra Algarin Perneth, MD: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN
Victor Montori, BA: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN
Elizabeth Golembiewski, PhD, MPH: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN
Mariana Borras Osorio, MD: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN
Jungwei W. Fan, PhD: Montefiore Health Center, Albert Einstein College of Medicine, New York, NY
Naykky Singh Ospina, MD: Department of Medicine, and Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN; Division of Endocrinology, Department of Medicine, University of Florida, Gainesville, FL
Yonghui Wu, PhD: Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL
Juan P. Brito, MD, MS: Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN; Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Mayo Clinic, Rochester, MN

Correspondence: Address to Juan P. Brito, MD, MS, Division of Endocrinology, Diabetes, Nutrition and Metabolism, Mayo Clinic, 200 First Street SW, Rochester, MN 55902.

Journal volume & issue: Vol. 2, no. 2, pp. 270–279

Abstract

This study aimed to review the application of natural language processing (NLP) in thyroid-related conditions and to summarize current challenges and potential future directions. We performed a systematic search of databases for studies describing NLP applications in thyroid conditions published in English between January 1, 2012, and November 4, 2022. In addition, we used a snowballing technique to identify studies that were missed in the initial search or published after the search window, up to April 1, 2023. For included studies, we extracted the NLP method (eg, rule-based, machine learning, deep learning, or hybrid), NLP application (eg, identification, classification, and automation), thyroid condition (eg, thyroid cancer, thyroid nodule, and functional or autoimmune disease), data source (eg, electronic health records, health forums, medical literature databases, or genomic databases), performance metrics, and stages of development. We identified 24 eligible NLP studies focusing on thyroid-related conditions. Deep learning-based methods were the most common (38%), followed by rule-based (21%) and traditional machine learning (21%) methods. Thyroid nodules (54%) and thyroid cancer (29%) were the primary conditions under investigation. Electronic health records were the dominant data source (17/24, 71%), with imaging reports being the most frequently used record type (15/17, 88%). Interest in NLP applications for thyroid-related studies is increasing; most of these studies address thyroid nodules and use deep learning-based methodologies with limited external validation. However, none of the reviewed NLP applications have reached clinical practice. Several limitations, including inconsistent clinical documentation and limited model portability, need to be addressed to promote the evaluation and implementation of NLP applications to support patient care in thyroidology.

Published in Mayo Clinic Proceedings: Digital Health

ISSN: 2949-7612 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.sciencedirect.com/journal/mayo-clinic-proceedings-digital-health