Under-specification as the source of ambiguity and vagueness in narrative phenotype algorithm definitions

Jingzhi Yu; Jennifer A. Pacheco; Anika S. Ghosh; Yuan Luo; Chunhua Weng; Ning Shang; Barbara Benoit; David S. Carrell; Robert J. Carroll; Ozan Dikilitas; Robert R. Freimuth; Vivian S. Gainer; Hakon Hakonarson; George Hripcsak; Iftikhar J. Kullo; Frank Mentch; Shawn N. Murphy; Peggy L. Peissig; Andrea H. Ramirez; Nephi Walton; Wei-Qi Wei; Luke V. Rasmussen

doi:10.1186/s12911-022-01759-z

BMC Medical Informatics and Decision Making (Jan 2022)

Affiliations

Jingzhi Yu: Center for Health Information Partnerships (CHIP), Northwestern University Feinberg School of Medicine
Jennifer A. Pacheco: Northwestern University Feinberg School of Medicine
Anika S. Ghosh: Northwestern University Feinberg School of Medicine
Yuan Luo: Northwestern University Feinberg School of Medicine
Chunhua Weng: Department of Biomedical Informatics, Columbia University
Ning Shang: Department of Biomedical Informatics, Columbia University
Barbara Benoit: Research IS and Computing, Massachusetts General Hospital Brigham
David S. Carrell: Kaiser Permanente Washington Health Research Institute
Robert J. Carroll: Department of Biomedical Informatics, Vanderbilt University Medical Center
Ozan Dikilitas: Department of Cardiovascular Medicine, Mayo Clinic
Robert R. Freimuth: Department of Health Sciences Research, Mayo Clinic
Vivian S. Gainer: Research IS and Computing, Massachusetts General Hospital Brigham
Hakon Hakonarson: Center for Applied Genomics, Children’s Hospital of Philadelphia
George Hripcsak: Department of Biomedical Informatics, Columbia University
Iftikhar J. Kullo: Department of Cardiovascular Medicine, Mayo Clinic
Frank Mentch: Center for Applied Genomics, Children’s Hospital of Philadelphia
Shawn N. Murphy: Research IS and Computing, Massachusetts General Hospital Brigham
Peggy L. Peissig: Biomedical Informatics Research Center, Marshfield Clinic Research Institute
Andrea H. Ramirez: Department of Biomedical Informatics, Vanderbilt University Medical Center
Nephi Walton: Intermountain Precision Genomics, Intermountain Healthcare
Wei-Qi Wei: Department of Biomedical Informatics, Vanderbilt University Medical Center
Luke V. Rasmussen: Department of Preventive Medicine, Northwestern University Feinberg School of Medicine

Journal volume & issue: Vol. 22, no. 1
pp. 1 – 9

Abstract

Introduction: Currently, one of the commonly used methods for disseminating electronic health record (EHR)-based phenotype algorithms is providing a narrative description of the algorithm logic, often accompanied by flowcharts. A challenge with this mode of dissemination is the potential for under-specification in the algorithm definition, which leads to ambiguity and vagueness.

Methods: This study examines incidents of under-specification that occurred during the implementation of 34 narrative phenotyping algorithms in the electronic Medical Record and Genomics (eMERGE) network. We reviewed the online communication history between algorithm developers and implementers within the Phenotype Knowledge Base (PheKB) platform, where questions could be raised and answered regarding the intended implementation of a phenotype algorithm.

Results: We developed a taxonomy of under-specification categories via an iterative review process between two groups of annotators. Under-specifications that lead to ambiguity and vagueness were consistently found across narrative phenotype algorithms developed by all involved eMERGE sites.

Discussion and conclusion: Our findings highlight that under-specification is an impediment to the accuracy and efficiency of the implementation of current narrative phenotyping algorithms, and we propose approaches for mitigating these issues and improved methods for disseminating EHR phenotyping algorithms.

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com
