Prediction of persistent chronic cough in patients with chronic cough using machine learning

Wansu Chen; Michael Schatz; Yichen Zhou; Fagen Xie; Vishal Bali; Amar Das; Jonathan Schelfhout; Julie A. Stern; Robert S. Zeiger

doi:10.1183/23120541.00471-2022

ERJ Open Research (Mar 2023)

Prediction of persistent chronic cough in patients with chronic cough using machine learning

Wansu Chen,
Michael Schatz,
Yichen Zhou,
Fagen Xie,
Vishal Bali,
Amar Das,
Jonathan Schelfhout,
Julie A. Stern,
Robert S. Zeiger

Affiliations

Wansu Chen: Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, USA
Michael Schatz: Department of Allergy, Kaiser Permanente Southern California, San Diego, CA, USA
Yichen Zhou: Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, USA
Fagen Xie: Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, USA
Vishal Bali: Center for Observational and Real-World Evidence (CORE), Merck & Co., Inc., Kenilworth, NJ, USA
Amar Das: Center for Observational and Real-World Evidence (CORE), Merck & Co., Inc., Kenilworth, NJ, USA
Jonathan Schelfhout: Center for Observational and Real-World Evidence (CORE), Merck & Co., Inc., Kenilworth, NJ, USA
Julie A. Stern: Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, USA
Robert S. Zeiger: Department of Allergy, Kaiser Permanente Southern California, San Diego, CA, USA

DOI: https://doi.org/10.1183/23120541.00471-2022
Journal volume & issue: Vol. 9, no. 2

Abstract

Read online

Introduction The aim of this study was to develop and validate prediction models for risk of persistent chronic cough (PCC) in patients with chronic cough (CC). This was a retrospective cohort study. Methods Two retrospective cohorts of patients 18–85 years of age were identified for years 2011–2016: a specialist cohort which included CC patients diagnosed by specialists, and an event cohort which comprised CC patients identified by at least three cough events. A cough event could be a cough diagnosis, dispensing of cough medication or any indication of cough in clinical notes. Model training and validation were conducted using two machine-learning approaches and 400+ features. Sensitivity analyses were also conducted. PCC was defined as a CC diagnosis or any two (specialist cohort) or three (event cohort) cough events in year 2 and again in year 3 after the index date. Results 8581 and 52 010 patients met the eligibility criteria for the specialist and event cohorts (mean age 60.0 and 55.5 years), respectively. 38.2% and 12.4% of patients in the specialist and event cohorts, respectively, developed PCC. The utilisation-based models were mainly based on baseline healthcare utilisations associated with CC or respiratory diseases, while the diagnosis-based models incorporated traditional parameters including age, asthma, pulmonary fibrosis, obstructive pulmonary disease, gastro-oesophageal reflux, hypertension and bronchiectasis. All final models were parsimonious (five to seven predictors) and moderately accurate (area under the curve: 0.74–0.76 for utilisation-based models and 0.71 for diagnosis-based models). Conclusions The application of our risk prediction models may be used to identify high-risk PCC patients at any stage of the clinical testing/evaluation to facilitate decision making.

Published in ERJ Open Research

ISSN: 2312-0541 (Online)
Publisher: European Respiratory Society
Country of publisher: United Kingdom
LCC subjects: Medicine
Website: https://openres.ersjournals.com/

About the journal