Multi‐Model Prediction of West Nile Virus Neuroinvasive Disease With Machine Learning for Identification of Important Regional Climatic Drivers

Karen M. Holcomb; J. Erin Staples; Randall J. Nett; Charles B. Beard; Lyle R. Petersen; Stanley G. Benjamin; Benjamin W. Green; Hunter Jones; Michael A. Johansson

doi:10.1029/2023GH000906

GeoHealth (Nov 2023)

Affiliations

Karen M. Holcomb: Global Systems Laboratory National Oceanic and Atmospheric Administration Boulder CO USA
J. Erin Staples: Division of Vector‐Borne Diseases Centers for Disease Control and Prevention Fort Collins CO USA
Randall J. Nett: Division of Vector‐Borne Diseases Centers for Disease Control and Prevention Fort Collins CO USA
Charles B. Beard: Division of Vector‐Borne Diseases Centers for Disease Control and Prevention Fort Collins CO USA
Lyle R. Petersen: Division of Vector‐Borne Diseases Centers for Disease Control and Prevention Fort Collins CO USA
Stanley G. Benjamin: Global Systems Laboratory National Oceanic and Atmospheric Administration Boulder CO USA
Benjamin W. Green: Global Systems Laboratory National Oceanic and Atmospheric Administration Boulder CO USA
Hunter Jones: Climate Prediction Office National Oceanic and Atmospheric Administration Silver Spring MD USA
Michael A. Johansson: Division of Vector‐Borne Diseases Centers for Disease Control and Prevention San Juan PR USA

DOI: https://doi.org/10.1029/2023GH000906
Journal volume & issue: Vol. 7, no. 11

Abstract

West Nile virus (WNV) is the leading cause of mosquito‐borne illness in the continental United States (CONUS). Spatial heterogeneity in historical incidence, environmental factors, and complex ecology make prediction of spatiotemporal variation in WNV transmission challenging. Machine learning provides promising tools for identifying important variables in such situations. To predict annual WNV neuroinvasive disease (WNND) cases in CONUS (2015–2021), we fitted 10 probabilistic models ranging in complexity from naïve to machine learning algorithms and an ensemble. We made predictions in each of nine climate regions on a hexagonal grid and evaluated each model's predictive accuracy. Using the machine learning models (random forest and neural network), we identified the relative importance of predictors (historical WNND cases, climate anomalies, human demographics, and land use) and how their ranking varied across regions. We found that historical WNND cases and population density were among the most important factors, while anomalies in temperature and precipitation often had relatively low importance. Although the relative performance of each model varied across climatic regions, the magnitude of the difference between models was small. For all models except the naïve model, differences in performance relative to the baseline model (a negative binomial model fit per hexagon) were not significant. No model, including the ensemble and the more complex machine learning models, outperformed models based on historical case counts at the hexagon or region level; these historical models are good forecasting benchmarks. Further work is needed to assess whether predictive capacity can be improved beyond that of these historical baselines.
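As an illustration of what a historical-count baseline of this kind might look like, the sketch below fits a negative binomial distribution to one hexagon's past annual counts and returns predictive probabilities for next year's count. It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract says only that the baseline is a negative binomial model fit per hexagon, so the method-of-moments fit, the Poisson fallback, the function names, and the example counts are all hypothetical.

```python
import math
from statistics import mean, variance


def fit_negbin_moments(counts):
    """Method-of-moments negative binomial fit (an assumed fitting method).

    Returns (r, mu): size parameter r and mean mu. r is None when the
    counts are not overdispersed, in which case a Poisson is used instead.
    Needs at least two observations (sample variance).
    """
    mu = mean(counts)
    var = variance(counts)
    if mu == 0 or var <= mu:
        return None, mu
    # Negative binomial variance is mu + mu^2 / r, solved here for r.
    r = mu * mu / (var - mu)
    return r, mu


def predictive_pmf(k, r, mu):
    """Probability of observing exactly k cases under the fitted model."""
    if mu == 0:
        return 1.0 if k == 0 else 0.0
    if r is None:
        # Poisson fallback, computed on the log scale for stability.
        return math.exp(-mu + k * math.log(mu) - math.lgamma(k + 1))
    p = r / (r + mu)  # success probability giving mean mu
    log_pmf = (
        math.lgamma(k + r) - math.lgamma(r) - math.lgamma(k + 1)
        + r * math.log(p) + k * math.log1p(-p)
    )
    return math.exp(log_pmf)


# Hypothetical annual WNND counts for a single hexagon (not real data).
history = [0, 2, 1, 5, 0, 3, 9]
r, mu = fit_negbin_moments(history)
prob_zero_cases = predictive_pmf(0, r, mu)
```

Because the forecast is a full probability distribution over counts rather than a single number, it can be scored and compared against more complex models, which is what makes such a baseline usable as a benchmark.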

Published in GeoHealth

ISSN: 2471-1403 (Online)
Publisher: American Geophysical Union (AGU)
Country of publisher: United States
LCC subjects: Technology: Environmental technology. Sanitary engineering: Environmental protection
Website: https://agupubs.onlinelibrary.wiley.com/journal/24711403
