npj Natural Hazards (Dec 2024)

Simulating flood risk in Tampa Bay using a machine learning driven approach

  • Hemal Dey,
  • Md Munjurul Haque,
  • Wanyun Shao,
  • Matthew VanDyke,
  • Feng Hao

DOI
https://doi.org/10.1038/s44304-024-00045-4
Journal volume & issue
Vol. 1, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Machine learning (ML) models can simulate flood risk by identifying critical non-linear relationships between flood damage locations and flood risk factors (FRFs). To explore it, Tampa Bay, Florida, is selected as a test site. The study’s goal is to simulate flood risk and identify dominant FRFs using historical flood damage data as target variable, with 16 FRFs as predictor variables. Five different ML models such as decision tree (DT), support vector machine (SVM), adaptive boosting (AdaBoost), extreme gradient boosting (XGBoost), and random forest (RF) were adopted. RF classifies 2.42% of Tampa Bay as very high risk and 2.54% as high risk, while XGBoost classifies 3.85% as very high risk and 1.11% as high risk. Moreover, the communities reside at low altitudes and near the waterbodies, with dense man-made infrastructure, are at high flood risk. This study introduces a comprehensive framework for flood risk assessment and helps policymakers mitigate flood risk.