Analysis of extreme annual rainfall in North-Eastern India using machine learning techniques

Shivam Agarwal; Disha Mukherjee; Nilotpal Debbarma

doi:10.2166/aqua.2023.016

Aqua (Dec 2023)

Analysis of extreme annual rainfall in North-Eastern India using machine learning techniques

Shivam Agarwal,
Disha Mukherjee,
Nilotpal Debbarma

Affiliations

Shivam Agarwal: Civil Engineering Department, National Institute of Technology Silchar, Assam 788010, India
Disha Mukherjee: Civil Engineering Department, National Institute of Technology Agartala, Barjala, Jirania, Agartala 799046, India
Nilotpal Debbarma: Civil Engineering Department, National Institute of Technology Agartala, Barjala, Jirania, Agartala 799046, India

DOI: https://doi.org/10.2166/aqua.2023.016
Journal volume & issue: Vol. 72, no. 12
pp. 2201 – 2215

Abstract

Read online

The machine learning techniques of Multiple Linear Regression (MLR), Generalized Additive Models (GAMs), and the Random Forest (RF) Method have been used to analyze the extreme annual rainfall in the six states of Assam, Meghalaya, Tripura, Mizoram, Manipur, and Nagaland in North-Eastern (NE) India. Latitude, longitude, altitude, and temperature were the covariates that were used in this study. Ordinary Kriging was used to interpolate the predicted outcomes of each dataset. Statistical metrics like Mean Absolute Errors (MAE), Root Mean Square Error (RMSE), Coefficients of Determination (COD-R2), and Nash–Sutcliffe Efficiency (NSE) were also assessed. When compared to satellite rainfall data, all techniques performed significantly better for ground rainfall data. For prediction, GAM's predicted rainfall values triumph over MLR or RF. RF ranks a close second, while the linearity of MLR prohibits it from making precise predictions for a physical phenomenon like rainfall. The MAE and RMSE of GAM forecasts are significantly lower than those of MLR and RF in most circumstances. Additionally, the COD and NSE of GAM predictions are significantly better than both MLR and RF in most cases, showing that GAM, out of MLR, GAM, and RF, is the best model for predicting rain in our research area. HIGHLIGHTS The map for RF + OK is much more realistic.; The testing (rain gauge data) maps were much better for all the methods than their training counterparts.; Temperature inclusion has an effect on latitude, longitude, and elevation at each point as an attribute.; The use of splines in the linear model allows GAMs to get around linearity constraints.; Comparing the MAE and RMSE of GAM predictions for the various datasets.;

Published in Aqua

ISSN: 2709-8028 (Print); 2709-8036 (Online)
Publisher: IWA Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Environmental technology. Sanitary engineering; Geography. Anthropology. Recreation: Environmental sciences
Website: https://iwaponline.com/aqua

About the journal

Abstract

Keywords