Water Practice and Technology (Oct 2021)

Comparision of performance of multi criteria decision making ensemble-clustering algorithms in rainfall frequency analysis

  • Nilotpal Debbarma,
  • Parthasarathi Choudhury,
  • Parthajit Roy

DOI
https://doi.org/10.2166/wpt.2021.086
Journal volume & issue
Vol. 16, no. 4
pp. 1446 – 1464

Abstract

Read online

Non-availability of adequate extreme rainfall information at any place of interest are solved using regionalization where subjective grouping of similar attributes of nearby gauged stations is performed. K-Means and Fuzzy C-Means are commonly used methods in regionalization of rainfall, but application of genetic algorithms is very rarely explored. Genetic algorithms (GA) are highly efficient evolutionary algorithms, and through an appropriate objective function can effectively achieve the purpose of clustering. In the present study, Davies-Bouldin index is considered and validation is performed using a set of validation measures. Taking into account the varied output obtained in each validation measure, an ensemble approach involving multi criteria decision making is applied to obtain optimal ranked solutions, and the procedure is extended to K-Means and Fuzzy C-Means for comparison. From the results obtained, GA-based clustering is found to outperform the other two algorithms in formation of homogenous regions with better performance in leave-one-out cross validation (LOOCV) test and sensitivity analysis. Accuracy of regional growth curves of regions assessed using regional relative bias and RMSE suggest low uncertainty and accurate quantile estimates in GA regions. Further, information transfer index based on entropy evaluated among GA regions is found to be highest and K-Means lowest. HIGHLIGHTS Comparison of genetic algorithm-based clustering to K-Means and Fuzzy C-Means with ensemble MCDM technique.; Optimum cluster based on several cluster validation indices and MCDM.; Sensitivity analysis of MCDM rankings.; Regional growth curve comparision for regions delineated by all methods.; Information transfer among stations in cluster regions with entropy based information transfer index.;

Keywords