Seasonal Prediction of Summer Precipitation in the Middle and Lower Reaches of the Yangtze River Valley: Comparison of Machine Learning and Climate Model Predictions

Chentao He; Jiangfeng Wei; Yuanyuan Song; Jing-Jia Luo

doi:10.3390/w13223294

Water (Nov 2021)

Affiliations

Chentao He: Changwang School of Honors, Nanjing University of Information Science and Technology, Nanjing 210044, China
Jiangfeng Wei: School of Atmospheric Sciences, Nanjing University of Information Science and Technology, Nanjing 210044, China
Yuanyuan Song: School of Atmospheric Sciences, Nanjing University of Information Science and Technology, Nanjing 210044, China
Jing-Jia Luo: School of Atmospheric Sciences, Nanjing University of Information Science and Technology, Nanjing 210044, China

DOI: https://doi.org/10.3390/w13223294
Journal volume & issue: Vol. 13, no. 22, p. 3294

Abstract

The middle and lower reaches of the Yangtze River valley (YRV), which are among the most densely populated regions in China, are subject to frequent flooding. In this study, a predictor importance analysis model was used to rank and select predictors, and five methods (multiple linear regression (MLR), decision tree (DT), random forest (RF), backpropagation neural network (BPNN), and convolutional neural network (CNN)) were used to predict the interannual variation of summer precipitation over the middle and lower reaches of the YRV. Predictions from eight climate models were used for comparison. Of the five tested methods, RF demonstrated the best predictive skill. When the RF prediction was started in December, the month in which its skill was highest, the 70-year correlation coefficient from cross-validation of the average predictions was 0.473. Using the same five predictors in December 2019, the RF model successfully predicted the YRV wet anomaly in summer 2020, although the predicted anomaly had a weaker amplitude. The enhanced warm pool area in the Indian Ocean was found to be the most important causal factor. The BPNN and CNN methods demonstrated the poorest performance. The RF, DT, and climate models all showed higher prediction skill when the predictions started in winter than when they started in early spring, and the RF, DT, and MLR methods all showed better prediction skill than the numerical climate models. A lack of training data was a factor that limited the performance of the machine learning methods. Future studies should use deep learning methods to take full advantage of the potential of ocean, land, sea ice, and other factors for more accurate climate predictions.
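
The skill metric quoted in the abstract, a correlation coefficient obtained from cross-validation, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic, a single-predictor least-squares line stands in for the random forest, and leave-one-out is assumed as the cross-validation scheme because the abstract does not say which scheme was used. The function names are hypothetical and are not taken from the paper.

```python
import random


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum(
        (a - mx) ** 2 for a in x
    )
    return slope, my - slope * mx


def loo_predictions(x, y):
    """Predict each year from a model trained on all the other years."""
    preds = []
    for i in range(len(x)):
        x_train = x[:i] + x[i + 1:]
        y_train = y[:i] + y[i + 1:]
        slope, intercept = fit_line(x_train, y_train)
        preds.append(slope * x[i] + intercept)
    return preds


# Synthetic stand-in data (NOT the paper's data): 70 "years" of one
# hypothetical winter predictor and a noisy summer rainfall anomaly.
rng = random.Random(0)
n_years = 70
predictor = [rng.gauss(0.0, 1.0) for _ in range(n_years)]
rainfall = [0.6 * p + rng.gauss(0.0, 1.0) for p in predictor]

preds = loo_predictions(predictor, rainfall)
skill = pearson(preds, rainfall)
print(f"Leave-one-out cross-validated correlation: {skill:.3f}")
```

Because each year is predicted by a model that never saw that year, the resulting correlation measures out-of-sample skill rather than goodness of fit, which is what makes such a score comparable with hindcasts from climate models.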

Published in Water

ISSN: 2073-4441 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Hydraulic engineering; Technology: Environmental technology. Sanitary engineering: Water supply for domestic and industrial purposes
Website: http://www.mdpi.com/journal/water/
