Remote Sensing (Apr 2022)

Multi-Source Precipitation Data Merging for Heavy Rainfall Events Based on Cokriging and Machine Learning Methods

  • Junmin Zhang,
  • Jianhui Xu,
  • Xiaoai Dai,
  • Huihua Ruan,
  • Xulong Liu,
  • Wenlong Jing

DOI
https://doi.org/10.3390/rs14071750
Journal volume & issue
Vol. 14, no. 7
p. 1750

Abstract

Read online

Gridded precipitation data with a high spatiotemporal resolution are of great importance for studies in hydrology, meteorology, and agronomy. Observational data from meteorological stations cannot accurately reflect the spatiotemporal distribution and variations of precipitation over a large area. Meanwhile, radar-derived precipitation data are restricted by low accuracy in areas of complex terrain and satellite-based precipitation data by low spatial resolution. Therefore, hourly precipitation models were employed to merge data from meteorological stations, Radar, and satellites; the models used five machine learning algorithms (XGBoost, gradient boosting decision tree, random forests (RF), LightGBM, and multiple linear regression (MLR)), as well as the CoKriging method. In the north of Guangdong Province, data of four heavy rainfall events in 2018 were processed with geographic data to obtain merged hourly precipitation data. The CoKriging method secured the best prediction of spatial distribution of accumulated precipitation, followed by the tree-based machine learning (ML) algorithms, and significantly, the prediction of MLR deviated from the actual pattern. All machine learning methods showed poor performances for timepoints with little precipitation during the heavy rainfall events. The tree-based ML method showed poor performance at some timepoints when precipitation was over-related to latitude, longitude, and distance from the coast.

Keywords