Remote Sensing (Aug 2022)

Can Machine Learning Algorithms Successfully Predict Grassland Aboveground Biomass?

  • Yue Wang,
  • Rongzhu Qin,
  • Huzi Cheng,
  • Tiangang Liang,
  • Kaiping Zhang,
  • Ning Chai,
  • Jinlong Gao,
  • Qisheng Feng,
  • Mengjing Hou,
  • Jie Liu,
  • Chenli Liu,
  • Wenjuan Zhang,
  • Yanjie Fang,
  • Jie Huang,
  • Feng Zhang

DOI
https://doi.org/10.3390/rs14163843
Journal volume & issue
Vol. 14, no. 16
p. 3843

Abstract

Read online

The timely and accurate estimation of grassland aboveground biomass (AGB) is important. Machine learning (ML) has been widely used in the past few decades to deal with complex relationships. In this study, based on an 11-year period (2005–2015) of AGB data (1620 valid AGB measurements) on the Three-River Headwaters Region (TRHR), combined with remote sensing data, weather data, terrain data, and soil data, we compared the predictive performance of a linear statistical method, machine learning (ML) methods, and evaluated their temporal and spatial scalability. The results show that machine learning can predict grassland biomass well, and the existence of an independent validation set can help us better understand the prediction performance of the model. Our findings show the following: (1) The random forest (RF) based on variables obtained through stepwise regression analysis (SRA) was the best model (R2vad = 0.60, RMSEvad = 1245.85 kg DW (dry matter weight)/ha, AIC = 5583.51, and BIC = 5631.10). It also had the best predictive capability of years with unknown areas (R2indep = 0.50, RMSEindep = 1332.59 kg DW/ha). (2) Variable screening improved the accuracy of all of the models. (3) All models’ predictive accuracy varied between 0.45 and 0.60, and the RMSE values were lower than 1457.26 kg DW/ha, indicating that the results were reliably accurate.

Keywords