Improving Spatial Disaggregation of Crop Yield by Incorporating Machine Learning with Multisource Data: A Case Study of Chinese Maize Yield

Shuo Chen; Weihang Liu; Puyu Feng; Tao Ye; Yuchi Ma; Zhou Zhang

doi:10.3390/rs14102340

Remote Sensing (May 2022)

Improving Spatial Disaggregation of Crop Yield by Incorporating Machine Learning with Multisource Data: A Case Study of Chinese Maize Yield

Shuo Chen,
Weihang Liu,
Puyu Feng,
Tao Ye,
Yuchi Ma,
Zhou Zhang

Affiliations

Shuo Chen: State Key Laboratory of Earth Surface Processes and Resource Ecology (ESPRE), Beijing Normal University, Beijing 100875, China
Weihang Liu: State Key Laboratory of Earth Surface Processes and Resource Ecology (ESPRE), Beijing Normal University, Beijing 100875, China
Puyu Feng: College of Land Science and Technology, China Agricultural University, Beijing 100193, China
Tao Ye: State Key Laboratory of Earth Surface Processes and Resource Ecology (ESPRE), Beijing Normal University, Beijing 100875, China
Yuchi Ma: Department of Biological Systems Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
Zhou Zhang: Department of Biological Systems Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA

DOI: https://doi.org/10.3390/rs14102340
Journal volume & issue: Vol. 14, no. 10
p. 2340

Abstract

Read online

Spatially explicit crop yield datasets with continuous long-term series are essential for understanding the spatiotemporal variation of crop yield and the impact of climate change on it. There are several spatial disaggregation methods to generate gridded yield maps, but these either use an oversimplified approach with only a couple of ancillary data or an overly complex approach with limited flexibility and scalability. This study developed a spatial disaggregation method using improved spatial weights generated from machine learning. When applied to Chinese maize yield, extreme gradient boosting (XGB) derived the best prediction results, with a cross-validation coefficient of determination (R2) of 0.81 at the municipal level. The disaggregated yield at 1 km grids could explain 54% of the variance of the county-level statistical yield, which is superior to the existing gridded maize yield dataset in China. At the site level, the disaggregated yields also showed much better agreement with observations than the existing gridded maize yield dataset. This lightweight method is promising for generating spatially explicit crop yield datasets with finer resolution and higher accuracy, and for providing necessary information for maize production risk assessment in China under climate change.

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/

About the journal

Abstract

Keywords