Remote Sensing (Mar 2024)
County-Level Poverty Evaluation Using Machine Learning, Nighttime Light, and Geospatial Data
Abstract
The accurate and timely acquisition of poverty information within a specific region is crucial for formulating effective development policies. Nighttime light (NL) remote sensing data and geospatial information provide the means for conducting precise and timely evaluations of poverty levels. However, current assessment methods predominantly rely on NL data, and the potential of combining multi-source geospatial data for poverty identification remains underexplored. Therefore, we propose an approach that assesses poverty based on both NL and geospatial data using machine learning models. This study uses the multidimensional poverty index (MPI), derived from county-level statistical data with social, economic, and environmental dimensions, as an indicator to assess poverty levels. We extracted a total of 17 independent variables from NL and geospatial data. Machine learning models (random forest (RF), support vector machine (SVM), adaptive boosting (AdaBoost), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM)) and traditional linear regression (LR) were used to model the relationship between the MPI and independent variables. The results indicate that the RF model achieved significantly higher accuracy, with a coefficient of determination (R2) of 0.928, a mean absolute error (MAE) of 0.030, and a root mean square error (RMSE) of 0.037. The top five most important variables comprise two (NL_MAX and NL_MIN) from the NL data and three (POI_Ed, POI_Me, and POI_Ca) from the geographical spatial data, highlighting the significant roles of NL data and geographical data in MPI modeling. The MPI map that was generated by the RF model depicted the detailed spatial distribution of poverty in Fujian province. This study presents an approach to county-level poverty evaluation that integrates NL and geospatial data using a machine learning model, which can contribute to a more reliable and efficient estimate of poverty.
Keywords