Correcting Measurement Error in Satellite Aerosol Optical Depth with Machine Learning for Modeling PM2.5 in the Northeastern USA

Allan C. Just; Margherita M. De Carli; Alexandra Shtein; Michael Dorman; Alexei Lyapustin; Itai Kloog

doi:10.3390/rs10050803

Remote Sensing (May 2018)

Affiliations

Allan C. Just: Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
Margherita M. De Carli: Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
Alexandra Shtein: Department of Geography and Environmental Development, Ben-Gurion University of the Negev, Beer Sheva 8410501, Israel
Michael Dorman: Department of Geography and Environmental Development, Ben-Gurion University of the Negev, Beer Sheva 8410501, Israel
Alexei Lyapustin: National Aeronautics and Space Administration (NASA) Goddard Space Flight Center (GSFC), Greenbelt, MD 20771, USA
Itai Kloog: Department of Geography and Environmental Development, Ben-Gurion University of the Negev, Beer Sheva 8410501, Israel

DOI: https://doi.org/10.3390/rs10050803
Journal volume & issue: Vol. 10, no. 5, p. 803

Abstract

Satellite-derived estimates of aerosol optical depth (AOD) are key predictors in particulate air pollution models. The multi-step retrieval algorithms that estimate AOD also produce quality control variables, but these have not been systematically used to address the measurement error in AOD. We compare three machine-learning methods, namely random forests, gradient boosting, and extreme gradient boosting (XGBoost), to characterize and correct measurement error in the Multi-Angle Implementation of Atmospheric Correction (MAIAC) 1 × 1 km AOD product for the Aqua and Terra satellites across the Northeastern/Mid-Atlantic USA, measured against collocated observations from 79 ground-based AERONET stations over 14 years. Models included 52 quality control, land use, meteorology, and spatially-derived features. Variable importance measures suggest that relative azimuth, AOD uncertainty, and the AOD difference in 30–210 km moving windows are among the most important features for predicting measurement error. XGBoost outperformed the other machine-learning approaches, decreasing the root mean squared error in withheld testing data by 43% and 44% for Aqua and Terra, respectively. After correction using XGBoost, the correlation of collocated AOD and daily PM2.5 monitors across the region increased by 10 and 9 percentage points for Aqua and Terra, respectively. We demonstrate how machine learning with quality control and spatial features substantially improves satellite-derived AOD products for air pollution modeling.
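
The correction workflow summarized in the abstract can be sketched roughly as follows: treat the difference between satellite AOD and the collocated ground reference as the target, predict it from auxiliary features, subtract the prediction, and compare root mean squared error on withheld data. This is only an illustrative sketch under loud assumptions. The study used XGBoost with 52 features; here a one-feature ordinary least-squares fit stands in for the learner so the example needs only the standard library. All data are synthetic, and the names `ref`, `sat`, `feat`, `fit_line`, and `rmse` are hypothetical.

```python
import math
import random


def rmse(pred, truth):
    """Root mean squared error between two equal-length sequences."""
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(pred, truth)) / len(truth))


def fit_line(x, y):
    """Ordinary least-squares slope and intercept (stand-in for XGBoost)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


random.seed(0)
n = 400

# Synthetic data (assumption): reference AOD, one quality-control-like
# feature, and a satellite AOD whose error grows with that feature.
ref = [random.uniform(0.05, 0.6) for _ in range(n)]
feat = [random.uniform(0.0, 1.0) for _ in range(n)]
sat = [r + 0.2 * f + random.gauss(0.0, 0.03) for r, f in zip(ref, feat)]

train, test = range(0, 300), range(300, n)

# Target is the measurement error: satellite minus reference.
err_train = [sat[i] - ref[i] for i in train]
slope, intercept = fit_line([feat[i] for i in train], err_train)

# Correct withheld satellite values by subtracting the predicted error.
corrected = [sat[i] - (slope * feat[i] + intercept) for i in test]
ref_test = [ref[i] for i in test]

rmse_raw = rmse([sat[i] for i in test], ref_test)
rmse_corr = rmse(corrected, ref_test)
reduction = 100.0 * (rmse_raw - rmse_corr) / rmse_raw

print(f"RMSE raw: {rmse_raw:.3f}, corrected: {rmse_corr:.3f}, "
      f"reduction: {reduction:.0f}%")
```

The percentage printed here reflects only the synthetic setup and has no bearing on the 43% and 44% reductions reported for Aqua and Terra.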

Published in Remote Sensing

ISSN: 2072-4292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science
Website: http://www.mdpi.com/journal/remotesensing/
