Spatiotemporal lagging of predictors improves machine learning estimates of atmosphere–forest CO<sub>2</sub> exchange

M. Kämäräinen; J.-P. Tuovinen; M. Kulmala; I. Mammarella; J. Aalto; J. Aalto; H. Vekuri; A. Lohila; A. Lohila; A. Lintunen; A. Lintunen

doi:10.5194/bg-20-897-2023

Biogeosciences (Mar 2023)

Spatiotemporal lagging of predictors improves machine learning estimates of atmosphere–forest CO<sub>2</sub> exchange

M. Kämäräinen,
J.-P. Tuovinen,
M. Kulmala,
I. Mammarella,
J. Aalto,
J. Aalto,
H. Vekuri,
A. Lohila,
A. Lohila,
A. Lintunen,
A. Lintunen

Affiliations

M. Kämäräinen: Weather and Climate Change Impact Research, Finnish Meteorological Institute, Helsinki, Finland
J.-P. Tuovinen: Climate System Research, Finnish Meteorological Institute, Helsinki, Finland
M. Kulmala: Institute for Atmospheric and Earth System Research/Physics, Faculty of Science, University of Helsinki, Helsinki, Finland
I. Mammarella: Institute for Atmospheric and Earth System Research/Physics, Faculty of Science, University of Helsinki, Helsinki, Finland
J. Aalto: Weather and Climate Change Impact Research, Finnish Meteorological Institute, Helsinki, Finland
J. Aalto: Department of Geosciences and Geography, University of Helsinki, Helsinki, Finland
H. Vekuri: Climate System Research, Finnish Meteorological Institute, Helsinki, Finland
A. Lohila: Climate System Research, Finnish Meteorological Institute, Helsinki, Finland
A. Lohila: Institute for Atmospheric and Earth System Research/Physics, Faculty of Science, University of Helsinki, Helsinki, Finland
A. Lintunen: Institute for Atmospheric and Earth System Research/Physics, Faculty of Science, University of Helsinki, Helsinki, Finland
A. Lintunen: Institute for Atmospheric and Earth System Research/Forest Sciences, Faculty of Agriculture and Forestry, University of Helsinki, Helsinki, Finland

DOI: https://doi.org/10.5194/bg-20-897-2023
Journal volume & issue: Vol. 20
pp. 897 – 909

Abstract

Read online

Accurate estimates of net ecosystem CO2 exchange (NEE) would improve the understanding of natural carbon sources and sinks and their role in the regulation of global atmospheric carbon. In this work, we use and compare the random forest (RF) and the gradient boosting (GB) machine learning (ML) methods for predicting year-round 6 h NEE over 1996–2018 in a pine-dominated boreal forest in southern Finland and analyze the predictability of NEE. Additionally, aggregation to weekly NEE values was applied to get information about longer term behavior of the method. The meteorological ERA5 reanalysis variables were used as predictors. Spatial and temporal neighborhood (predictor lagging) was used to provide the models more data to learn from, which was found to improve considerably the accuracy of both ML approaches compared to using only the nearest grid cell and time step. Both ML methods can explain temporal variability of NEE in the observational site of this study with meteorological predictors, but the GB method was more accurate. Only minor signs of overfitting could be detected for the GB algorithm when redundant variables were included. The accuracy of the approaches, measured mainly using cross-validated R2 score between the model result and the observed NEE, was high, reaching a best estimate value of 0.92 for GB and 0.88 for RF. In addition to the standard RF approach, we recommend using GB for modeling the CO2 fluxes of the ecosystems due to its potential for better performance.

Published in Biogeosciences

ISSN: 1726-4170 (Print); 1726-4189 (Online)
Publisher: Copernicus Publications
Country of publisher: Germany
LCC subjects: Science: Biology (General): Ecology; Science: Biology (General): Life; Science: Geology
Website: http://www.biogeosciences.net

About the journal