Machine-learning based reconstructions of primary and secondary climate variables from North American and European fossil pollen data

J. Sakari Salonen; Mikko Korpela; John W. Williams; Miska Luoto

doi:10.1038/s41598-019-52293-4

Scientific Reports (Nov 2019)

Affiliations

J. Sakari Salonen: Department of Geosciences and Geography, University of Helsinki
Mikko Korpela: Department of Geosciences and Geography, University of Helsinki
John W. Williams: Department of Geography and Center for Climatic Research, University of Wisconsin–Madison
Miska Luoto: Department of Geosciences and Geography, University of Helsinki

DOI: https://doi.org/10.1038/s41598-019-52293-4
Journal volume & issue: Vol. 9, no. 1, pp. 1–13

Abstract


We test several quantitative algorithms as palaeoclimate reconstruction tools for North American and European fossil pollen data, using both classical methods and newer machine-learning approaches based on regression tree ensembles and artificial neural networks. We focus on the reconstruction of secondary climate variables (here, January temperature and annual water balance), as their smaller ecological influence relative to the primary variable (July temperature) presents special challenges to palaeo-reconstructions. We test the pollen–climate models using a novel and comprehensive cross-validation approach, running a series of h-block cross-validations using h values of 100–1500 km. Our study illustrates major benefits of this variable h-block cross-validation scheme: the effect of spatial autocorrelation is minimized, while the cross-validations with increasing h values can reveal instabilities in the calibration model and approximate the challenges faced in palaeo-reconstructions with poor modern analogues. We achieve well-performing calibration models for both primary and secondary climate variables, with boosted regression trees providing the most robust performance overall, while the palaeoclimate reconstructions from fossil datasets show major independent features for the primary and secondary variables. Our results suggest that with careful variable selection and consideration of ecological processes, robust reconstruction of both primary and secondary climate variables is possible.
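The abstract does not give implementation details for the h-block cross-validation, so the following is only a minimal sketch of the common leave-one-out formulation: each site is predicted from a model trained on the sites lying farther than h km away. The function names (`haversine_km`, `h_block_rmsep`, `least_squares_fit_predict`) are hypothetical, and the ordinary least-squares model is a stand-in chosen to keep the example dependency-free; it is not the boosted regression tree or neural network models tested in the paper.

```python
import numpy as np

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, an assumption for this sketch


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between points given in decimal degrees."""
    lat1, lon1, lat2, lon2 = map(np.radians, (lat1, lon1, lat2, lon2))
    a = (np.sin((lat2 - lat1) / 2) ** 2
         + np.cos(lat1) * np.cos(lat2) * np.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * np.arcsin(np.sqrt(np.clip(a, 0.0, 1.0)))


def least_squares_fit_predict(X_train, y_train, X_test):
    """Stand-in model (assumption): ordinary least squares with an intercept."""
    A = np.column_stack([np.ones(len(X_train)), X_train])
    coef, *_ = np.linalg.lstsq(A, y_train, rcond=None)
    return np.column_stack([np.ones(len(X_test)), X_test]) @ coef


def h_block_rmsep(X, y, lat, lon, h_km, fit_predict=least_squares_fit_predict):
    """Root-mean-square error of prediction under h-block cross-validation.

    For each test site, every site within h_km (including the test site
    itself, at distance 0) is withheld from the training set.
    """
    errors = []
    for i in range(len(y)):
        dist = haversine_km(lat[i], lon[i], lat, lon)
        train = dist > h_km
        if train.sum() < 2:
            continue  # too few distant sites to fit a model for this test site
        pred = fit_predict(X[train], y[train], X[i:i + 1])[0]
        errors.append(pred - y[i])
    if not errors:
        raise ValueError("h_km is too large: no test site has enough training sites")
    return float(np.sqrt(np.mean(np.square(errors))))
```

Running `h_block_rmsep` over a series of h values, for example `for h in (100, 500, 1000, 1500)`, gives the kind of error-versus-h curve the abstract describes: a model whose error rises sharply as h increases is leaning on nearby, spatially autocorrelated sites.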

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
