Environmental Data Science (Jan 2022)

A spatiotemporal stochastic climate model for benchmarking causal discovery methods for teleconnections

  • Xavier-Andoni Tibau,
  • Christian Reimers,
  • Andreas Gerhardus,
  • Joachim Denzler,
  • Veronika Eyring,
  • Jakob Runge

DOI
https://doi.org/10.1017/eds.2022.11
Journal volume & issue
Vol. 1

Abstract

Read online

Teleconnections that link climate processes at widely separated spatial locations form a key component of the climate system. Their analysis has traditionally been based on means, climatologies, correlations, or spectral properties, which cannot always reveal the dynamical mechanisms between different climatological processes. More recently, causal discovery methods based either on time series at grid locations or on modes of variability, estimated through dimension-reduction methods, have been introduced. A major challenge in the development of such analysis methods is a lack of ground truth benchmark datasets that have facilitated improvements in many parts of machine learning. Here, we present a simplified stochastic climate model that outputs gridded data and represents climate modes and their teleconnections through a spatially aggregated vector-autoregressive model. The model is used to construct benchmarks and evaluate a range of analysis methods. The results highlight that the model can be successfully used to benchmark different causal discovery methods for spatiotemporal data and show their strengths and weaknesses. Furthermore, we introduce a novel causal discovery method at the grid level and demonstrate that it has orders of magnitude better performance than the current approaches. Improved causal analysis tools for spatiotemporal climate data are pivotal to advance process-based understanding and climate model evaluation.

Keywords