A unifying perspective on non-stationary kernels for deeper Gaussian processes

Marcus M. Noack; Hengrui Luo; Mark D. Risser

doi:10.1063/5.0176963

APL Machine Learning (Mar 2024)

A unifying perspective on non-stationary kernels for deeper Gaussian processes

Marcus M. Noack,
Hengrui Luo,
Mark D. Risser

Affiliations

Marcus M. Noack: Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
Hengrui Luo: Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
Mark D. Risser: Climate and Ecosystem Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA

DOI: https://doi.org/10.1063/5.0176963
Journal volume & issue: Vol. 2, no. 1
pp. 010902 – 010902-27

Abstract

Read online

The Gaussian process (GP) is a popular statistical technique for stochastic function approximation and uncertainty quantification from data. GPs have been adopted into the realm of machine learning (ML) in the last two decades because of their superior prediction abilities, especially in data-sparse scenarios, and their inherent ability to provide robust uncertainty estimates. Even so, their performance highly depends on intricate customizations of the core methodology, which often leads to dissatisfaction among practitioners when standard setups and off-the-shelf software tools are being deployed. Arguably, the most important building block of a GP is the kernel function, which assumes the role of a covariance operator. Stationary kernels of the Matérn class are used in the vast majority of applied studies; poor prediction performance and unrealistic uncertainty quantification are often the consequences. Non-stationary kernels show improved performance but are rarely used due to their more complicated functional form and the associated effort and expertise needed to define and tune them optimally. In this perspective, we want to help ML practitioners make sense of some of the most common forms of non-stationarity for Gaussian processes. We show a variety of kernels in action using representative datasets, carefully study their properties, and compare their performances. Based on our findings, we propose a new kernel that combines some of the identified advantages of existing kernels.

Published in APL Machine Learning

ISSN: 2770-9019 (Online)
Publisher: AIP Publishing LLC
Country of publisher: United States
LCC subjects: Science: Physics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://pubs.aip.org/aip/aml

About the journal