Stats (Sep 2021)

Cross-Validation, Information Theory, or Maximum Likelihood? A Comparison of Tuning Methods for Penalized Splines

  • Lauren N. Berry
  • Nathaniel E. Helwig

DOI
https://doi.org/10.3390/stats4030042
Journal volume & issue
Vol. 4, no. 3
pp. 701–724

Abstract

Functional data analysis techniques, such as penalized splines, have become common tools in a variety of applied research settings, where they are used to estimate unknown functions from noisy data. The success of these estimators depends on choosing a tuning parameter that provides the correct balance between fitting and smoothing the data. Several smoothing parameter selection methods have been proposed for choosing a reasonable tuning parameter, and they generally fall into one of three categories: cross-validation methods, information theoretic methods, and maximum likelihood methods. Despite the well-known importance of selecting an ideal smoothing parameter, there is little agreement in the literature regarding which method(s) should be considered when analyzing real data. In this paper, we address this issue by exploring the practical performance of six popular tuning methods under a variety of simulated and real data situations. Our results reveal that maximum likelihood methods outperform the popular cross-validation methods in most situations, especially in the presence of correlated errors. Moreover, the maximum likelihood methods perform well even when the errors are non-Gaussian and/or heteroscedastic. For real data applications, we recommend comparing results from cross-validation and maximum likelihood tuning methods, given that these methods tend to perform similarly (differently) when the model is correctly (incorrectly) specified.
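
The following is a minimal sketch, not the authors' code, of the kind of comparison the abstract recommends: fitting the same penalized (P-)spline while selecting the smoothing parameter by generalized cross-validation (GCV, a cross-validation method) and by restricted maximum likelihood (REML, a maximum likelihood method). The paper evaluates six tuning methods; only these two representatives are shown here, and the test function, basis size, penalty order, noise level, and lambda grid are all illustrative assumptions.

```python
# Compare GCV and REML smoothing-parameter selection for a penalized spline.
# Illustrative sketch only; all simulation settings are assumptions.
import numpy as np
from scipy.interpolate import BSpline

rng = np.random.default_rng(0)

# --- simulate noisy observations of a smooth "unknown" function -------------
n = 200
x = np.sort(rng.uniform(0.0, 1.0, n))
f_true = np.sin(2 * np.pi * x**2)            # assumed true function
y = f_true + rng.normal(scale=0.3, size=n)   # independent Gaussian errors

# --- cubic B-spline basis with equally spaced interior knots ----------------
degree, n_interior = 3, 20
knots = np.concatenate((np.repeat(x.min(), degree + 1),
                        np.linspace(x.min(), x.max(), n_interior + 2)[1:-1],
                        np.repeat(x.max(), degree + 1)))
k = len(knots) - degree - 1                  # number of basis coefficients
B = np.column_stack([BSpline(knots, np.eye(k)[j], degree)(x) for j in range(k)])

# --- second-order difference penalty (Eilers & Marx style P-spline) ---------
D = np.diff(np.eye(k), n=2, axis=0)
P = D.T @ D
null_dim = 2                                 # dimension of the penalty null space

def fit(lam):
    """Penalized least squares fit for a given smoothing parameter lam."""
    A = B.T @ B + lam * P
    beta = np.linalg.solve(A, B.T @ y)
    resid = y - B @ beta
    rss = resid @ resid
    edf = np.trace(np.linalg.solve(A, B.T @ B))   # effective degrees of freedom
    return beta, rss, lam * (beta @ P @ beta), edf

def gcv(lam):
    # Generalized cross-validation score: n * RSS / (n - edf)^2
    _, rss, _, edf = fit(lam)
    return n * rss / (n - edf) ** 2

def reml(lam):
    # Negative restricted log-likelihood (mixed-model view of the spline),
    # with the error variance profiled out; constants in lambda are dropped.
    _, rss, pen, _ = fit(lam)
    sigma2 = (rss + pen) / (n - null_dim)
    _, logdet = np.linalg.slogdet(B.T @ B + lam * P)
    return (n - null_dim) * np.log(sigma2) + logdet - (k - null_dim) * np.log(lam)

# --- grid search over the smoothing parameter -------------------------------
lambdas = np.exp(np.linspace(-8, 12, 201))
lam_gcv = lambdas[np.argmin([gcv(l) for l in lambdas])]
lam_reml = lambdas[np.argmin([reml(l) for l in lambdas])]

for name, lam in [("GCV", lam_gcv), ("REML", lam_reml)]:
    beta, _, _, edf = fit(lam)
    mse = np.mean((B @ beta - f_true) ** 2)
    print(f"{name:4s}: lambda = {lam:10.3f}  edf = {edf:5.2f}  MSE vs truth = {mse:.5f}")
```

In a well-specified simulation like this one, the two criteria typically select similar smoothing parameters and effective degrees of freedom; following the abstract's recommendation, a noticeable disagreement between them on real data can be read as a hint of model misspecification (for example, correlated errors).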

Keywords