Cross-Validation Visualized: A Narrative Guide to Advanced Methods

Johannes Allgaier; Rüdiger Pryss

doi:10.3390/make6020065

Machine Learning and Knowledge Extraction (Jun 2024)

Cross-Validation Visualized: A Narrative Guide to Advanced Methods

Johannes Allgaier,
Rüdiger Pryss

Affiliations

Johannes Allgaier: Institute of Medical Data Science, University Hospital Würzburg, 97080 Würzburg, Germany
Rüdiger Pryss: Institute of Medical Data Science, University Hospital Würzburg, 97080 Würzburg, Germany

DOI: https://doi.org/10.3390/make6020065
Journal volume & issue: Vol. 6, no. 2
pp. 1378 – 1388

Abstract

Read online

This study delves into the multifaceted nature of cross-validation (CV) techniques in machine learning model evaluation and selection, underscoring the challenge of choosing the most appropriate method due to the plethora of available variants. It aims to clarify and standardize terminology such as sets, groups, folds, and samples pivotal in the CV domain, and introduces an exhaustive compilation of advanced CV methods like leave-one-out, leave-p-out, Monte Carlo, grouped, stratified, and time-split CV within a hold-out CV framework. Through graphical representations, the paper enhances the comprehension of these methodologies, facilitating more informed decision making for practitioners. It further explores the synergy between different CV strategies and advocates for a unified approach to reporting model performance by consolidating essential metrics. The paper culminates in a comprehensive overview of the CV techniques discussed, illustrated with practical examples, offering valuable insights for both novice and experienced researchers in the field.

Published in Machine Learning and Knowledge Extraction

ISSN: 2504-4990 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware
Website: https://www.mdpi.com/journal/make

About the journal

Abstract

Keywords