Utilizing Statistical Tests for Comparing Machine Learning Algorithms

Hozan Khalid Hamarashid

doi:10.24017/science.2021.1.8

Kurdistan Journal of Applied Research (Jul 2021)

Utilizing Statistical Tests for Comparing Machine Learning Algorithms

Hozan Khalid Hamarashid

Affiliations

Hozan Khalid Hamarashid: Information Technology Department, Computer Science Institute, Sulaimani Polytechnic University, Sulaimani, Iraq

DOI: https://doi.org/10.24017/science.2021.1.8
Journal volume & issue: Vol. 6, no. 1

Abstract

Read online

The mean result of machine learning models is determined by utilizing k-fold cross-validation. The algorithm with the best average performance should surpass those with the poorest. But what if the difference in average outcomes is the consequence of a statistical anomaly? To conduct whether or not the mean result differences between two algorithms is genuine then statistical hypothesis test is utilized. Using statistical hypothesis testing, this study will demonstrate how to compare machine learning algorithms. The output of several machine learning algorithms or simulation pipelines is compared during model selection. The model that performs the best based on your performance measure becomes the last model, which can be utilized to make predictions on new data. With classification and regression prediction models it can be conducted by utilizing traditional machine learning and deep learning methods. The difficulty is to identify whether or not the difference between two models is accurate.

machine learning, machine learning assessment, statistical tests, machine learning algorithm, machine learning comparison.

Published in Kurdistan Journal of Applied Research

ISSN: 2411-7684 (Print); 2411-7706 (Online)
Publisher: Sulaimani Polytechnic University
Country of publisher: Iraq
LCC subjects: Technology: Technology (General); Science
Website: https://www.kjar.spu.edu.iq/index.php/kjar/index

About the journal

Abstract

Keywords