Classifier uncertainty: evidence, potential impact, and probabilistic treatment

Niklas Tötsch; Daniel Hoffmann

doi:10.7717/peerj-cs.398

PeerJ Computer Science (Mar 2021)

Journal volume & issue: Vol. 7, p. e398

Abstract

Classifiers are often tested on relatively small data sets, which should lead to uncertain performance metrics. Nevertheless, these metrics are usually taken at face value. We present an approach to quantify the uncertainty of classification performance metrics, based on a probability model of the confusion matrix. Application of our approach to classifiers from the scientific literature and a classification competition shows that uncertainties can be surprisingly large and limit performance evaluation. In fact, some published classifiers may be misleading. The application of our approach is simple and requires only the confusion matrix. It is agnostic of the underlying classifier. Our method can also be used for the estimation of sample sizes that achieve a desired precision of a performance metric.
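
To illustrate how uncertainty in a metric can be derived from the confusion matrix alone, here is a minimal sketch. The abstract does not specify the probability model, so the sketch assumes one: independent Beta posteriors with uniform priors for prevalence, sensitivity, and specificity, sampled by Monte Carlo. The function name `metric_intervals` and all of its parameters are hypothetical and are not taken from the paper.

```python
import random

def metric_intervals(tp, fn, tn, fp, n_samples=20000, level=0.95, seed=0):
    """Credible intervals for metrics derived from a 2x2 confusion matrix.

    Assumed model (not taken from the paper): prevalence, sensitivity and
    specificity each get a Beta posterior under a uniform Beta(1, 1) prior.
    """
    rng = random.Random(seed)
    n_pos, n_neg = tp + fn, tn + fp
    samples = {"sensitivity": [], "specificity": [], "accuracy": []}
    for _ in range(n_samples):
        prevalence = rng.betavariate(n_pos + 1, n_neg + 1)
        sensitivity = rng.betavariate(tp + 1, fn + 1)
        specificity = rng.betavariate(tn + 1, fp + 1)
        samples["sensitivity"].append(sensitivity)
        samples["specificity"].append(specificity)
        # Accuracy is the prevalence-weighted mix of the two rates.
        samples["accuracy"].append(
            prevalence * sensitivity + (1 - prevalence) * specificity
        )
    # Equal-tailed interval taken from the sorted posterior samples.
    lo_idx = int((1 - level) / 2 * n_samples)
    hi_idx = int((1 + level) / 2 * n_samples) - 1
    intervals = {}
    for name, values in samples.items():
        values.sort()
        intervals[name] = (values[lo_idx], values[hi_idx])
    return intervals

# A small test set of 40 samples versus one ten times larger
# with the same proportions.
small = metric_intervals(tp=18, fn=2, tn=15, fp=5)
large = metric_intervals(tp=180, fn=20, tn=150, fp=50)
for name in small:
    print(name, small[name], large[name])
```

Under these assumptions the intervals for the 40-sample test set are much wider than those for the larger one, which is the kind of effect the abstract describes. The same routine could be run for increasing sample sizes to find the size at which an interval becomes as narrow as desired.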

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/
