Looking at the posterior: accuracy and uncertainty of neural-network predictions

Hampus Linander; Oleksandr Balabanov; Henry Yang; Bernhard Mehlig

doi:10.1088/2632-2153/ad0ab4

Machine Learning: Science and Technology (Jan 2023)

Looking at the posterior: accuracy and uncertainty of neural-network predictions

Hampus Linander,
Oleksandr Balabanov,
Henry Yang,
Bernhard Mehlig

Affiliations

Hampus Linander: ORCiD; Department of Physics, University of Gothenburg , 41296 Gothenburg, Sweden; Department of Mathematical Sciences, Chalmers University of Technology, University of Gothenburg , 41296 Gothenburg, Sweden
Oleksandr Balabanov: ORCiD; Department of Physics, Stockholm University , 10691 Stockholm, Sweden
Henry Yang: Department of Physics, University of Gothenburg , 41296 Gothenburg, Sweden
Bernhard Mehlig: ORCiD; Department of Physics, University of Gothenburg , 41296 Gothenburg, Sweden

DOI: https://doi.org/10.1088/2632-2153/ad0ab4
Journal volume & issue: Vol. 4, no. 4
p. 045032

Abstract

Read online

Bayesian inference can quantify uncertainty in the predictions of neural networks using posterior distributions for model parameters and network output. By looking at these posterior distributions, one can separate the origin of uncertainty into aleatoric and epistemic contributions. One goal of uncertainty quantification is to inform on prediction accuracy. Here we show that prediction accuracy depends on both epistemic and aleatoric uncertainty in an intricate fashion that cannot be understood in terms of marginalized uncertainty distributions alone. How the accuracy relates to epistemic and aleatoric uncertainties depends not only on the model architecture, but also on the properties of the dataset. We discuss the significance of these results for active learning and introduce a novel acquisition function that outperforms common uncertainty-based methods. To arrive at our results, we approximated the posteriors using deep ensembles, for fully-connected, convolutional and attention-based neural networks.

Published in Machine Learning: Science and Technology

ISSN: 2632-2153 (Online)
Publisher: IOP Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://iopscience.iop.org/journal/2632-2153

About the journal

Abstract

Keywords