Evidence Networks: simple losses for fast, amortized, neural Bayesian model comparison

Niall Jeffrey; Benjamin D Wandelt

doi:10.1088/2632-2153/ad1a4d

Machine Learning: Science and Technology (Jan 2024)

Affiliations

Niall Jeffrey: Department of Physics & Astronomy, University College London, Gower St., London, United Kingdom
Benjamin D Wandelt: Institut d’Astrophysique de Paris (IAP), UMR 7095 CNRS, Sorbonne Université, Paris, France; Center for Computational Astrophysics, Flatiron Institute, 162 5th Avenue, New York, NY, United States of America

DOI: https://doi.org/10.1088/2632-2153/ad1a4d
Journal volume & issue: Vol. 5, no. 1, p. 015008

Abstract

Evidence Networks can enable Bayesian model comparison when state-of-the-art methods (e.g. nested sampling) fail and even when likelihoods or priors are intractable or unknown. Bayesian model comparison, i.e. the computation of Bayes factors or evidence ratios, can be cast as an optimization problem. Though the Bayesian interpretation of optimal classification is well known, here we change perspective and present classes of loss functions that result in fast, amortized neural estimators that directly estimate convenient functions of the Bayes factor. This mitigates numerical inaccuracies associated with estimating individual model probabilities. We introduce the leaky parity-odd power (l-POP) transform, leading to the novel ‘l-POP-Exponential’ loss function. We explore neural density estimation for data probability in different models, showing it to be less accurate and less scalable than Evidence Networks. Multiple real-world and synthetic examples illustrate that Evidence Networks are explicitly independent of the dimensionality of the parameter space and scale mildly with the complexity of the posterior probability density function. This simple yet powerful approach has broad implications for model inference tasks. As an application of Evidence Networks to real-world data, we compute the Bayes factor for two models with gravitational lensing data of the Dark Energy Survey. We briefly discuss applications of our methods to other, related problems of model comparison and evaluation in implicit inference settings.
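
The abstract's starting point, that optimal classification has a Bayesian interpretation, can be illustrated with a minimal sketch. It is not the paper's method: it uses the standard cross-entropy loss rather than the l-POP-Exponential loss, and a two-parameter logistic model rather than a neural network. The two toy models (unit-variance Gaussians with means 0 and 1), the function names, and the training settings are all assumptions chosen so that the exact answer, log K(x) = x - 1/2, is known. With equal numbers of simulations from each model, the logit of the optimal classifier is the log Bayes factor.

```python
import numpy as np


def simulate(n, rng):
    """Draw n labelled samples from each of two toy models (illustrative assumption)."""
    x0 = rng.normal(0.0, 1.0, n)  # model M0: x ~ N(0, 1)
    x1 = rng.normal(1.0, 1.0, n)  # model M1: x ~ N(1, 1)
    x = np.concatenate([x0, x1])
    m = np.concatenate([np.zeros(n), np.ones(n)])  # model labels 0 / 1
    return x, m


def train_log_bayes_factor(x, m, lr=0.5, steps=2000):
    """Fit f(x) = w*x + b by full-batch gradient descent on the cross-entropy loss.

    With equal prior model probabilities, the minimizer of this loss satisfies
    f(x) = log[p(x|M1) / p(x|M0)], i.e. the log Bayes factor.
    """
    w, b = 0.0, 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(w * x + b)))  # estimated P(M1 | x)
        grad = p - m                            # d(loss)/d(logit) per sample
        w -= lr * np.mean(grad * x)
        b -= lr * np.mean(grad)
    return w, b


rng = np.random.default_rng(0)
x, m = simulate(20000, rng)
w, b = train_log_bayes_factor(x, m)

# Amortized use: once trained, the estimator applies to any observation.
x_obs = 2.0
log_k_est = w * x_obs + b
log_k_true = x_obs - 0.5  # analytic log Bayes factor for these two Gaussians
print(f"estimated log K = {log_k_est:.3f}, analytic log K = {log_k_true:.3f}")
```

Training never evaluates a likelihood or a prior; it only needs labelled simulations from each model. This is the sense in which such estimators remain usable when those densities are intractable.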

Published in Machine Learning: Science and Technology

ISSN: 2632-2153 (Online)
Publisher: IOP Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://iopscience.iop.org/journal/2632-2153
