IEEE Access (Jan 2024)

Bayesian Neural Networks via MCMC: A Python-Based Tutorial

  • Rohitash Chandra
  • Joshua Simmons

DOI: https://doi.org/10.1109/ACCESS.2024.3401234
Journal volume & issue: Vol. 12, pp. 70519–70549

Abstract

Bayesian inference provides a methodology for parameter estimation and uncertainty quantification in machine learning and deep learning methods. Variational inference and Markov chain Monte Carlo (MCMC) sampling are the two main approaches used to implement Bayesian inference. Over the past few decades, MCMC sampling methods have faced challenges in scaling to larger models (such as deep learning models) and big data problems. Advanced proposal distributions that incorporate gradients, such as the Langevin proposal distribution, provide a means to address some of the limitations of MCMC sampling for Bayesian neural networks. Furthermore, MCMC methods have largely remained the domain of statisticians and are hence not well known among deep learning researchers. We present a tutorial on MCMC methods that covers simple Bayesian linear and logistic models as well as Bayesian neural networks. The aim of this tutorial is to bridge the gap between theory and implementation via Python code, given the general scarcity of libraries and tutorials in this area. The tutorial provides Python code with data and instructions that enable its use and extension. We provide results for selected benchmark problems, showing the strengths and weaknesses of implementing the respective Bayesian models via MCMC. We highlight the challenges of sampling multi-modal posterior distributions in the case of Bayesian neural networks and the need for further improvement of convergence diagnostic methods.
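To make the Langevin-gradient idea mentioned above concrete, the following is a minimal sketch (not the authors' code from the tutorial) of MCMC sampling with a Langevin proposal for a Bayesian linear regression model. It assumes a Gaussian likelihood with known noise and a zero-mean Gaussian prior; all variable names, step sizes, and hyperparameters are illustrative.

import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: y = X @ w_true + noise (values are illustrative).
n, d = 100, 3
X = rng.normal(size=(n, d))
w_true = np.array([1.5, -2.0, 0.5])
sigma = 0.5                        # observation noise std (assumed known)
y = X @ w_true + sigma * rng.normal(size=n)

tau = 10.0                         # prior std for the weights (assumed)

def log_posterior(w):
    # Gaussian log-likelihood plus Gaussian log-prior, up to a constant.
    resid = y - X @ w
    return -0.5 * resid @ resid / sigma**2 - 0.5 * w @ w / tau**2

def grad_log_posterior(w):
    # Analytic gradient of the log posterior above.
    return X.T @ (y - X @ w) / sigma**2 - w / tau**2

def langevin_mcmc(n_samples=5000, step=1e-3):
    w = np.zeros(d)
    samples = []
    for _ in range(n_samples):
        # Langevin proposal: a gradient step plus Gaussian noise.
        mean_fwd = w + step * grad_log_posterior(w)
        w_prop = mean_fwd + np.sqrt(2.0 * step) * rng.normal(size=d)
        # The proposal is asymmetric, so the Metropolis-Hastings ratio
        # must include the reverse proposal density.
        mean_rev = w_prop + step * grad_log_posterior(w_prop)
        log_q_fwd = -(w_prop - mean_fwd) @ (w_prop - mean_fwd) / (4.0 * step)
        log_q_rev = -(w - mean_rev) @ (w - mean_rev) / (4.0 * step)
        log_alpha = (log_posterior(w_prop) - log_posterior(w)
                     + log_q_rev - log_q_fwd)
        if np.log(rng.uniform()) < log_alpha:
            w = w_prop
        samples.append(w.copy())
    return np.array(samples)

samples = langevin_mcmc()
print("posterior mean:", samples[2500:].mean(axis=0))  # discard burn-in

The same accept/reject structure carries over to Bayesian neural networks, where the gradient of the log posterior is obtained via backpropagation rather than the closed form used here.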

Keywords