Clarifying trust of materials property predictions using neural networks with distribution-specific uncertainty quantification

Cameron J Gruich; Varun Madhavan; Yixin Wang; Bryan R Goldsmith

doi:10.1088/2632-2153/accace

Machine Learning: Science and Technology (Jan 2023)

Clarifying trust of materials property predictions using neural networks with distribution-specific uncertainty quantification

Cameron J Gruich,
Varun Madhavan,
Yixin Wang,
Bryan R Goldsmith

Affiliations

Cameron J Gruich: ORCiD; Department of Chemical Engineering, University of Michigan , Ann Arbor, MI 48109-2136, United States of America; Catalysis Science and Technology Institute, University of Michigan , Ann Arbor, MI 48109-2136, United States of America
Varun Madhavan: ORCiD; Department of Chemical Engineering, University of Michigan , Ann Arbor, MI 48109-2136, United States of America
Yixin Wang: ORCiD; Department of Statistics, University of Michigan , 1085 S University Ave, Ann Arbor, MI 48109-1107, United States of America
Bryan R Goldsmith: Department of Chemical Engineering, University of Michigan , Ann Arbor, MI 48109-2136, United States of America; Catalysis Science and Technology Institute, University of Michigan , Ann Arbor, MI 48109-2136, United States of America

DOI: https://doi.org/10.1088/2632-2153/accace
Journal volume & issue: Vol. 4, no. 2
p. 025019

Abstract

Read online

It is critical that machine learning (ML) model predictions be trustworthy for high-throughput catalyst discovery approaches. Uncertainty quantification (UQ) methods allow estimation of the trustworthiness of an ML model, but these methods have not been well explored in the field of heterogeneous catalysis. Herein, we investigate different UQ methods applied to a crystal graph convolutional neural network to predict adsorption energies of molecules on alloys from the Open Catalyst 2020 dataset, the largest existing heterogeneous catalyst dataset. We apply three UQ methods to the adsorption energy predictions, namely k -fold ensembling, Monte Carlo dropout, and evidential regression. The effectiveness of each UQ method is assessed based on accuracy, sharpness, dispersion, calibration, and tightness. Evidential regression is demonstrated to be a powerful approach for rapidly obtaining tunable, competitively trustworthy UQ estimates for heterogeneous catalysis applications when using neural networks. Recalibration of model uncertainties is shown to be essential in practical screening applications of catalysts using uncertainties.

Published in Machine Learning: Science and Technology

ISSN: 2632-2153 (Online)
Publisher: IOP Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://iopscience.iop.org/journal/2632-2153

About the journal

Abstract

Keywords