Calibration of uncertainty in the active learning of machine learning force fields

Adam Thomas-Mitchell; Glenn Hawe; Paul L A Popelier

doi:10.1088/2632-2153/ad0ab5

Machine Learning: Science and Technology (Jan 2023)

Calibration of uncertainty in the active learning of machine learning force fields

Adam Thomas-Mitchell,
Glenn Hawe,
Paul L A Popelier

Affiliations

Adam Thomas-Mitchell: ORCiD; School of Computing, Ulster University , 2-24 York Street, BT15 1AP Belfast, United Kingdom
Glenn Hawe: ORCiD; School of Computing, Ulster University , 2-24 York Street, BT15 1AP Belfast, United Kingdom
Paul L A Popelier: ORCiD; Department of Chemistry, The University of Manchester , Oxford Road, M13 9PL Manchester, United Kingdom

DOI: https://doi.org/10.1088/2632-2153/ad0ab5
Journal volume & issue: Vol. 4, no. 4
p. 045034

Abstract

Read online

FFLUX is a machine learning force field that uses the maximum expected prediction error (MEPE) active learning algorithm to improve the efficiency of model training. MEPE uses the predictive uncertainty of a Gaussian process (GP) to balance exploration and exploitation when selecting the next training sample. However, the predictive uncertainty of a GP is unlikely to be accurate or precise immediately after training. We hypothesize that calibrating the uncertainty quantification within MEPE will improve active learning performance. We develop and test two methods to improve uncertainty estimates: post-hoc calibration of predictive uncertainty using the CRUDE algorithm, and replacing the GP with a student- t process. We investigate the impact of these methods on MEPE for single sample and batch sample active learning. Our findings suggest that post-hoc calibration does not improve the performance of active learning using the MEPE method. However, we do find that the student- t process can outperform active learning strategies and random sampling using a GP if the training set is sufficiently large.

Published in Machine Learning: Science and Technology

ISSN: 2632-2153 (Online)
Publisher: IOP Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://iopscience.iop.org/journal/2632-2153

About the journal

Abstract

Keywords