Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions

Rubaiyat Mohammad Khondaker; Stephen Gow; Samantha Kanza; Jeremy G Frey; Mahesan Niranjan

doi:10.1186/s13321-022-00641-4

Journal of Cheminformatics (Sep 2022)

Robustness under parameter and problem domain alterations of Bayesian optimization methods for chemical reactions

Rubaiyat Mohammad Khondaker,
Stephen Gow,
Samantha Kanza,
Jeremy G Frey,
Mahesan Niranjan

Affiliations

Rubaiyat Mohammad Khondaker: Department of Mathematics, University of Cambridge
Stephen Gow: Department of Chemistry, University of Southampton
Samantha Kanza: Department of Chemistry, University of Southampton
Jeremy G Frey: Department of Chemistry, University of Southampton
Mahesan Niranjan: Department of Electronics and Computer Science, University of Southampton

DOI: https://doi.org/10.1186/s13321-022-00641-4
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 10

Abstract

Read online

Abstract The related problems of chemical reaction optimization and reaction scope search concern the discovery of reaction pathways and conditions that provide the best percentage yield of a target product. The space of possible reaction pathways or conditions is too large to search in full, so identifying a globally optimal set of conditions must instead draw on mathematical methods to identify areas of the space that should be investigated. An intriguing contribution to this area of research is the recent development of the Experimental Design for Bayesian optimization (EDBO) optimizer [1]. Bayesian optimization works by building an approximation to the true function to be optimized based on a small set of simulations, and selecting the next point (or points) to be tested based on an acquisition function reflecting the value of different points within the input space. In this work, we evaluated the robustness of the EDBO optimizer under several changes to its specification. We investigated the effect on the performance of the optimizer of altering the acquisition function and batch size, applied the method to other existing reaction yield data sets, and considered its performance in the new problem domain of molecular power conversion efficiency in photovoltaic cells. Our results indicated that the EDBO optimizer broadly performs well under these changes; of particular note is the competitive performance of the computationally cheaper acquisition function Thompson Sampling when compared to the original Expected Improvement function, and some concerns around the method’s performance for “incomplete” input domains.

Published in Journal of Cheminformatics

ISSN: 1758-2946 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Chemistry
Website: https://jcheminf.biomedcentral.com/

About the journal

Abstract

Keywords