Extrapolative Bayesian Optimization with Gaussian Process and Neural Network Ensemble Surrogate Models

Yee-Fun Lim; Chee Koon Ng; U.S. Vaitesswar; Kedar Hippalgaonkar

doi:10.1002/aisy.202100101

Advanced Intelligent Systems (Nov 2021)

Extrapolative Bayesian Optimization with Gaussian Process and Neural Network Ensemble Surrogate Models

Yee-Fun Lim,
Chee Koon Ng,
U.S. Vaitesswar,
Kedar Hippalgaonkar

Affiliations

Yee-Fun Lim: Institute of Materials Research and Engineering A*STAR (Agency for Science Technology and Research) 2 Fusionopolis Way #08-03 Innovis Singapore 138634 Republic of Singapore
Chee Koon Ng: Institute of Materials Research and Engineering A*STAR (Agency for Science Technology and Research) 2 Fusionopolis Way #08-03 Innovis Singapore 138634 Republic of Singapore
U.S. Vaitesswar: Department of Materials Science and Engineering Nanyang Technological University 50 Nanyang Avenue Singapore 639798 Republic of Singapore
Kedar Hippalgaonkar: Institute of Materials Research and Engineering A*STAR (Agency for Science Technology and Research) 2 Fusionopolis Way #08-03 Innovis Singapore 138634 Republic of Singapore

DOI: https://doi.org/10.1002/aisy.202100101
Journal volume & issue: Vol. 3, no. 11
pp. n/a – n/a

Abstract

Read online

Bayesian optimization (BO) has emerged as the algorithm of choice for guiding the selection of experimental parameters in automated active learning driven high throughput experiments in materials science and chemistry. Previous studies suggest that optimization performance of the typical surrogate model in the BO algorithm, Gaussian processes (GPs), may be limited due to its inability to handle complex datasets. Herein, various surrogate models for BO, including GPs and neural network ensembles (NNEs), are investigated. Two materials datasets of different complexity with different properties are used, to compare the performance of GP and NNE—the first is the compressive strength of concrete (8 inputs and 1 target), and the second is a simulated high‐dimensional dataset of thermoelectric properties of inorganic materials (22 inputs and 1 target). While NNEs can converge faster toward optimum values, GPs with optimized kernels are able to ultimately achieve the best evaluated values after 100 iterations, even for the most complex dataset. This surprising result is contrary to expectations. It is believed that these findings shed new light on the understanding of surrogate models for BO, and can help accelerate the inverse design of new materials with better structural and functional performance.

Published in Advanced Intelligent Systems

ISSN: 2640-4567 (Online)
Publisher: Wiley
Country of publisher: Germany
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Technology: Mechanical engineering and machinery: Control engineering systems. Automatic machinery (General)
Website: https://onlinelibrary.wiley.com/journal/26404567

About the journal

Abstract

Keywords