New Journal of Physics (Jan 2020)

Data integration for accelerated materials design via preference learning

  • Xiaolin Sun,
  • Zhufeng Hou,
  • Masato Sumita,
  • Shinsuke Ishihara,
  • Ryo Tamura,
  • Koji Tsuda

DOI
https://doi.org/10.1088/1367-2630/ab82b9
Journal volume & issue
Vol. 22, no. 5
p. 055001

Abstract

Read online

Machine learning applications in materials science are often hampered by shortage of experimental data. Integration with external datasets from past experiments is a viable way to solve the problem. But complex calibration is often necessary to use the data obtained under different conditions. In this paper, we present a novel calibration-free strategy to enhance the performance of Bayesian optimization with preference learning. The entire learning process is solely based on pairwise comparison of quantities (i.e., higher or lower) in the same dataset, and experimental design can be done without comparing quantities in different datasets. We demonstrate that Bayesian optimization is significantly enhanced via data integration for organic molecules and inorganic solid-state materials. Our method increases the chance that public datasets are reused and may encourage data sharing in various fields of physics.

Keywords