Hyperparameter Optimization for Machine Learning Models Based on Bayesian Optimizationb

Jia Wu; Xiu-Yun Chen; Hao Zhang; Li-Dong Xiong; Hang Lei; Si-Hao Deng

doi:10.11989/jest.1674-862x.80904120

Journal of Electronic Science and Technology (Mar 2019)

Hyperparameter Optimization for Machine Learning Models Based on Bayesian Optimizationb

Jia Wu,
Xiu-Yun Chen,
Hao Zhang,
Li-Dong Xiong,
Hang Lei,
Si-Hao Deng

Affiliations

Jia Wu: School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China; Corresponding author (e-mail: [email protected]; [email protected]; [email protected]; [email protected]; [email protected]; e-mail: [email protected])
Xiu-Yun Chen: School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China
Hao Zhang: School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China
Li-Dong Xiong: School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China
Hang Lei: School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China
Si-Hao Deng: Université de Technologie de Belfort-Montbéliard, Belfort 90010, France

DOI: https://doi.org/10.11989/jest.1674-862x.80904120
Journal volume & issue: Vol. 17, no. 1
pp. 26 – 40

Abstract

Read online

Hyperparameters are important for machine learning algorithms since they directly control the behaviors of training algorithms and have a significant effect on the performance of machine learning models. Several techniques have been developed and successfully applied for certain application domains. However, this work demands professional knowledge and expert experience. And sometimes it has to resort to the brute-force search. Therefore, if an efficient hyperparameter optimization algorithm can be developed to optimize any given machine learning method, it will greatly improve the efficiency of machine learning. In this paper, we consider building the relationship between the performance of the machine learning models and their hyperparameters by Gaussian processes. In this way, the hyperparameter tuning problem can be abstracted as an optimization problem and Bayesian optimization is used to solve the problem. Bayesian optimization is based on the Bayesian theorem. It sets a prior over the optimization function and gathers the information from the previous sample to update the posterior of the optimization function. A utility function selects the next sample point to maximize the optimization function. Several experiments were conducted on standard test datasets. Experiment results show that the proposed method can find the best hyperparameters for the widely used machine learning models, such as the random forest algorithm and the neural networks, even multi-grained cascade forest under the consideration of time cost.

Published in Journal of Electronic Science and Technology

ISSN: 1674-862X (Print); 2666-223X (Online)
Publisher: KeAi Communications Co., Ltd.
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://www.keaipublishing.com/en/journals/journal-of-electronic-science-and-technology/

About the journal

Abstract

Keywords