Multi-Fidelity Neural Architecture Search With Knowledge Distillation

Ilya Trofimov; Nikita Klyuchnikov; Mikhail Salnikov; Alexander Filippov; Evgeny Burnaev

doi:10.1109/ACCESS.2023.3234810

IEEE Access (Jan 2023)

Multi-Fidelity Neural Architecture Search With Knowledge Distillation

Ilya Trofimov,
Nikita Klyuchnikov,
Mikhail Salnikov,
Alexander Filippov,
Evgeny Burnaev

Affiliations

Ilya Trofimov: ORCiD; Skolkovo Institute of Science and Technology, Moscow, Russia
Nikita Klyuchnikov: ORCiD; Skolkovo Institute of Science and Technology, Moscow, Russia
Mikhail Salnikov: Skolkovo Institute of Science and Technology, Moscow, Russia
Alexander Filippov: ORCiD; Huawei, Moscow, Russia
Evgeny Burnaev: ORCiD; Skolkovo Institute of Science and Technology, Moscow, Russia

DOI: https://doi.org/10.1109/ACCESS.2023.3234810
Journal volume & issue: Vol. 11
pp. 59217 – 59225

Abstract

Read online

Neural architecture search (NAS) targets at finding the optimal architecture of a neural network for a problem or a family of problems. Evaluations of neural architectures are very time-consuming. One of the possible ways to mitigate this issue is to use low-fidelity evaluations, namely training on a part of a dataset, fewer epochs, with fewer channels, etc. In this paper, we propose a Bayesian multi-fidelity (MF) method for neural architecture search: MF-KD. The method relies on a new approach to low-fidelity evaluations of neural architectures by training for a few epochs using a knowledge distillation (KD). Knowledge distillation adds to a loss function a term forcing a network to mimic some teacher network. We carry out experiments on CIFAR-10, CIFAR-100, and ImageNet-16-120. We show that training for a few epochs with such a modified loss function leads to a better selection of neural architectures than training for a few epochs with a logistic loss. The proposed method outperforms several state-of-the-art baselines.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords