Improving neural ordinary differential equations via knowledge distillation

Haoyu Chu; Shikui Wei; Qiming Lu; Yao Zhao

doi:10.1049/cvi2.12248

IET Computer Vision (Mar 2024)

Improving neural ordinary differential equations via knowledge distillation

Haoyu Chu,
Shikui Wei,
Qiming Lu,
Yao Zhao

Affiliations

Haoyu Chu: Institute of Information Science Beijing Jiaotong University Beijing China
Shikui Wei: Institute of Information Science Beijing Jiaotong University Beijing China
Qiming Lu: Institute of Information Science Beijing Jiaotong University Beijing China
Yao Zhao: Institute of Information Science Beijing Jiaotong University Beijing China

DOI: https://doi.org/10.1049/cvi2.12248
Journal volume & issue: Vol. 18, no. 2
pp. 304 – 314

Abstract

Read online

Abstract Neural ordinary differential equations (ODEs) (Neural ODEs) construct the continuous dynamics of hidden units using ODEs specified by a neural network, demonstrating promising results on many tasks. However, Neural ODEs still do not perform well on image recognition tasks. The possible reason is that the one‐hot encoding vector commonly used in Neural ODEs can not provide enough supervised information. A new training based on knowledge distillation is proposed to construct more powerful and robust Neural ODEs fitting image recognition tasks. Specially, the training of Neural ODEs is modelled into a teacher‐student learning process, in which ResNets are proposed as the teacher model to provide richer supervised information. The experimental results show that the new training manner can improve the classification accuracy of Neural ODEs by 5.17%, 24.75%, 7.20%, and 8.99%, on Street View House Numbers, CIFAR10, CIFAR100, and Food‐101, respectively. In addition, the effect of knowledge distillation is also evaluated in Neural ODEs on robustness against adversarial examples. The authors discover that incorporating knowledge distillation, coupled with the increase of the time horizon, can significantly enhance the robustness of Neural ODEs. The performance improvement is analysed from the perspective of the underlying dynamical system.

Published in IET Computer Vision

ISSN: 1751-9632 (Print); 1751-9640 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519640

About the journal

Abstract

Keywords