Journal of Optimization, Differential Equations and Their Applications (Aug 2024)
On Systems of Neural ODEs with Generalized Power Activation Functions
Abstract
When constructing neural network-based models, it is common practice to use time-tested activation functions such as the hyperbolic tangent, the sigmoid, or the ReLU. These choices, however, may be suboptimal. The hyperbolic tangent and the sigmoid are differentiable but bounded, which can lead to the vanishing gradient problem. The ReLU is unbounded but not differentiable at the point 0, which may hamper training with some optimizers. One can attempt to use sigmoid-shaped functions such as the cubic root, but it is likewise not differentiable at the point 0. One activation function that is often overlooked is the identity function. Although it does not introduce nonlinear behavior into the model by itself, it can help build more explainable models more quickly, since it costs nothing to evaluate, while the nonlinearities can be supplied by the model's evaluation rule. In this article, we explore the use of a specially designed unbounded, differentiable generalized power activation function, the identity function, and their combinations for approximating univariate time series data with neural ordinary differential equations. Examples are given.
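The abstract does not give the closed form of the generalized power activation. A minimal sketch, assuming the common signed-power form sign(x)·|x|^p with p > 1 (an assumption, not necessarily the authors' definition), illustrates the differentiability contrast with the ReLU and the cubic root:

    import numpy as np

    def generalized_power(x, p=2.0):
        """Hypothetical generalized power activation: sign(x) * |x|**p.

        Assumption: for p > 1 this form is unbounded and differentiable
        everywhere, including at 0, where its derivative p * |x|**(p-1)
        tends to 0. By contrast, the cubic root (p = 1/3) has an
        unbounded derivative at 0, and the ReLU is not differentiable there.
        """
        return np.sign(x) * np.abs(x) ** p

    def identity(x):
        """Identity activation: free to evaluate; nonlinearity can instead
        come from the model's evaluation rule, e.g. the ODE system."""
        return x

    # Derivative near 0 stays finite for p > 1, unlike the cubic root.
    for x in (-0.1, 0.0, 0.1):
        print(x, generalized_power(x, p=2.0))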
Keywords