Nonsmooth Optimization-Based Hyperparameter-Free Neural Networks for Large-Scale Regression

Napsu Karmitsa; Sona Taheri; Kaisa Joki; Pauliina Paasivirta; Adil M. Bagirov; Marko M. Mäkelä

doi:10.3390/a16090444

Algorithms (Sep 2023)

Affiliations

Napsu Karmitsa: Department of Computing, University of Turku, FI-20014 Turku, Finland
Sona Taheri: School of Science, RMIT University, Melbourne 3000, Australia
Kaisa Joki: Department of Mathematics and Statistics, University of Turku, FI-20014 Turku, Finland
Pauliina Paasivirta: Siili Solutions Oyj, FI-60100 Seinäjoki, Finland
Adil M. Bagirov: Centre for Smart Analytics, Federation University Australia, Ballarat 3350, Australia
Marko M. Mäkelä: Department of Mathematics and Statistics, University of Turku, FI-20014 Turku, Finland

DOI: https://doi.org/10.3390/a16090444
Journal volume & issue: Vol. 16, no. 9, p. 444

Abstract

In this paper, a new nonsmooth optimization-based algorithm for solving large-scale regression problems is introduced. The regression problem is modeled using a fully connected feedforward neural network with one hidden layer, a piecewise linear activation function, and the L1-loss function. A modified version of the limited memory bundle method is applied to minimize this nonsmooth objective. In addition, a novel constructive approach for automated determination of the proper number of hidden nodes is developed. Finally, large real-world data sets are used to evaluate the proposed algorithm and to compare it with some state-of-the-art neural network algorithms for regression. The results demonstrate the superiority of the proposed algorithm as a predictive tool in most data sets used in numerical experiments.
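
To make the objective described in the abstract concrete, the following is a minimal sketch of a one-hidden-layer network with a piecewise linear activation and an L1 loss. It is an illustration under stated assumptions, not the authors' implementation: the activation is assumed to be ReLU, the loss is assumed to be the mean absolute residual, and all function and parameter names (`predict`, `l1_loss`, `l1_subgradient`, `W`, `b`, `v`, `c`) are hypothetical. The limited memory bundle method and the constructive choice of the number of hidden nodes are not reproduced here; the sketch only evaluates the nonsmooth objective and one chain-rule generalized gradient that such a method could consume.

```python
import numpy as np


def relu(z):
    # Assumed piecewise linear activation (ReLU); the abstract does not name it.
    return np.maximum(z, 0.0)


def predict(X, W, b, v, c):
    # X: (n_samples, n_features), W: (n_features, n_hidden),
    # b: (n_hidden,), v: (n_hidden,), c: scalar output bias.
    return relu(X @ W + b) @ v + c


def l1_loss(X, y, W, b, v, c):
    # Assumed form of the L1 loss: mean absolute residual.
    return np.mean(np.abs(predict(X, W, b, v, c) - y))


def l1_subgradient(X, y, W, b, v, c):
    # Chain-rule generalized gradient of l1_loss. Where the loss is
    # differentiable this is the ordinary gradient; at kinks, np.sign
    # returns 0 for zero residuals and inactive ReLU units contribute 0.
    n = X.shape[0]
    Z = X @ W + b                      # pre-activations, (n, n_hidden)
    H = relu(Z)                        # hidden outputs, (n, n_hidden)
    residual = H @ v + c - y           # (n,)
    s = np.sign(residual) / n          # derivative of mean |residual|
    active = (Z > 0).astype(float)     # ReLU derivative choice
    G = s[:, None] * active * v[None, :]
    gW = X.T @ G                       # (n_features, n_hidden)
    gb = G.sum(axis=0)                 # (n_hidden,)
    gv = H.T @ s                       # (n_hidden,)
    gc = s.sum()                       # scalar
    return gW, gb, gv, gc
```

Because both the absolute value and ReLU have kinks, the objective is nonsmooth and nonconvex, which is why the abstract refers to a bundle-type method rather than a plain gradient method.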

Published in Algorithms

ISSN: 1999-4893 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/algorithms
