Tensor Network Methods for Hyperparameter Optimization and Compression of Convolutional Neural Networks

A. Naumov; A. Melnikov; M. Perelshtein; Ar. Melnikov; V. Abronin; F. Oksanichenko

doi:10.3390/app15041852

Applied Sciences (Feb 2025)

Tensor Network Methods for Hyperparameter Optimization and Compression of Convolutional Neural Networks

A. Naumov,
A. Melnikov,
M. Perelshtein,
Ar. Melnikov,
V. Abronin,
F. Oksanichenko

Affiliations

A. Naumov: Terra Quantum AG, Kornhausstrasse 25, 9000 St. Gallen, Switzerland
A. Melnikov: Terra Quantum AG, Kornhausstrasse 25, 9000 St. Gallen, Switzerland
M. Perelshtein: Terra Quantum AG, Kornhausstrasse 25, 9000 St. Gallen, Switzerland
Ar. Melnikov: Terra Quantum AG, Kornhausstrasse 25, 9000 St. Gallen, Switzerland
V. Abronin: Terra Quantum AG, Kornhausstrasse 25, 9000 St. Gallen, Switzerland
F. Oksanichenko: Terra Quantum AG, Kornhausstrasse 25, 9000 St. Gallen, Switzerland

DOI: https://doi.org/10.3390/app15041852
Journal volume & issue: Vol. 15, no. 4
p. 1852

Abstract

Read online

Neural networks have become a cornerstone of computer vision applications, with tasks ranging from image classification to object detection. However, challenges such as hyperparameter optimization (HPO) and model compression remain critical for improving performance and deploying models on resource-constrained devices. In this work, we address these challenges using Tensor Network-based methods. For HPO, we propose and evaluate the TetraOpt algorithm against various optimization algorithms. These evaluations were conducted on subsets of the NATS-Bench dataset, including CIFAR-10, CIFAR-100, and ImageNet subsets. TetraOpt consistently demonstrated superior performance, effectively exploring the global optimization space and identifying configurations with higher accuracies. For model compression, we introduce a novel iterative method that combines CP, SVD, and Tucker tensor decompositions. Applied to ResNet-18 and ResNet-152, we evaluated our method on the CIFAR-10 and Tiny ImageNet datasets. Our method achieved compression ratios of up to 14.5× for ResNet-18 and 2.5× for ResNet-152. Additionally, the inference time for processing an image on a CPU remained largely unaffected, demonstrating the practicality of the method.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords