Machine Learning: Science and Technology (Jan 2025)

SymbolNet: neural symbolic regression with adaptive dynamic pruning for compression

  • Ho Fung Tsoi,
  • Vladimir Loncar,
  • Sridhara Dasu,
  • Philip Harris

DOI
https://doi.org/10.1088/2632-2153/adaad8
Journal volume & issue
Vol. 6, no. 1
p. 015021

Abstract

Compact symbolic expressions have been shown to be more efficient than neural network (NN) models in terms of resource consumption and inference speed when implemented on custom hardware such as field-programmable gate arrays (FPGAs), while maintaining comparable accuracy (Tsoi et al 2024 EPJ Web Conf. 295 09036). These capabilities are highly valuable in environments with stringent computational resource constraints, such as high-energy physics experiments at the CERN Large Hadron Collider. However, finding compact expressions for high-dimensional datasets remains challenging due to the inherent limitations of genetic programming (GP), the search algorithm underlying most symbolic regression (SR) methods. In contrast to GP, the NN approach to SR scales to high-dimensional inputs and leverages gradient-based optimization for faster equation search. Common ways of constraining expression complexity often involve multistage pruning with fine-tuning, which can result in significant performance loss. In this work, we propose $\tt{SymbolNet}$, a NN approach to SR specifically designed as a model compression technique, aimed at enabling low-latency inference for high-dimensional inputs on custom hardware such as FPGAs. This framework allows dynamic pruning of model weights, input features, and mathematical operators in a single training process, where both training loss and expression complexity are optimized simultaneously. We introduce a sparsity regularization term for each pruning type that can adaptively adjust its strength, leading to convergence at a target sparsity ratio. Unlike most existing SR methods, which struggle with datasets containing more than $\mathcal{O}(10)$ inputs, we demonstrate the effectiveness of our model on the LHC jet tagging task (16 inputs), MNIST (784 inputs), and SVHN (3072 inputs).
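
The adaptive sparsity regularization described in the abstract can be illustrated with a minimal sketch: a penalty term whose strength is adjusted during training so that the achieved sparsity is driven toward a target ratio. The class name, the use of an L1 penalty, the proportional update rule, and all hyperparameter values below are illustrative assumptions for one pruning type (model weights), not the paper's exact formulation.

```python
import torch
import torch.nn as nn


class AdaptiveSparsityRegularizer:
    """Hypothetical sketch of an adaptive sparsity penalty.

    The regularization strength is increased when the current sparsity
    (fraction of near-zero parameters) is below the target, and decreased
    when it overshoots, so training converges near the target ratio.
    """

    def __init__(self, target_sparsity=0.9, threshold=1e-3,
                 init_strength=1e-4, adapt_rate=1e-2):
        self.target = target_sparsity    # desired fraction of pruned parameters
        self.threshold = threshold       # |w| below this counts as pruned
        self.strength = init_strength    # current (adaptive) penalty strength
        self.adapt_rate = adapt_rate     # how quickly the strength adapts

    def penalty(self, params):
        # L1 penalty scaled by the current adaptive strength.
        return self.strength * sum(p.abs().sum() for p in params)

    def update_strength(self, params):
        # Measure current sparsity and nudge the strength toward the target.
        with torch.no_grad():
            total = sum(p.numel() for p in params)
            pruned = sum((p.abs() < self.threshold).sum().item() for p in params)
        current = pruned / total
        self.strength *= 1.0 + self.adapt_rate * (self.target - current)


# Hypothetical usage: add the penalty to the task loss each step,
# then let the strength adapt based on the measured sparsity.
model = nn.Linear(16, 1)
params = list(model.parameters())
reg = AdaptiveSparsityRegularizer(target_sparsity=0.8)

x, y = torch.randn(32, 16), torch.randn(32, 1)
loss = nn.functional.mse_loss(model(x), y) + reg.penalty(params)
loss.backward()
reg.update_strength(params)
```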

Keywords