Unifying O(3) equivariant neural networks design with tensor-network formalism

Zimu Li; Zihan Pengmei; Han Zheng; Erik Thiede; Junyu Liu; Risi Kondor

doi:10.1088/2632-2153/ad4a04

Machine Learning: Science and Technology (Jan 2024)

Unifying O(3) equivariant neural networks design with tensor-network formalism

Zimu Li,
Zihan Pengmei,
Han Zheng,
Erik Thiede,
Junyu Liu,
Risi Kondor

Affiliations

Zimu Li: ORCiD; DAMTP, Center for Mathematical Sciences, University of Cambridge , Cambridge CB30WA, United Kingdom
Zihan Pengmei: Department of Chemistry, The University of Chicago , Chicago, IL 60637, United States of America
Han Zheng: Department of Computer Science, The University of Chicago , Chicago, IL 60637, United States of America
Erik Thiede: Center for Computational Mathematics, Flatiron Institute , New York, NY 10010, United States of America
Junyu Liu: ORCiD; Pritzker School of Molecular Engineering, The University of Chicago , Chicago, IL 60637, United States of America; Kadanoff Center for Theoretical Physics, The University of Chicago , Chicago, IL 60637, United States of America; SeQure , Chicago, IL 60615, United States of America
Risi Kondor: Department of Computer Science, The University of Chicago , Chicago, IL 60637, United States of America

DOI: https://doi.org/10.1088/2632-2153/ad4a04
Journal volume & issue: Vol. 5, no. 2
p. 025044

Abstract

Read online

Many learning tasks, including learning potential energy surfaces from ab initio calculations, involve global spatial symmetries and permutational symmetry between atoms or general particles. Equivariant graph neural networks are a standard approach to such problems, with one of the most successful methods employing tensor products between various tensors that transform under the spatial group. However, as the number of different tensors and the complexity of relationships between them increase, maintaining parsimony and equivariance becomes increasingly challenging. In this paper, we propose using fusion diagrams, a technique widely employed in simulating SU(2)-symmetric quantum many-body problems, to design new spatial equivariant components for neural networks. This results in a diagrammatic approach to constructing novel neural network architectures. When applied to particles within a given local neighborhood, the resulting components, which we term ‘fusion blocks,’ serve as universal approximators of any continuous equivariant function defined on the neighborhood. We incorporate a fusion block into pre-existing equivariant architectures (Cormorant and MACE), leading to improved performance with fewer parameters on a range of challenging chemical problems. Furthermore, we apply group-equivariant neural networks to study non-adiabatic molecular dynamics of stilbene cis-trans isomerization. Our approach, which combines tensor networks with equivariant neural networks, suggests a potentially fruitful direction for designing more expressive equivariant neural networks.

Published in Machine Learning: Science and Technology

ISSN: 2632-2153 (Online)
Publisher: IOP Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://iopscience.iop.org/journal/2632-2153

About the journal

Abstract

Keywords