Nature Communications (Feb 2023)

Separation of scales and a thermodynamic description of feature learning in some CNNs

  • Inbar Seroussi,
  • Gadi Naveh,
  • Zohar Ringel

DOI
https://doi.org/10.1038/s41467-023-36361-y
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 12

Abstract

Read online

In the quest to understand how deep neural networks work, identification of slow and fast variables is a desirable step. Inspired by tools from theoretical physics, the authors propose a simplified description of finite deep neural networks based on two matrix variables per layer and provide analytic predictions for feature learning effects.