Modeling conditional distributions of neural and behavioral data with masked variational autoencoders
Auguste Schulz,
Julius Vetter,
Richard Gao,
Daniel Morales,
Victor Lobato-Rios,
Pavan Ramdya,
Pedro J. Gonçalves,
Jakob H. Macke
Affiliations
Auguste Schulz
Machine Learning in Science, University of Tübingen & Tübingen AI Center, Tübingen, Germany; Corresponding author
Julius Vetter
Machine Learning in Science, University of Tübingen & Tübingen AI Center, Tübingen, Germany
Richard Gao
Machine Learning in Science, University of Tübingen & Tübingen AI Center, Tübingen, Germany
Daniel Morales
Neuroengineering Laboratory, Brain Mind Institute & Interfaculty Institute of Bioengineering, EPFL, Lausanne, Switzerland
Victor Lobato-Rios
Neuroengineering Laboratory, Brain Mind Institute & Interfaculty Institute of Bioengineering, EPFL, Lausanne, Switzerland
Pavan Ramdya
Neuroengineering Laboratory, Brain Mind Institute & Interfaculty Institute of Bioengineering, EPFL, Lausanne, Switzerland
Pedro J. Gonçalves
Machine Learning in Science, University of Tübingen & Tübingen AI Center, Tübingen, Germany; VIB-Neuroelectronics Research Flanders (NERF), Leuven, Belgium; Imec, Leuven, Belgium; Department of Computer Science, KU Leuven, Leuven, Belgium; Department of Electrical Engineering, KU Leuven, Leuven, Belgium
Jakob H. Macke
Machine Learning in Science, University of Tübingen & Tübingen AI Center, Tübingen, Germany; Max Planck Institute for Intelligent Systems, Tübingen, Germany; Corresponding author
Summary: Extracting the relationship between high-dimensional neural recordings and complex behavior is a ubiquitous problem in neuroscience. Encoding and decoding models target the conditional distribution of neural activity given behavior and vice versa, while dimensionality reduction techniques extract low-dimensional representations thereof. Variational autoencoders (VAEs) are flexible tools for inferring such low-dimensional embeddings but struggle to accurately model arbitrary conditional distributions such as those arising in neural encoding and decoding, let alone simultaneously. Here, we present a VAE-based approach for calculating such conditional distributions. We first validate our approach on a task with known ground truth. Next, we retrieve conditional distributions over masked body parts of walking flies. Finally, we decode motor trajectories from neural activity in a monkey-reach task and query the same VAE for the encoding distribution. Our approach unifies dimensionality reduction and learning conditional distributions, allowing the scaling of common analyses in neuroscience to today’s high-dimensional multi-modal datasets.