Department of Electrical Engineering, Columbia University, New York, United States; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
Hassan Akbari
Department of Electrical Engineering, Columbia University, New York, United States; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
Bahar Khalighinejad
Department of Electrical Engineering, Columbia University, New York, United States; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
Jose L Herrero
Feinstein Institute for Medical Research, Manhasset, United States; Department of Neurosurgery, Hofstra-Northwell School of Medicine and Feinstein Institute for Medical Research, Manhasset, United States
Feinstein Institute for Medical Research, Manhasset, United States; Department of Neurosurgery, Hofstra-Northwell School of Medicine and Feinstein Institute for Medical Research, Manhasset, United States
Department of Electrical Engineering, Columbia University, New York, United States; Zuckerman Mind Brain Behavior Institute, Columbia University, New York, United States
Our understanding of nonlinear stimulus transformations by neural circuits is hindered by the lack of comprehensive yet interpretable computational modeling frameworks. Here, we propose a data-driven approach based on deep neural networks to directly model arbitrarily nonlinear stimulus-response mappings. Reformulating the exact function of a trained neural network as a collection of stimulus-dependent linear functions enables a locally linear receptive field interpretation of the neural network. Predicting the neural responses recorded invasively from the auditory cortex of neurosurgical patients as they listened to speech, this approach significantly improves the prediction accuracy of auditory cortical responses, particularly in nonprimary areas. Moreover, interpreting the functions learned by neural networks uncovered three distinct types of nonlinear transformations of speech that varied considerably from primary to nonprimary auditory regions. The ability of this framework to capture arbitrary stimulus-response mappings while maintaining model interpretability leads to a better understanding of cortical processing of sensory signals.