Intermediate Sensory Feedback Assisted Multi-Step Neural Decoding for Reinforcement Learning Based Brain-Machine Interfaces

Xiang Shen; Xiang Zhang; Yifan Huang; Shuhang Chen; Zhuliang Yu; Yiwen Wang

doi:10.1109/TNSRE.2022.3210700

IEEE Transactions on Neural Systems and Rehabilitation Engineering (Jan 2022)

Affiliations

Xiang Shen: Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Sai Kung, Hong Kong
Xiang Zhang: Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Sai Kung, Hong Kong
Yifan Huang: Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Sai Kung, Hong Kong
Shuhang Chen: Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Sai Kung, Hong Kong
Zhuliang Yu: School of Automation Science and Engineering, South China University of Technology and Pazhou Laboratory, Guangzhou, China
Yiwen Wang: Department of Electronic and Computer Engineering and Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Sai Kung, Hong Kong

DOI: https://doi.org/10.1109/TNSRE.2022.3210700
Journal volume & issue: Vol. 30, pp. 2834–2844

Abstract

Reinforcement-learning (RL)-based brain-machine interfaces (BMIs) translate dynamic neural activity into movement intention without requiring real limb movements from patients, which is promising for clinical applications. A movement task generally requires the subject to reach the target within one step and rewards the subject instantaneously. However, a real BMI scenario involves tasks that require multiple steps, during which sensory feedback is provided to indicate the status of the prosthesis, and the reward is given only at the end of the trial. In practice, subjects internally evaluate the sensory feedback to adjust their motor activity. Existing RL-BMI tasks have not fully utilized this internal evaluation of the sensory feedback by the brain to guide decoder training, and an effective tool for assigning credit in the multi-step decoding task is lacking. We first propose to extract intermediate guidance from the medial prefrontal cortex (mPFC) to assist the learning of multi-step decoding in an RL framework. To effectively explore the neural-action mapping in a large state-action space, a temporal difference (TD) method is incorporated into quantized attention-gated kernel reinforcement learning (QAGKRL), not only to assign credit over the temporal sequence of movements but also to discriminate spatially in the Reproducing Kernel Hilbert Space (RKHS). We test our approach on data collected from the primary motor cortex (M1) and the mPFC of rats while they use brain control to move a cursor to the target within multiple steps. Compared with models that utilize only the final reward, the intermediate evaluation interpreted from the mPFC improves the prediction accuracy by 10.9% on average across subjects, with faster convergence and greater stability. Moreover, our proposed algorithm further increases decoding accuracy by 18.2% compared with existing TD-RL methods. The results reveal the possibility of achieving better multi-step decoding performance for more complicated BMI tasks.
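
To make the credit-assignment idea in the abstract concrete, the following is a minimal sketch of one-step TD learning with a kernel-based Q-function on a toy multi-step reaching task. It is not the authors' QAGKRL implementation: the class `KernelTDQ`, the function `run_episode`, the 1-D track, all parameter values, and the hand-coded intermediate reward (a stand-in for the evaluation decoded from the mPFC) are assumptions made for illustration, and attention gating is omitted.

```python
import numpy as np

def gaussian_kernel(x, centers, sigma):
    """Gaussian kernel between one state vector x and each stored center."""
    d2 = np.sum((centers - x) ** 2, axis=1)
    return np.exp(-d2 / (2.0 * sigma ** 2))

class KernelTDQ:
    """Toy kernel Q-function with one-step TD updates (illustrative only)."""

    def __init__(self, n_actions, dim, sigma=0.3, lr=0.5, gamma=0.9, quant=0.1):
        self.n_actions, self.sigma = n_actions, sigma
        self.lr, self.gamma, self.quant = lr, gamma, quant
        self.centers = np.empty((0, dim))        # stored kernel centers
        self.weights = np.empty((0, n_actions))  # one weight row per center

    def q_values(self, x):
        if len(self.centers) == 0:
            return np.zeros(self.n_actions)
        return gaussian_kernel(x, self.centers, self.sigma) @ self.weights

    def update(self, x, action, reward, x_next, done):
        # TD target: bootstrap from the next state unless the trial ended.
        target = reward
        if not done:
            target += self.gamma * np.max(self.q_values(x_next))
        td_error = target - self.q_values(x)[action]
        # Quantization (assumed form): reuse a center within `quant` of x.
        if len(self.centers) > 0:
            dists = np.linalg.norm(self.centers - x, axis=1)
            j = int(np.argmin(dists))
            if dists[j] <= self.quant:
                self.weights[j, action] += self.lr * td_error
                return td_error
        new_row = np.zeros((1, self.n_actions))
        new_row[0, action] = self.lr * td_error
        self.centers = np.vstack([self.centers, x])
        self.weights = np.vstack([self.weights, new_row])
        return td_error

def run_episode(model, rng, use_intermediate, n_pos=5, max_steps=10, eps=0.2):
    """One trial on a 1-D track; returns True if the target was reached."""
    pos = 0
    for _ in range(max_steps):
        x = np.array([float(pos)])
        if rng.random() < eps:                       # explore
            action = int(rng.integers(2))
        else:                                        # exploit
            action = int(np.argmax(model.q_values(x)))
        step = 1 if action == 1 else -1
        new_pos = min(max(pos + step, 0), n_pos - 1)
        done = new_pos == n_pos - 1
        reward = 1.0 if done else 0.0                # final reward only
        if use_intermediate and not done:
            # Hypothetical intermediate evaluation: progress toward target.
            reward += 0.1 * (new_pos - pos)
        model.update(x, action, reward, np.array([float(new_pos)]), done)
        pos = new_pos
        if done:
            return True
    return False

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    model = KernelTDQ(n_actions=2, dim=1)
    hits = sum(run_episode(model, rng, use_intermediate=True) for _ in range(200))
    print("trials reaching the target:", hits)
```

Calling `run_episode` with `use_intermediate=False` gives the final-reward-only variant for comparison. The sketch only shows where an intermediate signal enters the TD target; it makes no claim about reproducing the accuracy figures reported above.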

Published in IEEE Transactions on Neural Systems and Rehabilitation Engineering

ISSN: 1534-4320 (Print); 1558-0210 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Medical technology; Medicine: Therapeutics. Pharmacology
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=7333
