Rethinking the Methods and Algorithms for Inner Speech Decoding and Making Them Reproducible

Foteini Simistira Liwicki; Vibha Gupta; Rajkumar Saini; Kanjar De; Marcus Liwicki

doi:10.3390/neurosci3020017

NeuroSci (Apr 2022)

Rethinking the Methods and Algorithms for Inner Speech Decoding and Making Them Reproducible

Foteini Simistira Liwicki,
Vibha Gupta,
Rajkumar Saini,
Kanjar De,
Marcus Liwicki

Affiliations

Foteini Simistira Liwicki: Embedded Intelligent Systems LAB, Machine Learning, Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, 97187 Luleå, Sweden
Vibha Gupta: Embedded Intelligent Systems LAB, Machine Learning, Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, 97187 Luleå, Sweden
Rajkumar Saini: Embedded Intelligent Systems LAB, Machine Learning, Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, 97187 Luleå, Sweden
Kanjar De: Embedded Intelligent Systems LAB, Machine Learning, Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, 97187 Luleå, Sweden
Marcus Liwicki: Embedded Intelligent Systems LAB, Machine Learning, Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, 97187 Luleå, Sweden

DOI: https://doi.org/10.3390/neurosci3020017
Journal volume & issue: Vol. 3, no. 2
pp. 226 – 244

Abstract

Read online

This study focuses on the automatic decoding of inner speech using noninvasive methods, such as Electroencephalography (EEG). While inner speech has been a research topic in philosophy and psychology for half a century, recent attempts have been made to decode nonvoiced spoken words by using various brain–computer interfaces. The main shortcomings of existing work are reproducibility and the availability of data and code. In this work, we investigate various methods (using Convolutional Neural Network (CNN), Gated Recurrent Unit (GRU), Long Short-Term Memory Networks (LSTM)) for the detection task of five vowels and six words on a publicly available EEG dataset. The main contributions of this work are (1) subject dependent vs. subject-independent approaches, (2) the effect of different preprocessing steps (Independent Component Analysis (ICA), down-sampling and filtering), and (3) word classification (where we achieve state-of-the-art performance on a publicly available dataset). Overall we achieve a performance accuracy of 35.20% and 29.21% when classifying five vowels and six words, respectively, in a publicly available dataset, using our tuned iSpeech-CNN architecture. All of our code and processed data are publicly available to ensure reproducibility. As such, this work contributes to a deeper understanding and reproducibility of experiments in the area of inner speech detection.

Published in NeuroSci

ISSN: 2673-4087 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: https://www.mdpi.com/journal/neurosci

About the journal

Abstract

Keywords