Lipreading Using Liquid State Machine with STDP-Tuning

Xuhu Yu; Zhong Wan; Zehao Shi; Lei Wang

doi:10.3390/app122010484

Applied Sciences (Oct 2022)

Affiliations

Xuhu Yu: The College of Computer Science, National University of Defence Technology, Changsha 410073, China
Zhong Wan: The College of Computer Science, National University of Defence Technology, Changsha 410073, China
Zehao Shi: The College of Computer Science, National University of Defence Technology, Changsha 410073, China
Lei Wang: The College of Computer Science, National University of Defence Technology, Changsha 410073, China

DOI: https://doi.org/10.3390/app122010484
Journal volume & issue: Vol. 12, no. 20, p. 10484

Abstract

Lipreading is the task of decoding the text content of speech from visual information about the movement of the speaker’s lips. With the development of deep learning in recent years, lipreading has attracted extensive research. However, deep learning methods require substantial computing resources, which makes it difficult to migrate such systems to edge devices. Inspired by work on Spiking Neural Networks (SNNs) for recognizing human actions and gestures, we propose a lipreading system based on SNNs. Specifically, we construct the front-end feature extractor of the system using a Liquid State Machine (LSM), and we use a heuristic algorithm to select appropriate parameters for the back-end classifier. On small-scale lipreading datasets, our system achieves good recognition accuracy. We claim that, compared with other networks, our network performs better in terms of accuracy and ratio of learned parameters, and has advantages in network complexity and training cost. On the AVLetters dataset, our model achieves a 5% improvement in accuracy over traditional methods and a 90% reduction in parameters compared with the state of the art.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
