Real-Time Speech Emotion Recognition Using a Pre-trained Image Classification Network: Effects of Bandwidth Reduction and Companding

Margaret Lech; Melissa Stolar; Christopher Best; Robert Bolia

doi:10.3389/fcomp.2020.00014

Frontiers in Computer Science (May 2020)

Real-Time Speech Emotion Recognition Using a Pre-trained Image Classification Network: Effects of Bandwidth Reduction and Companding

Margaret Lech,
Melissa Stolar,
Christopher Best,
Robert Bolia

Affiliations

Margaret Lech: School of Engineering, RMIT University, Melbourne, VIC, Australia
Melissa Stolar: School of Engineering, RMIT University, Melbourne, VIC, Australia
Christopher Best: Human Factors, Aerospace Division, Defence Science Technology Group, Melbourne, VIC, Australia
Robert Bolia: Human Factors, Aerospace Division, Defence Science Technology Group, Melbourne, VIC, Australia

DOI: https://doi.org/10.3389/fcomp.2020.00014
Journal volume & issue: Vol. 2

Abstract

Read online

This paper examines the effects of reduced speech bandwidth and the μ-low companding procedure used in transmission systems on the accuracy of speech emotion recognition (SER). A step by step description of a real-time speech emotion recognition implementation using a pre-trained image classification network AlexNet is given. The results showed that the baseline approach achieved an average accuracy of 82% when trained on the Berlin Emotional Speech (EMO-DB) data with seven categorical emotions. Reduction of the sampling frequency from the baseline 16–8 kHz (i.e., bandwidth reduction from 8 to 4 kHz, respectively) led to a decrease of SER accuracy by about 3.3%. The companding procedure on its own reduced the average accuracy by 3.8%, and the combined effect of companding and band reduction decreased the accuracy by about 7% compared to the baseline results. The SER was implemented in real-time with emotional labels generated every 1.033–1.026 s. Real-time implementation timelines are presented.

Published in Frontiers in Computer Science

ISSN: 2624-9898 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/computer-science#

About the journal

Abstract

Keywords