Semi-Supervised Training of Transformer and Causal Dilated Convolution Network with Applications to Speech Topic Classification

Jinxiang Zeng; Du Zhang; Zhiyi Li; Xiaolin Li

doi:10.3390/app11125712

Applied Sciences (Jun 2021)

Semi-Supervised Training of Transformer and Causal Dilated Convolution Network with Applications to Speech Topic Classification

Jinxiang Zeng,
Du Zhang,
Zhiyi Li,
Xiaolin Li

Affiliations

Jinxiang Zeng: School of Economics and Management, South China Normal University, Guangzhou 510006, China
Du Zhang: Faculty of Information Technology, Macau University of Science and Technology, Macau 999078, China
Zhiyi Li: School of Economics and Management, South China Normal University, Guangzhou 510006, China
Xiaolin Li: School of Economics and Management, South China Normal University, Guangzhou 510006, China

DOI: https://doi.org/10.3390/app11125712
Journal volume & issue: Vol. 11, no. 12
p. 5712

Abstract

Read online

Aiming at the audio event recognition problem of speech recognition, a decision fusion method based on the Transformer and Causal Dilated Convolutional Network (TCDCN) framework is proposed. This method can adjust the model sound events for a long time and capture the time correlation, and can effectively deal with the sparsity of audio data. At the same time, our dataset comes from audio clips cropped by YouTube. In order to reliably and stably identify audio topics, we extract different features and different loss function calculation methods to find the best model solution. The experimental results from different test models show that the TCDCN model proposed in this paper achieves better recognition results than the classification using neural networks and other fusion methods.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords