Deep Learning-based Speech Emotion Recognition: An Investigation into a sustainably Emotion-Speech Relationship

Pavithra Avvari; Ledalla Sukanya; Devi J. Sirisha; Dinesh Golla; Singh Monika; Reddy G. Vijendar

doi:10.1051/e3sconf/202343001091

E3S Web of Conferences (Jan 2023)

Deep Learning-based Speech Emotion Recognition: An Investigation into a sustainably Emotion-Speech Relationship

Pavithra Avvari,
Ledalla Sukanya,
Devi J. Sirisha,
Dinesh Golla,
Singh Monika,
Reddy G. Vijendar

Affiliations

Pavithra Avvari: Department of Information Technology, Gokaraju Rangaraju Institute of Engineering and Technology, JNTUH
Ledalla Sukanya: Department of Information Technology, Gokaraju Rangaraju Institute of Engineering and Technology, JNTUH
Devi J. Sirisha: Department of Information Technology, Gokaraju Rangaraju Institute of Engineering and Technology, JNTUH
Dinesh Golla: Department of Information Technology, Gokaraju Rangaraju Institute of Engineering and Technology, JNTUH
Singh Monika: Assistant professor, School of Applied and Life Sciences, Uttaranchal University
Reddy G. Vijendar: Department of Information Technology, Gokaraju Rangaraju Institute of Engineering and Technology, JNTUH

DOI: https://doi.org/10.1051/e3sconf/202343001091
Journal volume & issue: Vol. 430
p. 01091

Abstract

Read online

Speech Emotion Recognition (SER) poses a significant challenge with promising applications in psychology, speech therapy, and customer service. This research paper proposes the development of an SER system utilizing machine learning techniques, particularly deep learning and recurrent neural networks. The model will be trained on a carefully labeled dataset of diverse speech samples representing various emotions. By analyzing crucial audio features such as pitch, rhythm, and prosody, the system aims to achieve accurate emotion recognition for novel speech samples. The primary objective of this paper is to contribute to the advancement of SER by improving accuracy, reliability, and gaining deeper insights into establishing a sustainable complex relationship between emotions and speech. This innovative system has the potential to facilitate the practical implementation of emotion recognition technologies across multiple domains.

Published in E3S Web of Conferences

ISSN: 2267-1242 (Online)
Publisher: EDP Sciences
Country of publisher: France
LCC subjects: Geography. Anthropology. Recreation: Environmental sciences
Website: http://www.e3s-conferences.org/

About the journal