Nihon Kikai Gakkai ronbunshu (May 2024)

Real-time emotion identification system using voice information

  • Riki FUKUYOSHI,
  • Masashi NAKAYAMA

DOI
https://doi.org/10.1299/transjsme.23-00293
Journal volume & issue
Vol. 90, no. 933
pp. 23-00293 – 23-00293

Abstract

Read online

Conventional speech emotion identification often uses sentence units as analysis length generally. However, human emotions frequently change their emotions instantaneously when they hear a specific word or keyword that affects each speaker’s emotion, and it is important to capture more detailed emotional expressions for recognition of the emotion. We propose an emotion identification by using acoustic features that analyze speech at each frame, which are shorter than conventional units such as sentences and phrases for capturing and expressing actual emotion. Therefore, we propose a real-time emotion identification system that uses frames as the unit of analysis for acoustic features to the emotion in units of words and morphemes, which are shorter than conventional linguistic units.

Keywords