A speech-driven embodied entrainment character system with a delayed voice back-channel based on negative emotional expression utterances

Makiko NISHIDA; Yutaka ISHII; Tomio WATANABE

doi:10.1299/transjsme.20-00104

Nihon Kikai Gakkai ronbunshu (Apr 2021)

A speech-driven embodied entrainment character system with a delayed voice back-channel based on negative emotional expression utterances

Makiko NISHIDA,
Yutaka ISHII,
Tomio WATANABE

Affiliations

Makiko NISHIDA: Graduate School of Computer Science and Systems Engineering, Okayama Prefectural University
Yutaka ISHII: Faculty of Computer Science and Systems Engineering, Okayama Prefectural University
Tomio WATANABE: Faculty of Computer Science and Systems Engineering, Okayama Prefectural University

DOI: https://doi.org/10.1299/transjsme.20-00104
Journal volume & issue: Vol. 87, no. 897
pp. 20-00104 – 20-00104

Abstract

Read online

The prior research includes the development of a speech-driven embodied entrainment computer-generated character called ”InterActor”, which automatically generates communicative motions and actions such as nods for entrained interaction from voice rhythm based on only speech input. However, the conventional InterActor generates only positive actions and back-channels without negative reactions, which may promote negative emotions of the user in the case of negative utterances such as self-denial. In this study, we developed an embodied character system with a delayed voice back-channel based on negative emotional expression utterances. Two experiments were performed to confirm the evaluation of back-channel feedback timing by sensory evaluation. In the first experiment, the timing of the voice back-channel to nodding motion was examined. In the second experiment, the timing of the voice back-channel to nodding motion in negative utterances was examined. As a result, it was shown that the timing of the voice back-channel delay of about 600ms was allowed from the start of the nodding motion estimated by InterActor. In negative utterances, the timing of allowance was about 900ms. Finally, we developed a prototype system based on the speaker’s emotional state using speech recognition.

Published in Nihon Kikai Gakkai ronbunshu

ISSN: 2187-9761 (Online)
Publisher: The Japan Society of Mechanical Engineers
Country of publisher: Japan
LCC subjects: Technology: Mechanical engineering and machinery; Technology: Engineering (General). Civil engineering (General): Engineering machinery, tools, and implements
Website: https://www.jsme.or.jp/publish/transact/

About the journal

Abstract

Keywords