Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion

Gia-Nhu Nguyen; Trung-Nghia Phung

doi:10.1186/s13636-017-0113-5

EURASIP Journal on Audio, Speech, and Music Processing (Jun 2017)

Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion

Gia-Nhu Nguyen,
Trung-Nghia Phung

Affiliations

Gia-Nhu Nguyen: Duy Tan University
Trung-Nghia Phung: Thai Nguyen University of Information and Communication Technology

DOI: https://doi.org/10.1186/s13636-017-0113-5
Journal volume & issue: Vol. 2017, no. 1
pp. 1 – 7

Abstract

Read online

Abstract Speech synthesis has been applied in many kinds of practical applications. Currently, state-of-the-art speech synthesis uses statistical methods based on hidden Markov model (HMM). Speech synthesized by statistical methods can be considered over-smooth caused by the averaging in statistical processing. In the literature, there have been many studies attempting to solve over-smoothness in speech synthesized by an HMM. However, they are still limited. In this paper, a hybrid synthesis between HMM and exemplar-based voice conversion has been proposed. The experimental results show that the proposed method outperforms state-of-the-art HMM synthesis using global variance.

Published in EURASIP Journal on Audio, Speech, and Music Processing

ISSN: 1687-4722 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Science: Physics: Acoustics. Sound; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://asmp-eurasipjournals.springeropen.com

About the journal

Abstract

Keywords