Planning the development of text-to-speech synthesis models and datasets with dynamic deep learning

Hawraz A. Ahmad; Tarik A. Rashid

Journal of King Saud University: Computer and Information Sciences (Sep 2024)

Planning the development of text-to-speech synthesis models and datasets with dynamic deep learning

Hawraz A. Ahmad,
Tarik A. Rashid

Affiliations

Hawraz A. Ahmad: Software and Informatics Department, College of Engineering, Salahaddin University-Erbil, Erbil, Iraq
Tarik A. Rashid: Computer Science and Engineering Department, University of Kurdistan Hewlêr, Erbil, Iraq; Corresponding author.

Journal volume & issue: Vol. 36, no. 7
p. 102131

Abstract

Read online

Synthesis of Text-to-speech (TTS) is a process that involves translating a natural language text into a speech. Speech synthesisers face a major challenge when recognizing the prosodic elements of written text, such as intonation (the rise and fall of the voice in speaking), and length. In contrast, continuous speech features are influenced by the personality and emotions of the artist. A database is maintained to store the synthesized speech pieces. Its output is determined by how similar the person utters the words and how capable they are of being implied. In the past few years, the field of text-to-speech synthesis has been heavily impacted by the emergence of deep learning, an AI technology that has gained widespread popularity. This review paper presents a taxonomy of models and architectures that are based on deep learning and discusses the various datasets that are utilised in the TTS process. It also covers the evaluation matrices that are commonly used. The paper ends with a look at the future directions of the system and reaches to some Deep learning models that give promising results in this field.

Published in Journal of King Saud University: Computer and Information Sciences

ISSN: 1319-1578 (Print)
Publisher: Elsevier
Country of publisher: Saudi Arabia
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.journals.elsevier.com/journal-of-king-saud-university-computer-and-information-sciences/

About the journal

Abstract

Keywords