Bangla text normalization for text-to-speech synthesizer using machine learning algorithms

Md. Rezaul Islam; Arif Ahmad; Mohammad Shahidur Rahman

Journal of King Saud University: Computer and Information Sciences (Jan 2024)

Affiliations

Md. Rezaul Islam (corresponding author): Department of Computer Science and Engineering, Shahjalal University of Science and Technology, Sylhet 3114, Bangladesh
Arif Ahmad: Department of Computer Science and Engineering, Shahjalal University of Science and Technology, Sylhet 3114, Bangladesh
Mohammad Shahidur Rahman (corresponding author): Department of Computer Science and Engineering, Shahjalal University of Science and Technology, Sylhet 3114, Bangladesh

Journal volume & issue: Vol. 36, no. 1, p. 101807

Abstract

Text normalization (TN) for a text-to-speech (TTS) synthesizer is the transformation of non-standard words, such as times, ordinal numbers, equations, ranges, and dates, into standard words that match how they are pronounced. Text normalization is an essential part of every TTS synthesizer; without it, the speech generated by the synthesizer will be unintelligible. Because previous research has performed unsatisfactorily, this paper proposes a text normalization method for the Bangla language. First, a tokenized dataset labeled with semiotic classes is produced from a Bangla corpus using regular expressions. Then, a model is trained on these labeled tokens using the XGBClassifier algorithm. Next, the trained XGBClassifier model identifies the semiotic class of each token in a new Bangla text corpus. Finally, the method produces normalized text for each token by calling the class function that corresponds to the predicted class. This text normalization method will help Bangla TTS synthesizers produce more intelligible speech. The token classification accuracy of the method is 99.997%.
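The pipeline described in the abstract (regex-based labeling, class prediction, then dispatch to a per-class function) can be illustrated with a minimal Python sketch. Everything named here is an assumption for illustration: the class inventory, the patterns, and the placeholder class functions are hypothetical, since the abstract does not list them. To keep the sketch dependency-free, the regex labeler also stands in for the trained XGBClassifier at prediction time; in the paper, the trained model plays that role.

```python
import re

# Hypothetical semiotic classes and patterns, checked in order.
# In Python str patterns, \d also matches Bangla digits (০-৯).
PATTERNS = [
    ("DATE", re.compile(r"\d{1,2}/\d{1,2}/\d{4}")),
    ("TIME", re.compile(r"\d{1,2}:\d{2}")),
    ("RANGE", re.compile(r"\d+-\d+")),
    ("CARDINAL", re.compile(r"\d+")),
]


def regex_label(token):
    """Return the first class whose pattern matches the whole token."""
    for name, pattern in PATTERNS:
        if pattern.fullmatch(token):
            return name
    return "PLAIN"


def build_dataset(text):
    """Step 1: whitespace-tokenize and attach a regex-derived class label."""
    return [(tok, regex_label(tok)) for tok in text.split()]


def _tagged(name):
    """Placeholder class function: marks the token instead of verbalizing it."""
    return lambda token: f"<{name}:{token}>"


# Hypothetical class functions; real ones would spell each token out in Bangla.
CLASS_FUNCTIONS = {
    "DATE": _tagged("DATE"),
    "TIME": _tagged("TIME"),
    "RANGE": _tagged("RANGE"),
    "CARDINAL": _tagged("CARDINAL"),
    "PLAIN": lambda token: token,  # standard words pass through unchanged
}


def normalize(text, predict_class=regex_label):
    """Predict a class per token, then call that class's function.

    predict_class is pluggable: a trained classifier's prediction function
    could be passed in place of the regex stand-in used by default.
    """
    return " ".join(CLASS_FUNCTIONS[predict_class(tok)](tok) for tok in text.split())
```

Keeping `predict_class` as a parameter separates classification from verbalization, so the labeling rules used to build the training data and the model used at inference time can differ without touching the class functions.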

Published in Journal of King Saud University: Computer and Information Sciences

ISSN: 1319-1578 (Print)
Publisher: Elsevier
Country of publisher: Saudi Arabia
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.journals.elsevier.com/journal-of-king-saud-university-computer-and-information-sciences/
