International Journal of Electronics and Telecommunications (Jun 2024)

Comparative analysis of natural and synthesized Polish speech

  • Michał Daniluk,
  • Agnieszka Paula Pietrzak

DOI
https://doi.org/10.24425/ijet.2024.149553
Journal volume & issue
Vol. vol. 70, no. No 2
pp. 361 – 366

Abstract

Read online

In the evolving field of speech synthesis, not only intelligibility, but also naturalness remains an important factor. This paper presents a comparative analysis of natural versus synthesized Polish speech. Speech synthesizers: Ivona, Mekatron, Notevibes, and ttsmp3 were explored. Four methods for assessing synthesized speech quality and comparing it to natural speech were presented: the AB test, MOS, logatom articulation test, and MUSHRA. Sentence databases and a database of logatoms were generated for each synthesizer and recorded for natural speech. Results indicated natural speech was consistently better than synthesized speech. Among the synthesizers, Notevibes performed best in all comparisons, while Mekatron ranked lowest.

Keywords