FST-Based Pronunciation Lexicon Compression for Speech Engines

Žiga Golob; Jerneja Žganec Gros; Mario Žganec; Boštjan Vesnicer; Simon Dobrišek

doi:10.5772/52795

International Journal of Advanced Robotic Systems (Nov 2012)

FST-Based Pronunciation Lexicon Compression for Speech Engines

Žiga Golob,
Jerneja Žganec Gros,
Mario Žganec,
Boštjan Vesnicer,
Simon Dobrišek

Affiliations

Žiga Golob: Alpineon Research and Development Ltd., Ljubljana, Slovenia
Jerneja Žganec Gros: Alpineon Research and Development Ltd., Ljubljana, Slovenia
Mario Žganec: Alpineon Research and Development Ltd., Ljubljana, Slovenia
Boštjan Vesnicer: Alpineon Research and Development Ltd., Ljubljana, Slovenia
Simon Dobrišek: Faculty of Electrical Engineering, University of Ljubljana, Slovenia

DOI: https://doi.org/10.5772/52795
Journal volume & issue: Vol. 9

Abstract

Read online

Finite-state transducers are frequently used for pronunciation lexicon representations in speech engines, in which memory and processing resources are scarce. This paper proposes two possibilities for further reducing the memory footprint of finite-state transducers representing pronunciation lexicons. First, different alignments of grapheme and allophone transcriptions are studied and a reduction in the number of states of up to 30% is reported. Second, a combination of grapheme-to-allophone rules with a finite-state transducer is proposed, which yields a 65% smaller finite-state transducer than conventional approaches.

Published in International Journal of Advanced Robotic Systems

ISSN: 1729-8814 (Online)
Publisher: SAGE Publishing
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://journals.sagepub.com/home/arx

About the journal