Compact Acoustic Models for Embedded Speech Recognition

Christophe Lévy; Georges Linarès; Jean-François Bonastre

doi:10.1155/2009/806186

EURASIP Journal on Audio, Speech, and Music Processing (Jan 2009)

DOI: https://doi.org/10.1155/2009/806186
Journal volume & issue: Vol. 2009

Abstract

Speech recognition applications are known to require a significant amount of resources. Embedded speech recognition, however, allows only a few KB of memory, a few MIPS, and a small amount of training data. To fit the resource constraints of embedded applications, an approach based on a semicontinuous HMM system using state-independent acoustic modelling is proposed. A transformation is computed and applied to the global model to obtain the probability density function of each HMM state, so that only the transformation parameters need to be stored. This approach is evaluated on two tasks: digit and voice-command recognition. A fast adaptation technique for acoustic models is also proposed. To significantly reduce computational costs, the adaptation is performed only on the global model (using related speaker recognition adaptation techniques), with no need for state-dependent data. The whole approach results in a relative gain of more than 20% compared to a basic HMM-based system fitting the constraints.
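
As a rough illustration of the semicontinuous idea described in the abstract, the Python sketch below assumes the simplest possible state-dependent transformation: each HMM state stores only a weight vector over one shared, state-independent Gaussian codebook. The abstract does not specify the transformation actually used, so this re-weighting scheme, the function names, and all numeric values are hypothetical and chosen only to show why storing per-state parameters on top of a global model can save memory.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))

# Global, state-independent codebook shared by every HMM state (toy values).
GLOBAL_MEANS = [[0.0, 0.0], [3.0, 3.0], [-3.0, 3.0]]
GLOBAL_VARS = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]

# Assumed per-state "transformation": one weight vector per state (toy values).
STATE_WEIGHTS = {
    "s1": [0.8, 0.1, 0.1],
    "s2": [0.1, 0.8, 0.1],
}

def state_log_likelihoods(x):
    """Score one feature frame against every state.

    Each shared Gaussian is evaluated once per frame and reused by all states.
    """
    comp = [log_gauss_diag(x, m, v) for m, v in zip(GLOBAL_MEANS, GLOBAL_VARS)]
    return {
        state: logsumexp([math.log(w) + c for w, c in zip(weights, comp)])
        for state, weights in STATE_WEIGHTS.items()
    }

def parameter_counts(n_states, n_gauss, dim):
    """Stored floats: (shared codebook + per-state weights, full GMM per state)."""
    shared = n_gauss * 2 * dim + n_states * n_gauss
    per_state = n_states * n_gauss * (2 * dim + 1)
    return shared, per_state
```

Under these assumptions, `parameter_counts(10, 4, 2)` compares the storage of a shared codebook plus per-state weights against a separate diagonal-covariance mixture for each state, and `state_log_likelihoods([0.1, -0.2])` returns one log-likelihood per state for a single frame.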

Published in EURASIP Journal on Audio, Speech, and Music Processing

ISSN: 1687-4722 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Science: Physics: Acoustics. Sound; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://asmp-eurasipjournals.springeropen.com
