EURASIP Journal on Audio, Speech, and Music Processing (Jan 2009)

Lip-Synching Using Speaker-Specific Articulation, Shape and Appearance Models

  • Gaspard Breton,
  • Frédéric Elisei,
  • Oxana Govokhina,
  • Gérard Bailly

DOI: https://doi.org/10.1155/2009/769494
Journal volume & issue: Vol. 2009

Abstract


We describe the control, shape, and appearance models that are built using an original photogrammetric method to capture characteristics of speaker-specific facial articulation, anatomy, and texture. Two original contributions are put forward: a trainable trajectory formation model that predicts articulatory trajectories of a talking face from phonetic input, and a texture model that computes a texture for each 3D facial shape according to articulation. Using motion capture data from different speakers and module-specific evaluation procedures, we show that this cloning system restores detailed idiosyncrasies and the global coherence of visible articulation. Results of a subjective evaluation of the global system against competing trajectory formation models are also presented and discussed.
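To make the pipeline described in the abstract concrete, the following is a minimal sketch, assuming a linear articulation-to-shape model, a linear articulation-to-texture model, and a crude target-plus-smoothing stand-in for the trainable trajectory formation model. All names, dimensions, phoneme targets, and the smoothing scheme are illustrative assumptions, not the authors' actual speaker-specific models.

```python
# Hypothetical lip-synching pipeline sketch: phonetic input -> articulatory
# trajectory -> 3D shape + texture per frame. Models here are random stand-ins
# for ones that would be learned from motion-capture / photogrammetric data.
import numpy as np

N_ART = 6            # number of articulatory parameters (assumed)
N_VERT = 300         # number of 3D mesh vertices (assumed)
TEX_DIM = 64 * 64 * 3  # flattened texture size (assumed)

rng = np.random.default_rng(0)

# Speaker-specific linear shape and appearance models (placeholders).
mean_shape = rng.normal(size=3 * N_VERT)
shape_basis = rng.normal(size=(3 * N_VERT, N_ART))            # articulation -> shape
mean_texture = rng.uniform(size=TEX_DIM)
texture_basis = rng.normal(scale=0.01, size=(TEX_DIM, N_ART))  # articulation -> texture

# Per-phoneme articulatory targets (illustrative).
phone_targets = {p: rng.normal(scale=0.5, size=N_ART) for p in ["sil", "b", "a", "m"]}

def trajectory(phones, durations_ms, frame_ms=40, smooth=5):
    """Piecewise-constant targets per phoneme followed by moving-average
    smoothing: a simplistic stand-in for a trainable trajectory formation model."""
    frames = []
    for p, dur in zip(phones, durations_ms):
        frames += [phone_targets[p]] * max(1, round(dur / frame_ms))
    traj = np.array(frames)                      # (n_frames, N_ART)
    kernel = np.ones(smooth) / smooth
    return np.apply_along_axis(
        lambda col: np.convolve(col, kernel, mode="same"), 0, traj)

def render(articulation):
    """Map one articulatory frame to a 3D shape and a texture via linear models."""
    shape = (mean_shape + shape_basis @ articulation).reshape(N_VERT, 3)
    texture = np.clip(mean_texture + texture_basis @ articulation, 0.0, 1.0)
    return shape, texture

if __name__ == "__main__":
    art = trajectory(["sil", "b", "a", "m", "sil"], [120, 80, 160, 100, 120])
    for frame in art:
        shape, tex = render(frame)   # a real system would rasterize shape + texture
    print(f"{len(art)} frames, shape {shape.shape}, texture {tex.shape}")
```

The sketch only illustrates the data flow (phonetic input to per-frame shape and texture); the paper's contribution lies in training these modules on speaker-specific motion-capture and photogrammetric data so that idiosyncratic articulation and appearance are preserved.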