Research Article
MAGEFACE: Performative Conversion of Facial Characteristics into Speech Synthesis Parameters
@INPROCEEDINGS{10.1007/978-3-319-03892-6_21,
  author={Nicolas d’Alessandro and Maria Astrinaki and Thierry Dutoit},
  title={MAGEFACE: Performative Conversion of Facial Characteristics into Speech Synthesis Parameters},
  proceedings={Intelligent Technologies for Interactive Entertainment. 5th International ICST Conference, INTETAIN 2013, Mons, Belgium, July 3-5, 2013, Revised Selected Papers},
  proceedings_a={INTETAIN},
  year={2014},
  month={6},
  keywords={speech synthesis software library performative media streaming architecture HTS MAGE realtime audio software face tracking mapping},
  doi={10.1007/978-3-319-03892-6_21}
}
Nicolas d’Alessandro
Maria Astrinaki
Thierry Dutoit
Year: 2014
INTETAIN
Springer
DOI: 10.1007/978-3-319-03892-6_21
Abstract
In this paper, we illustrate the use of the MAGE performative speech synthesizer by applying it to the conversion of facial features, measured in realtime with FaceOSC, into speech synthesis parameters such as vocal tract shape or intonation. MAGE is a new software library for using HMM-based speech synthesis in reactive programming environments. It uses a rewritten version of the HTS engine that computes speech audio samples on a two-label window instead of the whole sentence. This windowed computation is what makes the realtime mapping of facial attributes to synthesis parameters possible.
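The idea in the abstract can be illustrated with a minimal Python sketch. It is not the MAGE or FaceOSC API: the function names, the assumption that the window advances one label at a time, the normalized "mouth openness" feature, and the pitch-shift range in semitones are all hypothetical choices made for illustration only.

```python
from typing import Iterator, List, Sequence, Tuple


def two_label_windows(labels: Sequence[str]) -> Iterator[Tuple[str, str]]:
    """Yield consecutive label pairs (assumed: the window slides by one label).

    Processing can start on the first pair without waiting for the
    whole sentence.
    """
    for i in range(len(labels) - 1):
        yield labels[i], labels[i + 1]


def map_range(value: float, in_lo: float, in_hi: float,
              out_lo: float, out_hi: float) -> float:
    """Linearly map value from [in_lo, in_hi] to [out_lo, out_hi].

    The input is clamped to [in_lo, in_hi] first.
    """
    clamped = min(max(value, in_lo), in_hi)
    t = (clamped - in_lo) / (in_hi - in_lo)
    return out_lo + t * (out_hi - out_lo)


def control_stream(labels: Sequence[str],
                   mouth_openness: Sequence[float]) -> List[Tuple[Tuple[str, str], float]]:
    """Pair each label window with a pitch shift derived from a face feature.

    mouth_openness is a hypothetical feature in [0, 1], one reading per
    window; the output range of -12 to +12 semitones is an arbitrary choice.
    """
    result = []
    for window, openness in zip(two_label_windows(labels), mouth_openness):
        pitch_shift = map_range(openness, 0.0, 1.0, -12.0, 12.0)
        result.append((window, pitch_shift))
    return result
```

Because each window covers only two labels, a change in the face feature can affect the next window rather than the next sentence, which is the property the abstract relies on for realtime control.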