Main.TowardImprovedHMM-basedSpeechSynthesisUsingHigh-LevelSyntacticalFeatures History

February 24, 2011, at 11:51 AM by 129.102.64.91 -
!!!Toward Improved HMM-Based Speech Synthesis Using High-Level Syntactical Features (with N. Obin)
*A major drawback of current Hidden Markov Model-based speech synthesis is the monotony of the generated speech, which is closely related to the monotony of the generated prosody.
*This work presents a linguistically oriented approach in which high-level linguistic features are extracted from text in order to improve prosody modeling.
*A linguistic processing chain based on linguistic preprocessing, morpho-syntactical labeling, and syntactical parsing is used to extract high-level syntactical features from an input text.
*Rich linguistic features are then introduced into an HMM-based speech synthesis system to model prosodic variations (f0, duration, and spectral variations).
*Subjective evaluation reveals that the proposed approach significantly improves speech synthesis compared to a baseline model, although the improvement depends on the linguistic phenomenon observed.

* [[http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/speechpro2010.pdf | Toward Improved HMM-Based Speech Synthesis Using High-Level Syntactical Features]],\\
N. Obin, P. Lanchantin, M. Avanzi, A. Lacheret-Dujour and X. Rodet,\\
''Speech Prosody 2010 Proceedings'', Chicago, USA, 2010.

|| border=0
||!example 1||(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/lepetitpoucet.2.hts.morpho.mp3 width=60 height=18:)||(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/lepetitpoucet.2.1order.morpho.mp3 width=60 height=18:)||(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/lepetitpoucet.2.pg.morpho.mp3 width=60 height=18:)||
||!example 2||(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/PROUST_DUSSOLIER_110014.mp3 width=60 height=18:)||(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/PROUST_DUSSOLIER_110036.mp3 width=60 height=18:)||(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/PROUST_DUSSOLIER_110058.mp3 width=60 height=18:)||