Main.SpeakingStyleModelingOfVariousDiscourseGenresInHMM-BasedSpeechSynthesis History


February 24, 2011, at 11:52 AM by 129.102.64.91 -
Added lines 1-16:

Speaking Style Modeling of Various Discourse Genres in HMM-Based Speech Synthesis (with N. Obin)

  • This work presents an approach for modeling the speaking style of various discourse genres in speech synthesis.
  • The proposed approach relies on average, discourse-genre-dependent parametric models of speaking style: a phonological model and an acoustic model.
  • The phonological module models the average abstract prosodic structure of a specific discourse genre.
  • The acoustic module jointly models average speaking style voice and prosodic cues of a given discourse genre.
  • Discourse-genre-dependent speaking style models were estimated for four discourse genres and evaluated in a perceptual experiment on the prosodic identification of speaking style.
  • A comparison with speaking style identification on real speech is discussed and shows that the proposed approach performs consistently.
  1. Speaking Style Modeling of Various Discourse Genres in HMM-Based Speech Synthesis,
    N. Obin, P. Lanchantin, A. Lacheret-Dujour and X. Rodet,
    ICASSP 2011, Prague, Czech Republic, May 2011, Submitted
Samples(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/speaking_styles_samples.mp3 width=60 height=18:)
Prosodical stereotype(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/DISCOURS.302.norm.mp3 width=60 height=18:)(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/DISCOURS.738.norm.mp3 width=60 height=18:)(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/DISCOURS.1853.norm.mp3 width=60 height=18:)(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/DISCOURS.2054.norm.mp3 width=60 height=18:)
HTS(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/DISCOURS_JOURNAL.norm.mp3 width=60 height=18:)(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/DISCOURS_MESSE.norm.mp3 width=60 height=18:)(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/DISCOURS_POLITIQUE.norm.mp3 width=60 height=18:)(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/DISCOURS_SPORT.norm.mp3 width=60 height=18:)