Main.IrcamAlign History
Show minor edits - Show changes to output
February 24, 2011, at 11:49 AM
by
- February 24, 2011, at 11:49 AM
by
- Added lines 1-33:
!HMM-based Speech Segmentation
!!!ircamAlign
*''ircamAlign'' is a tool for speech segmentation useful to create database for speech synthesis.
*it is based on the HTK toolbox and LIAPHON french phonetizer
*available for French and English
*audio speech file and its textual transcription are taken as input
*linguistic structure is extracted from the text and aligned on the audio file by considering multi-pronunciation graph to model the dependencies between phonemes.
*if the text transcription is no available, a bi-gram language model is used
*phoneme are modelized by left-right HMM with 7 states.
*Confidence measure are computed at different linguistic level for easier manual correction
*HTS lab features format are directly created to allow the quick creation of new voices.
*[[http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/606_paper.pdf |Automatic Phoneme Segmentation With Relaxed Textual Constraints]],\\
P. Lanchantin, A. C. Morris X. Rodet and C. Veaux,\\
''LREC'08 Proceedings'', Marrakech, Marocco, 2008.
%float height=250px% http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/ircamAlign6b.jpg
!!!!Musical productions using ircamAlign
*''ircamAlign'' is used by composers and it has been used in several musical creations at IRCAM such as:
**''Com que voz'', Stefano Gervasoni, Thomas Goepfer
**''HyperMusic: Prologue'', Hector Parra, Thomas Goepfer
**''Häxan'', la sorcellerie à travers les âges, Mauro Lanza, Olivier Pasquet
**''Cantate égale pays'', Gérard Pesson, Sébastien Roux
**''Le père'', Michael Jarrel, Serge Lemouton
|| border=0
||!Hypermusic||(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/extract_all_rev.mp3 width=200 height=18:)||
* some examples [[musicalProduction| here]]
!!!ircamAlign
*''ircamAlign'' is a tool for speech segmentation useful to create database for speech synthesis.
*it is based on the HTK toolbox and LIAPHON french phonetizer
*available for French and English
*audio speech file and its textual transcription are taken as input
*linguistic structure is extracted from the text and aligned on the audio file by considering multi-pronunciation graph to model the dependencies between phonemes.
*if the text transcription is no available, a bi-gram language model is used
*phoneme are modelized by left-right HMM with 7 states.
*Confidence measure are computed at different linguistic level for easier manual correction
*HTS lab features format are directly created to allow the quick creation of new voices.
*[[http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/606_paper.pdf |Automatic Phoneme Segmentation With Relaxed Textual Constraints]],\\
P. Lanchantin, A. C. Morris X. Rodet and C. Veaux,\\
''LREC'08 Proceedings'', Marrakech, Marocco, 2008.
%float height=250px% http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/ircamAlign6b.jpg
!!!!Musical productions using ircamAlign
*''ircamAlign'' is used by composers and it has been used in several musical creations at IRCAM such as:
**''Com que voz'', Stefano Gervasoni, Thomas Goepfer
**''HyperMusic: Prologue'', Hector Parra, Thomas Goepfer
**''Häxan'', la sorcellerie à travers les âges, Mauro Lanza, Olivier Pasquet
**''Cantate égale pays'', Gérard Pesson, Sébastien Roux
**''Le père'', Michael Jarrel, Serge Lemouton
|| border=0
||!Hypermusic||(:flash http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/dewplayer.swf?son=http://recherche.ircam.fr/equipes/analyse-synthese/lanchant/uploads/Main/cursus/extract_all_rev.mp3 width=200 height=18:)||
* some examples [[musicalProduction| here]]