| BOOK | speechsyn96 [vSHOS96] |
| Editor | |
| Title | Progress in Speech Synthesis |
| Publisher | Springer-Verlag |
| Address | New York |
| Year | 1996 |
| Isbn | 0-387-94701-9 |
| amazon-url | http://www.amazon.de/exec/obidos/ASIN/0387947019 |
| Remarks | van Santen Author Links: http://www.bell-labs.com/project/tts/BOOK.html, Springer Heidelberg: http://www.springer.de/cgi-bin/search-book.pl?isbn=0-387-94701-9, Springer New-York: http://www.springer-ny.com/catalog/np/may96np/DATA/0-387-94701-9.html |
| ARTICLE | psola92 [VMT92] |
| Key | synthesis |
| Author | |
| Title | Voice transformation using PSOLA technique |
| Journal | speech |
| Year | 1992 |
| Month | June |
| Volume | 11 |
| Number | 2-3 |
| Pages | 189--194 |
| BOOK | chomsky68sound [CH68] |
| Author | |
| Title | The Sound Pattern of English |
| Publisher | Harper & Row |
| Address | New York, NY |
| Year | 1968 |
| ARTICLE | bailly1991 [BLS91] |
| Author | |
| Title | Formant trajectories as audible gestures: an alternative for speech synthesis. |
| Journal | Journal of Phonetics |
| Year | 1991 |
| Volume | 19 |
| Pages | 9--23 |
| INPROC. | soong88 [SR88] |
| Author | |
| Title | On the use of Instantaneous and Transitional Spectral Information in Speaker Recognition |
| Booktitle | IEEE Transactions on Acoustics, Speech and Signal Processing |
| Volume | 36 |
| Year | 1988 |
| Pages | 871--879 |
| Keywords | derivative of cepstrum |
| Remarks | cited in [MD97a] |
| INPROC. | griffin88 [GL88] |
| Author | |
| Title | Multiband Excitation Vocoder |
| Booktitle | IEEE Transactions on Acoustics, Speech and Signal Processing |
| Volume | 36 |
| Year | 1988 |
| Pages | 1123--1235 |
| Keywords | robust cepstrum by sinusoidal weighting |
| Remarks | cited in [MD97a] |
| INPROC. | allessandro95 [dM95] |
| Author | |
| Title | Automatic pitch contour stylization using a model of tonal perception |
| Booktitle | Computer Speech and Language |
| Year | 1995 |
| Pages | 257--288 |
| Keywords | perceptual stylization, based on a model of tonal perception |
| Remarks | cited in [MD97a] |
| INPROC. | traber92 [Tra92] |
| Author | |
| Title | F0 Generation with a Database of Natural F0 Patterns and with a Neural Network |
| Booktitle | Talking Machines: Theories, Models, and Designs |
| Editor | |
| Publisher | North Holland |
| Year | 1992 |
| Pages | 287--304 |
| Remarks | cited in [MD97a]: machine learning techniques: multilayer perceptrons |
| INPROC. | sagisaka92 [SK92] |
| Author | |
| Title | Optimization of Intonation Control Using Statistical F0 Resetting Characteristics |
| Booktitle | Proceedings of the International Conference on Acoustics |
| Volume | 2 |
| Publisher | Speech and Signal Processing |
| Year | 1992 |
| Pages | 49--52 |
| Remarks | cited in [MD97a]: machine learning techniques: linear regression |
| INPROC. | hirschberg91 [Hir91] |
| Author | |
| Title | Using Text Analysis to Predict Intonational Boundaries |
| Booktitle | Proceedings of Eurospeech |
| Location | Genova |
| Year | 1991 |
| Pages | 1275--1278 |
| INPROC. | moebius93 [MPH93] |
| Author | |
| Title | Analysis and Synthesis of German F0 Contours by Means of Fujisaki's Model |
| Booktitle | Speech Communication |
| Volume | 13 |
| Year | 1993 |
| Pages | 53--61 |
| INPROC. | sagisaka88 [Sag88] |
| Author | |
| Title | Speech synthesis by rule using an optimal selection of non-uniform synthesis units |
| Booktitle | Proc. of the Int'l Conf. on Acoustics, Speech, and Signal Processing |
| Year | 1988 |
| Pages | 679 |
| Remarks | (origin of unit selection?), cited in [MCW98]: since the late 1980's, selection-based concatenative synthesis from large databases has received increased interest as a potential improvement upon fixed diphone inventories. TO BE FOUND |
| INPROC. | wang93 [WCIS93] |
| Author | |
| Title | Tree-based unit selection for English speech synthesis |
| Booktitle | Proc. of the Int'l Conf. on Acoustics, Speech, and Signal Processing |
| Year | 1993 |
| Pages | 191--194 |
| Remarks | cited in [MCW98, CM98]: clustering and decision trees. TO BE FOUND |
| INPROC. | nakajima94 [Nak94] |
| Author | |
| Title | Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering |
| Booktitle | Speech Communication |
| Volume | 14 |
| Month | September |
| Year | 1994 |
| Pages | 313 |
| Remarks | cited in [MCW98, CM98]: clustering and decision trees. TO BE FOUND |
| PHDTHESIS | donovan96 [Don96] |
| Author | |
| Title | Trainable Speech Synthesis |
| Type | PhD thesis |
| School | Cambridge University |
| Year | 1996 |
| Remarks | cited in [MCW98]: Mahalanobis distance |
| INPROC. | huang96 [HAea96] |
| Author | |
| Title | Whistler: A trainable text-to-speech system |
| Booktitle | Proc. of the Int'l Conf. on Spoken Language Processing |
| Year | 1996 |
| Pages | 2387--2390 |
| Remarks | cited in [MCW98]: decision trees for speech synthesis |
| INPROC. | karaali96 [KCG96] |
| Author | |
| Title | Speech Synthesis with Neural Networks |
| Booktitle | Proc. of World Congress on Neural Networks |
| Month | September |
| Year | 1996 |
| Pages | 45--50 |
| Remarks | cited in [MCW98]: data driven direct mapping with NN |
| INPROC. | tuerk93 [TR] |
| Author | |
| Title | Speech synthesis using artificial neural networks trained on cepstral coefficients |
| Booktitle | Proc. EUROSPEECH |
| Pages | 1713--1716 |
| Remarks | cited in [MCW98]: data driven direct mapping with NN |
| BOOK | quackenbush88 [QBC88] |
| Author | |
| Title | Objective Measures of Speech Quality |
| Publisher | Prentice-Hall |
| Address | Englewood Cliffs, NJ |
| Year | 1988 |
| Remarks | cited in [MCW98]: distance measures for coding |
| INPROC. | nocerino85 [NSRK85] |
| Author | |
| Title | Comparative study of several distortion measures for speech recognition |
| Booktitle | Speech Communication |
| Volume | 4 |
| Year | 1985 |
| Pages | 317--331 |
| Remarks | cited in [MCW98]: distance measures for ASR |
| INPROC. | asp:icassp88 [HJ88] |
| Author | |
| Title | Optimization of perceptually-based ASR front-end |
| Booktitle | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing |
| Year | 1988 |
| Pages | 219 |
| Remarks | cited in [MCW98]: distance measures for ASR |
| INPROC. | ghitza97 [GS97] |
| Author | |
| Title | On the perceptual distance between two speech segments |
| Booktitle | Journal of the Acoustical Society of America |
| Year | 1997 |
| Volume | 101 |
| Pages | 522--529 |
| Number | 1 |
| Remarks | cited in [MCW98]: distance measures in general |
| INPROC. | hansen98 [HC98] |
| Author | |
| Title | An auditory-based distortion measure with application to concatenative speech synthesis |
| Booktitle | IEEE Trans. on Speech and Audio Processing |
| Volume | 6 |
| Month | September |
| Year | 1998 |
| Pages | 489--495 |
| Remarks | cited in [MCW98]: distance measures for concatenative speech synthesis |
| INPROC. | asp:itsa94 [HM94] |
| Author | |
| Title | RASTA processing of speech |
| Booktitle | IEEE Transactions on Speech and Acoustics |
| Volume | 2 |
| Month | October |
| Year | 1994 |
| Pages | 587--589 |
| Remarks | cited in [MCW98] |
| BOOK | edwards93 [Edw93] |
| Author | |
| Title | An Introduction to Linear Regression and Correlation |
| Publisher | W. H. Freeman and Co |
| Address | San Francisco |
| Year | 1993 |
| Remarks | cited in [MCW98]: Fisher transform |
| INPROC. | Ding_OptiUnit_EURO97 [DC97] |
| Author | |
| Title | Optimising Unit Selection with Voice Source and Formants in the CHATR Speech Synthesis System |
| Booktitle | Proc. Eurospeech '97 |
| Address | Rhodes, Greece |
| Month | September |
| Year | 1997 |
| Pages | 537--540 |
| Remarks | To BE FOUND! |