Speech Synthesis

Previous

Contents

Next

10 Speech Synthesis

BOOK	speechsyn96 [vSHOS96]
Editor	J.P.H. van Santen, J. Hirschberg, J. Olive, R. Sproat
Title	Progress in Speech Synthesis
Publisher	Springer-Verlag
Address	New York
Year	1996
Isbn	0-387-94701-9
amazon-url	`http://www.amazon.de/exec/obidos/ASIN/0387947019`
Remarks	van Santen Author Links: `http://www.bell-labs.com/project/tts/BOOK.html`, Springer Heidelberg: `http://www.springer.de/cgi-bin/search-book.pl?isbn=0-387-94701-9`, Springer New-York: `http://www.springer-ny.com/catalog/np/may96np/DATA/0-387-94701-9.html`

ARTICLE	psola92 [VMT92]
Key	synthesis
Author	H. Valbret, E. Moulines, J. P. Tubach
Title	Voice transformation using PSOLA technique
Journal	speech
Year	1992
Month	June
Volume	11
Number	2-3
Pages	189--194

BOOK	chomsky68sound [CH68]
Author	N. Chomsky, M. Halle
Title	The Sound Pattern of English
Publisher	Harper & Row
Address	New York, NY
Year	1968

ARTICLE	bailly1991 [BLS91]
Author	G. Bailly, R. Laboissière, J. L. Schwartz
Title	Formant trajectories as audible gestures: an alternative for speech synthesis.
Journal	Journal of Phonetics
Year	1991
Volume	19
Pages	9--23

INPROC.	soong88 [SR88]
Author	F.K. Soong, A.E. Rosenberg
Title	On the use of Instantaneous and Transitional Spectral Information in Speaker Recognition
Booktitle	IEEE Transactions on Acoustics, Speech and Signal Processing
Volume	36
Year	1988
Pages	871--879
Keywords	derivative of cepstrum
Remarks	cited in [MD97a]

INPROC.	griffin88 [GL88]
Author	D.W. Griffin, J.S. Lim
Title	Multiband Excitation Vocoder
Booktitle	IEEE Transactions on Acoustics, Speech and Signal Processing
Volume	36
Year	1988
Pages	1123--1235
Keywords	robust cepstrum by sinusoidal weighting
Remarks	cited in [MD97a]

INPROC.	allessandro95 [dM95]
Author	C. d'Alessandro, P. Mertens
Title	Automatic pitch contour stylization using a model of tonal perception
Booktitle	Computer Speech and Language
Year	1995
Pages	257--288
Keywords	perceptual stylization, based on a model of tonal perception
Remarks	cited in [MD97a]

INPROC.	traber92 [Tra92]
Author	C. Traber
Title	F0 Generation with a Database of Natural F0 Patterns and with a Neural Network
Booktitle	Talking Machines: Theories, Models, and Designs
Editor	G. Bailly, C. Benot
Publisher	North Holland
Year	1992
Pages	287--304
Remarks	cited in [MD97a]: machine learning techniques: multilayer perceptrons

INPROC.	sagisaka92 [SK92]
Author	Y. Sagisaka, N. Kaiki
Title	Optimization of Intonation Control Using Statistical F0 Resetting Characteristics
Booktitle	Proceedings of the International Conference on Acoustics
Volume	2
Publisher	Speech and Signal Processing
Year	1992
Pages	49--52
Remarks	cited in [MD97a]: machine learning techniques: linear regression

INPROC.	hirschberg91 [Hir91]
Author	J. Hirschberg
Title	Using Text Analysis to Predict Intonational Boundaries
Booktitle	Proceedings of Eurospeech
Location	Genova
Year	1991
Pages	1275--1278

INPROC.	moebius93 [MPH93]
Author	B. Möbius, M. Pätzold, W. Hess
Title	Analysis and Synthesis of German F0 Contours by Means of Fujisaki's Model
Booktitle	Speech Communication
Volume	13
Year	1993
Pages	53--61

INPROC.	sagisaka88 [Sag88]
Author	Y. Sagisaka
Title	Speech synthesis by rule using an optimal selection of non-uniform synthesis units
Booktitle	Proc. of the Int'l Conf. on Acoustics, Speech, and Signal Processing
Year	1988
Pages	679
Remarks	(origin of unit selection?), cited in [MCW98]: since the late 1980's, selection-based concatenative synthesis from large databases has received increased interest as a potential improvement upon fixed diphone inventories. TO BE FOUND

INPROC.	wang93 [WCIS93]
Author	W. J. Wang, W. N. Campbell, N. Iwahashi, Y. Sagisaka
Title	Tree-based unit selection for English speech synthesis
Booktitle	Proc. of the Int'l Conf. on Acoustics, Speech, and Signal Processing
Year	1993
Pages	191--194
Remarks	cited in [MCW98, CM98]: clustering and decision trees. TO BE FOUND

INPROC.	nakajima94 [Nak94]
Author	S. Nakajima
Title	Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering
Booktitle	Speech Communication
Volume	14
Month	September
Year	1994
Pages	313
Remarks	cited in [MCW98, CM98]: clustering and decision trees. TO BE FOUND

PHDTHESIS	donovan96 [Don96]
Author	R. E. Donovan
Title	Trainable Speech Synthesis
Type	PhD thesis
School	Cambridge University
Year	1996
Remarks	cited in [MCW98]: Mahalanobis distance

INPROC.	huang96 [HAea96]
Author	X. D. Huang, A. Acero, et al.
Title	Whistler: A trainable text-to-speech system
Booktitle	Proc. of the Int'l Conf. on Spoken Language Processing
Year	1996
Pages	2387--2390
Remarks	cited in [MCW98]: decision trees for speech synthesis

INPROC.	karaali96 [KCG96]
Author	O. Karaali, G. Corrigan, I. Gerson
Title	Speech Synthesis with Neural Networks
Booktitle	Proc. of World Congress on Neural Networks
Month	September
Year	1996
Pages	45--50
Remarks	cited in [MCW98]: data driven direct mapping with NN

INPROC.	tuerk93 [TR]
Author	C. Tuerk, T. Robinson
Title	Speech synthesis using artificial neural networks trained on cepstral coefficients
Booktitle	Proc. EUROSPEECH
Pages	1713--1716
Remarks	cited in [MCW98]: data driven direct mapping with NN

BOOK	quackenbush88 [QBC88]
Author	S. R. Quackenbush, T. P. Barnwell, M. A. Clements
Title	Objective Measures of Speech Quality
Publisher	Prentice-Hall
Address	Englewood Cliffs, NJ
Year	1988
Remarks	cited in [MCW98]: distance measures for coding

INPROC.	nocerino85 [NSRK85]
Author	N. Nocerino, F. K. Soong, L. R. Rabiner, D. H Klatt
Title	Comparative study of several distortion measures for speech recognition
Booktitle	Speech Communication
Volume	4
Year	1985
Pages	317--331
Remarks	cited in [MCW98]: distance measures for ASR

INPROC.	asp:icassp88 [HJ88]
Author	H. Hermansky, J. C. Junqua
Title	Optimization of perceptually-based ASR front-end
Booktitle	Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
Year	1988
Pages	219
Remarks	cited in [MCW98]: distance measures for ASR

INPROC.	ghitza97 [GS97]
Author	O. Ghitza, M. M. Sondhi
Title	On the perceptual distance between two speech segments
Booktitle	Journal of the Acoustical Society of America
Year	1997
Volume	101
Pages	522--529
Number	1
Remarks	cited in [MCW98]: distance measures in general

INPROC.	hansen98 [HC98]
Author	J. H. L. Hansen, D. T. Chappell
Title	An auditory-based distortion measure with application to concatenative speech synthesis
Booktitle	IEEE Trans. on Speech and Audio Processing
Volume	6
Month	September
Year	1998
Pages	489--495
Remarks	cited in [MCW98]: distance measures for concatenative speech synthesis

INPROC.	asp:itsa94 [HM94]
Author	H. Hermansky, N. Morgan
Title	RASTA processing of speech
Booktitle	IEEE Transactions on Speech and Acoustics
Volume	2
Month	October
Year	1994
Pages	587--589
Remarks	cited in [MCW98]

BOOK	edwards93 [Edw93]
Author	A. L. Edwards
Title	An Introduction to Linear Regression and Correlation
Publisher	W. H. Freeman and Co
Address	San Francisco
Year	1993
Remarks	cited in [MCW98]: Fisher transform

INPROC.	Ding_OptiUnit_EURO97 [DC97]
Author	Wen Ding, Nick Campbell
Title	Optimising Unit Selection with Voice Source and Formants in the CHATR Speech Synthesis System
Booktitle	Proc. Eurospeech '97
Address	Rhodes, Greece
Month	September
Year	1997
Pages	537--540
Remarks	To BE FOUND!

Previous

Contents

Next