Carlos Agon, Gérard Assayag, Olivier Delerue, and Camilo Rueda. Objects, Time and Constraints in OpenMusic. In Proceedings of the International Computer Music Conference (ICMC), Ann Arbor, Michigan, October 1998.

Gérard Assayag, Carlos Agon, Joshua Fineberg, and Peter Hanappe. An Object Oriented Visual Environment For Musical Composition. In Proceedings of the International Computer Music Conference (ICMC), Thessaloniki, Greece, 1997.

G. Assayag, C. Agon, and M. Stroppa. High Level Musical Control of Sound Synthesis in OpenMusic. In Proc. ICMC, Berlin, 2000.

G. Assayag, C. Agon, and M. Stroppa. High Level Musical Control of Sound Synthesis in OpenMusic. In Proc. ICMC, 2000.

Gérard Assayag, Carlos Agon, and Marco Stroppa. High Level Musical Control of Sound Synthesis in OpenMusic. In Proceedings of the International Computer Music Conference (ICMC), Berlin, August 2000.

Aldroubi and Eden. Wavelet analysis and its applications, volume 2, chapter Polynomial Spline and Wavelets. ???, ???

Marc Abrams, Constantinos Phanouriou, Alan L. Batongbacal, Stephen M. Williams, and Jonathan E. Shuster. UIML: an appliance-independent XML user interface language. Computer Networks (Amsterdam, Netherlands: 1999), 31(11--16):1695--1708, May 1999.

G. Assayag, C. Rueda, M. Laurson, C. Agon, and O. Delerue. Computer Assisted Composition at Ircam: PatchWork & OpenMusic. Computer Music Journal, 23(3), Fall 1999.

Gérard Assayag, Camilo Rueda, Mikael Laurson, Carlos Agon, and O. Delerue. Computer Assisted Composition at Ircam: PatchWork & OpenMusic. Computer Music Journal, 23(3), 1999.

Analysis--Synthesis Team / Équipe Analyse--Synthèse, IRCAM---Centre Georges Pompidou. WWW page, 1999.

Analysis--Synthesis Team / Équipe Analyse--Synthèse, IRCAM---Centre Georges Pompidou. WWW page, 2000.

Anthropic Signal Processing Group, Oregon Graduate Institute of Science and Technology. WWW page, 1999.

AT&T Labs, Oregon Graduate Institute of Science and Technology. WWW page, 1999.

Leo Breiman et al. Classification and Regression Trees. Chapman & Hall, New York, 1984. new edition of [BFOS84a]?

Roberto Battiti. Using the mutual information for selecting features in supervised neural net learning. IEEE Transactions on Neural Networks, 5(4):537--550, 1994.

A. W. Black and N. Campbell. Optimising selection of units from speech databases for concatenative synthesis. In Proc. Eurospeech '95, volume 1, pages 581--584, Madrid, Spain, September 1995.

G. Baudoin, J. Cernocký, and G. Chollet. Quantization of spectral sequences using variable length spectral segments for speech coding at very low bit rate. In Proc. EUROSPEECH 97, pages 1295--1298, Rhodes, Greece, September 1997.

Mark Beutnagel, Alistair Conkie, and Ann K. Syrdal. Diphone Synthesis using Unit Selection. In The 3rd ESCA/COCOSDA Workshop on Speech Synthesis, Jenolan Caves, Australia, November 1998. www [ATT99].

M. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, and A. Syrdal. The AT&T Next-Gen TTS System. In Joint Meeting of ASA, EAA, and DAGA, Berlin, Germany, March 1999. www [ATT99].

J. W. Beauchamp. Unix Workstation Software for Analysis, Graphics, Modification, and Synthesis of Musical Sounds. Proceedings of the Audio Engineering Society, 1993.

J. W. Beauchamp. Unix Workstation Software for Analysis, Graphics, Modification, and Synthesis of Musical Sounds. In Proc. AES, 1993.

James Beauchamp. Methods for measurement and manipulation of timbral physical correlates. 103(5):2966, 1998.

James Beauchamp, editor. The Sound of Music. Springer, New York, 2000.

Luciano Berio. Circles; Sequenza I, III, V. Mediathèque CD00008601, 1991. Cathy Berberian (voice), Francis Pierre (harp), Jean-Pierre Drouet, Jean-Claude Casadesus (percussion), Aurèle Nicolet (flute), Vinko Globokar (trombone).

L. Breiman, J. Friedman, R. Olshen, and C. Stone. Classification and Regression Trees. Wadsworth and Brooks, Monterey, CA, 1984. new edition [B+84]?

Leo Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. Statistics/Probability Series. Wadsworth Publishing Company, Belmont, California, U.S.A., 1984.

James Beauchamp and A. Horner. Piecewise Linear Approximation of Additive Synthesis Envelopes: A Comparison of Various Methods. 20(2):72--95.

James Beauchamp, A. Horner, and S. McAdams. Musical Sounds, Data Reduction, and Perceptual Control Parameters. In Program for SMPC95, Society for Music Perception and Cognition, pages 8--9, Univ. Calif. Berkeley, 1995. Center for New Music and Audio Technologies (CNMAT).

G. Bailly, R. Laboissière, and J. L. Schwartz. Formant trajectories as audible gestures: an alternative for speech synthesis. Journal of Phonetics, 19:9--23, 1991.

Barry W. Boehm. Software risk management. IEEE Computer Society Press, Washington, 1989.

Grady Booch. Object-Oriented Analysis and Design with Applications. Benjamin--Cummings, Redwood City, Calif., 2nd edition, 1994.

Alan Black and Paul Taylor. The Festival Speech Synthesis System: System Documentation (1.1.1). Technical Report HCRC/TR-83, Human Communication Research Centre, January 1997. www [CSTR99].

Alan W Black and Paul Taylor. Automatically clustering similar units for unit selection in speech synthesis. In Proc. Eurospeech '97, pages 601--604, Rhodes, Greece, September 1997. www [CSTR99] Electronic version: cstr/Black_1997_b.*.

Alan Black, Paul Taylor, and Richard Caley. The Festival Speech Synthesis System: System Documentation (1.3.1). Technical Report HCRC/TR-83, Human Communication Research Centre, December 1998. www [CSTR99].

N. Campbell. CHATR: A high-definition speech re-sequencing system. Acoustical Society of America and Acoustical Society of Japan, Third Joint Meeting, December 1996.

A. Chaudhary, A. Freed, and M. Wright. An Open Architecture for Real-time Music Software. In Proc. ICMC, Berlin, 2000.

Ngai-Man Cheung and Andrew Horner. Group Synthesis with Genetic Algorithms. 44(3):130--147.

N. Chomsky and M. Halle. The Sound Pattern of English. Harper & Row, New York, NY, 1968.

Y. T. Chan. Wavelet Basics. Kluwer Academic Publ., Boston, 1995.

Arun Chandra. Compositional experiments with concatenating distinct waveform periods while changing their structural properties. In SEAMUS'98, Urbana, IL, April 1998. School of Music, University of Illinois. Available online (footnote 26).

Jean-Marie Chauvet. Composants et transactions: COMMTS, CorbaOTS, JavaEJB, XML. Collection dirigée par Guy Hervier. Eyrolles: Informatiques magazine, Paris, France, 1999.

O. Cappé and E. Moulines. Regularization Techniques for Discrete Cepstrum Estimation. IEEE Signal Processing Letters, 3(4):100--102, April 1996.

Andrew E. Cronk and Michael W. Macon. Optimized Stopping Criteria for Tree-Based Unit Selection in Concatenative Synthesis. In Proc. of International Conference on Spoken Language Processing, volume 5, pages 1951--1955, November 1998. www [CSLU99].

O. Cappé, M. Oudot, and E. Moulines. Spectral Envelope Estimation using a Penalized Likelihood Criterion. In IEEE ASSP Workshop on App. of Sig. Proc. to Audio and Acoust., Mohonk, October 1997.

Robin Cover. The XML Cover Pages. WWW page, 2000.

CSLU Speech Synthesis Research Group, Oregon Graduate Institute of Science and Technology. WWW page, 1999.

Centre for Speech Technology Research, University of Edinburgh. WWW page, 1999.

John E. Clark and Colin Yallop. An Introduction to Phonetics and Phonology. Blackwell, Oxford, 1996.

Nick Campbell, Itoh Yoshiharu, Wen Ding, and Norio Higuchi. Factors affecting perceived quality and intelligibility in the CHATR concatenative speech synthesiser. In Proc. Eurospeech '97, pages 2635--2638, Rhodes, Greece, September 1997.

Wen Ding and Nick Campbell. Optimising unit selection with voice source and formants in the CHATR speech synthesis system. In Proc. Eurospeech '97, pages 537--540, Rhodes, Greece, September 1997.

François Déchelle, Maurizio De Cecco, Enzo Maggi, and Norbert Schnell. jMax Recent Developments. In Proceedings of the International Computer Music Conference, 1999.

F. Déchelle, M. DeCecco, E. Maggi, and N. Schnell. jMax Recent Developments. In Proc. ICMC, 1999.

François Déchelle, Maurizio DeCecco, Miller Puckette, and David Zicarelli. The IRCAM ``Real-Time Platform'': Evolution and Perspectives. In Proceedings of the International Computer Music Conference (ICMC), 1994. Available online (footnote 27).

N. Delprat, B. Escudié, P. Guillemain, R. Kronland-Martinet, Ph. Tchamitchian, and B. Torrésani. Asymptotic wavelet and Gabor analysis: Extraction of instantaneous frequency. 38(2):644--664, March 1992.

T. Dutoit and B. Gosselin. On the use of a hybrid harmonic/stochastic model for TTS synthesis by concatenation. In Speech Communication, number 19, pages 119--143, 1996.

Ph. Depalle, G. Garcia, and X. Rodet. Tracking of Partials for Additive Sound Synthesis Using Hidden Markov Models. In IEEE Trans., pages 225--228, April 1993. Abstract (footnote 28).

Ph. Depalle, G. Garcia, and X. Rodet. Tracking of Partials for Additive Sound Synthesis Using Hidden Markov Models. In IEEE Trans., pages 225--228, 1993.

Philippe Depalle, Guillermo García, and Xavier Rodet. A Virtual Castrato (!?). In Proceedings of the International Computer Music Conference (ICMC), 1994. Available online (footnote 29).

C. d'Alessandro and P. Mertens. Automatic pitch contour stylization using a model of tonal perception. In Computer Speech and Language, pages 257--288, 1995.

O. Deroo, F. Malfrere, and T. Dutoit. Comparison of two different alignment systems: speech synthesis vs. hybrid HMM/ANN. In Proc. European Conference on Signal Processing (EUSIPCO'98), pages 1161--1164, Greece, 1998. www [TCTS99], same content as [MDD98] (but fewer references).

T. Dutoit, F. Malfrère, V. Pagel, M. Bagein, P. Mertens, A. Ruelle, and A. Gilman. EULER: Multi-Lingual Text-to-Speech Project. In Petr Sojka, Václav Matousek, Karel Pala, and Ivan Kopecek, editors, Proceedings of the First Workshop on Text, Speech, Dialogue --- TSD'98, pages 27--32, Brno, Czech Republic, September 1998. Masaryk University Press. www [TCTS99] Electronic version: tcts/*.

Grzegorz Dogil. Phonetic correlates of word stress. AIMS Phonetik (Working Papers of the Department of Natural Language Processing), 2(2), 1995. Contents (footnote 30).

R. E. Donovan. Trainable Speech Synthesis. PhD thesis, Cambridge University, 1996.

T. Dutoit, V. Pagel, N. Pierret, F. Bataille, and O. V. der Vrecken. The MBROLA project: Towards a set of high quality speech synthesizers free of use for non commercial purposes. In Proc. ICSLP '96, volume 3, pages 1393--1396, Philadelphia, PA, October 1996.

Shlomo Dubnov and Xavier Rodet. Statistical Modeling of Sound Aperiodicities. In Proceedings of the International Computer Music Conference (ICMC), Thessaloniki, Greece, September 1997.

F. Déchelle, N. Schnell, R. Borghesi, and N. Orio. The jMax Environment: An Overview of New Features. In Proc. ICMC, Berlin, 2000.

François Déchelle, Norbert Schnell, Ricardo Borghesi, and Nicolas Orio. The jMax Environment: An Overview of New Features. In Proceedings of the International Computer Music Conference, Berlin, 2000.

Shlomo Dubnov, Naftali Tishby, and Dalia Cohen. Hearing Beyond the Spectrum. Journal of New Music Research, 24(4).

Bob DuCharme. XML: the annotated specification. The Charles F. Goldfarb series on open information management. Prentice-Hall PTR, Upper Saddle River, NJ 07458, USA, 1999.

T. Dutoit. High quality text-to-speech synthesis: a comparison of four candidate algorithms. In Proc. ICASSP '94, pages I--565--I--568, Adelaide, Australia, April 1994.

A. L. Edwards. An Introduction to Linear Regression and Correlation. W. H. Freeman and Co, San Francisco, 1993.

K. Fitz, L. Haken, and P. Christensen. A New Algorithm for Bandwidth Association in Bandwidth-Enhanced Additive Sound Modeling. In Proc. ICMC, Berlin, 2000.

K. Fitz, L. Haken, and P. Christensen. Transient Preservation under Transformation in an Additive Sound Model. In Proc. ICMC, Berlin, 2000.

Kelly Fitz, Lippold Haken, and Paul Christensen. A New Algorithm for Bandwidth Association in Bandwidth-Enhanced Additive Sound Modeling. In Proc. ICMC, Berlin, 2000.

Kelly Fitz, Lippold Haken, and Paul Christensen. Transient Preservation under Transformation in an Additive Sound Model. In Proceedings of the International Computer Music Conference, Berlin, 2000.

K. Fitz, L. Haken, and B. Holloway. Lemur -- A Tool for Timbre Manipulation. In Proceedings of the International Computer Music Conference, pages 158--161, Banff, September 1995.

K. Fitz, L. Haken, and B. Holloway. Lemur -- A Tool for Timbre Manipulation. In Proc. ICMC, 1995.

Anne Faure and Stephen McAdams. Comparaison de profils sémantiques et de l'espace perceptif de timbres musicaux. In Actes du 4ème Congrès Français d'Acoustique, Marseille, April 1997. Société Française d'Acoustique.

A. Freed, X. Rodet, and Ph. Depalle. Synthesis and Control of Hundreds of Sinusoidal Partials on a Desktop Computer without Custom Hardware. In ICSPAT, 1992.

Adrian Freed, Xavier Rodet, and Philippe Depalle. Synthesis and Control of Hundreds of Sinusoidal Partials on a Desktop Computer without Custom Hardware. In ICSPAT, 1992. Available online (footnote 31).

A. Freed, X. Rodet, and Ph. Depalle. Performance, Synthesis and Control of Additive Synthesis on a Desktop Computer Using FFT-1. In Proceedings of the 19th International Computer Music Conference, Waseda University Center for Scholarly Information, 1993. International Computer Music Association.

A. Freed, X. Rodet, and Ph. Depalle. Performance, Synthesis and Control of Additive Synthesis on a Desktop Computer Using FFT-1. In Proc. ICMC, 1993.

K. Fukunaga. Introduction to Statistical Pattern Recognition. Academic Press, 2nd edition, 1990.

Guillermo García. Pm: A library for additive analysis/transformation/synthesis, July 1994. Available online (footnote 32).

R. Gribonval, E. Bacry, S. Mallat, Ph. Depalle, and X. Rodet. Analysis of Sound Signals with High Resolution Matching Pursuit. In Proceedings of the IEEE Time--Frequency and Time--Scale Workshop (TFTS), 1996. www [AS00].

R. Gribonval, Ph. Depalle, X. Rodet, E. Bacry, and S. Mallat. Sound Signal Decomposition using a High Resolution Matching Pursuit. In Proceedings of the International Computer Music Conference (ICMC), August 1996. www [AS00].

Carlo Ghezzi, Mehdi Jazayeri, and Dino Mandrioli. Fundamentals of Software Engineering. Prentice--Hall, Englewood Cliffs, NJ, 1991.

Ph. Guillemain and R. Kronland-Martinet. Characterization of acoustic signals through continuous linear time--frequency representations. 84(4):561--585, April 1996.

D.W. Griffin and J.S. Lim. Multiband excitation vocoder. In IEEE Transactions on Acoustics, Speech and Signal Processing, volume 36, pages 1223--1235, 1988.

Thierry Galas and Xavier Rodet. An Improved Cepstral Method for Deconvolution of Source--Filter Systems with Discrete Spectra: Application to Musical Sound Signals. In Proceedings of the International Computer Music Conference (ICMC), Glasgow, September 1990.

Th. Galas and X. Rodet. Generalized Functional Approximation for Source--Filter System Modeling. In Proc. Eurospeech, 1991.

Thierry Galas and Xavier Rodet. Generalized Discrete Cepstral Analysis for Deconvolution of Source--Filter Systems with Discrete Spectra. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, October 1991.

Thierry Galas and Xavier Rodet. Generalized Functional Approximation for Source--Filter System Modeling. In Proc. Eurospeech, pages 1085--1088, Geneva, 1991.

O. Ghitza and M. M. Sondhi. On the perceptual distance between two speech segments. In Journal of the Acoustical Society of America, volume 101, pages 522--529, 1997.

X. D. Huang, A. Acero, et al. Whistler: A trainable text-to-speech system. In Proc. of the Int'l Conf. on Spoken Language Processing, pages 2387--2390, 1996.

R. W. Hamming. Digital Filters. Signal Processing Series. Prentice--Hall, 1977.

Richard Wesley Hamming. Digital Filters. Signal Processing Series. Prentice--Hall, Englewood Cliffs, 1977.

A. J. Hunt and A. W. Black. Unit selection in a concatenative speech synthesis system using a large speech database. In Proc. ICASSP '96, pages 373--376, Atlanta, GA, May 1996. www [CSTR99] Electronic version: cstr/Black_1996_a.s.*.

J. H. L. Hansen and D. T. Chappell. An auditory-based distortion measure with application to concatenative speech synthesis. In IEEE Trans. on Speech and Audio Processing, volume 6, pages 489--495, September 1998.

Nathalie Henrich. Synthèse de la voix chantée par règles. IRCAM, Paris, France, July 1998. Rapport de stage D.E.A. Acoustique, Traitement de Signal et Informatique Appliqués à la Musique.

Hynek Hermansky. Data-Driven Speech Analysis For ASR. In Petr Sojka, Václav Matousek, Karel Pala, and Ivan Kopecek, editors, Proceedings of the First Workshop on Text, Speech, Dialogue --- TSD'98, pages 213--218, Brno, Czech Republic, September 1998. Masaryk University Press.

H. Hermansky, B. A. Hanson, and H. Wakita. Perceptually based linear predictive analysis of speech. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 509--512, 1985.

J. Hirschberg. Using Text Analysis to Predict Intonational Boundaries. In Proceedings of Eurospeech, pages 1275--1278, 1991.

H. Hermansky and J. C. Junqua. Optimization of perceptually-based ASR front-end. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, page 219, 1988.

H. Hermansky and N. Morgan. RASTA processing of speech. In IEEE Transactions on Speech and Acoustics, volume 2, pages 587--589, October 1994.

J. N. Holmes. Formant synthesizers: Cascade or Parallel. In Speech Communication, volume 2, pages 251--273, 1983.

J. N. Holmes. Formant synthesizers: Cascade or Parallel. In Speech Communication, volume 2, 1983.

Barbara Burke Hubbard. The World According to Wavelets: The Story of a Mathematical Technique in the Making. A K Peters Ltd, 1997.

Michael A. Jackson. System development. Prentice--Hall International series in computer science. Prentice--Hall Intern., Englewood Cliffs, 1983.

Michael Jackson. Software requirements & specifications : a lexicon of practice, principles, and prejudices. Addison--Wesley, Wokingham, 1995.

Ivar Jacobson. Object-Oriented Software Engineering: a Use Case driven Approach. Addison--Wesley, Wokingham, England, 1995.

O. Karaali, G. Corrigan, and I. Gerson. Speech Synthesis with Neural Networks. In Proc. of World Congress on Neural Networks, pages 45--50, September 1996.

A. Kain and M. W. Macon. Personalizing a speech synthesizer by voice adaptation. In Proceedings of the 3rd ESCA/COCOSDA International Speech Synthesis Workshop, pages 225--230, November 1998. www [CSLU99].

A. Kain and M. W. Macon. Text-to-speech voice adaptation from sparse training data. In Proc. of International Conference on Spoken Language Processing, pages 2847--2850, November 1998. www [CSLU99].

Alexander Kain and Michael W Macon. Spectral voice conversion for text-to-speech synthesis. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'98), pages 285--288, 1998. www [CSLU99].

F. Kossentini, M. Macon, and M. Smith. Audio coding using variable-depth multistage quantization. 6, 1998. www [CSLU99].

Scott N. Levine. Audio Representations for Data Compression and Compressed Domain Processing. Ph.D. dissertation, Department of Electrical Engineering, CCRMA, Stanford University, December 1998.

Adam Lindsay. MPEG-7 Audio FAQ. WWW page, 1998. moved to [TPMAS98].

Michael W. Macon. Speech synthesis based on sinusoidal modeling. PhD thesis, Georgia Institute of Technology, October 1996.

Stephane Mallat. A Wavelet Tour of Signal Processing. AP Professional, London, 1997.

M. W. Macon and M. A. Clements. Speech synthesis based on an overlap-add sinusoidal model. In J. of the Acoustical Society of America, volume 97, page 3246. Pt. 2, May 1995. www [CSLU99].

Michael W. Macon and Mark A. Clements. Speech Concatenation and Synthesis Using an Overlap--Add Sinusoidal Model. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96), volume 1, pages 361--364, Atlanta, USA, 1996. www [CSLU99].

M. W. Macon and M. A. Clements. Sinusoidal modeling and modification of unvoiced speech. In IEEE Transactions on Speech and Audio Processing, volume 5, pages 557--560, November 1997. www [CSLU99].

M. W. Macon, A. E. Cronk, and J. Wouters. Generalization and discrimination in tree-structured unit selection. In Proceedings of the 3rd ESCA/COCOSDA International Speech Synthesis Workshop, November 1998. www [CSLU99].

M. W. Macon, A. E. Cronk, J. Wouters, and A. Kain. Ogireslpc: Diphone synthesizer using residual-excited linear prediction. In Tech. Rep. CSE-97-007. Department of Computer Science, Oregon Graduate Institute of Science and Technology, Portland, OR, September 1997. www [CSLU99].

F. Malfrere and T. Dutoit. Speech synthesis for text-to-speech alignment and prosodic feature extraction. In Proc. ISCAS 97, pages 2637--2640, Hong-Kong, 1997. www [TCTS99].

Fabrice Malfrere and Thierry Dutoit. High quality speech synthesis for phonetic speech segmentation. In Proc. Eurospeech '97, pages 2631--2634, Rhodes, Greece, September 1997.

F. Malfrere, O. Deroo, and T. Dutoit. Phonetic alignment: Speech synthesis based vs. hybrid HMM/ANN. In Proc. International Conference on Speech and Language Processing, pages 1571--1574, Sydney, Australia, 1998. www [TCTS99], same content as [DMD98] (with more references).

Jason Meldrum. The Z--Transform, 1997. Online tutorial (footnote 33).

J.D. Markel and A.H. Gray. Linear Prediction of Speech. Springer, 1980.

M. W. Macon, L. Jensen-Link, J. Oliverio, M. Clements, and E. B. George. Concatenation-based midi-to-singing voice synthesis. In 103rd Meeting of the Audio Engineering Society. New York, 1997. www [CSLU99].

Michael Macon, Leslie Jensen-Link, James Oliverio, Mark A. Clements, and E. Bryan George. A singing voice synthesis system based on sinusoidal modeling. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'97), pages 435--438, 1997. www [CSLU99].

M. W. Macon, A. Kain, A. E. Cronk, H. Meyer, K. Mueller, B. Saeuberlich, and A. W. Black. Rapid prototyping of a German TTS system. In Tech. Rep. CSE-98-015. Department of Computer Science, Oregon Graduate Institute of Science and Technology, Portland, OR, September 1998. www [CSLU99].

M. W. Macon, A. McCree, W. M. Lai, and V. Viswanathan. Efficient analysis/synthesis of percussion musical instrument sounds using an all-pole model. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, volume 6, pages 3589--3592. Speech, May 1998. www [CSLU99].

B. C. J. Moore. An Introduction to the Psychology of Hearing. Academic Press Limited, 3rd edition, 1989.

G. A. Merchant and T. W. Parks. Efficient Solution of a Toeplitz--plus Hankel Coefficient Matrix System of Equations. In IEEE TASSP, volume 30, pages 40--44, February 1982.

MPEG-7 ``Multimedia Content Description Interface'' Documentation. WWW page, 1999.

B. Möbius, M. Pätzold, and W. Hess. Analysis and Synthesis of German F0 Contours by Means of Fujisaki's Model. In Speech Communication, volume 13, pages 53--61, 1993.

S. Mallat and S. Zhong. Characterization of Signals from Multiscale Edges. IEEE Trans. Pattern Anal. Machine Intell., 40(7):2464--2482, July 1992.

Manfred Nagl. Softwaretechnik: methodisches Programmieren im Großen. Springer compass. Springer, Berlin, 1990.

S. Nakajima. Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering. In Speech Communication, volume 14, page 313, September 1994.

H. J. Nock, M. J. F. Gales, and Steve Young. A comparative study of methods for phonetic decision-tree state clustering. In Proc. Eurospeech '97, volume 1, pages 111--114, Rhodes, Greece, September 1997.

N. Nocerino, F. K. Soong, L. R. Rabiner, and D. H Klatt. Comparative study of several distortion measures for speech recognition. In Speech Communication, volume 4, pages 317--331, 1985.

Jörn Ostermann, Mark C. Beutnagel, Ariel Fischer, and Yao Wang. Integration of talking heads and text-to-speech synthesizers for visual TTS. In Proc. ICSLP98, 1998. www [ATT99].

M. Oudot, O. Cappé, and E. Moulines. Robust Estimation of the Spectral Envelope for ``Harmonics+Noise'' Models. In IEEE Workshop on Speech coding, Pocono Manor, September 1997.

Alan V. Oppenheim, editor. Applications of Digital Signal Processing, chapter Digital Processing of Speech, pages 117--168. Prentice--Hall, 1978.

Alan V. Oppenheim and Ronald W. Schafer. Digital Signal Processing. Prentice--Hall, 1975.

John Oswald. Plexure. CD, 1993.

John Oswald. Plunderphonics. WWW page, 1999, esp. [Osw93].

M. Campedel Oudot. Étude du modèle sinusoïdes et bruit pour le traitement de la parole. Estimation robuste de l'enveloppe spectrale. Thèse, ENST, Paris, 1998.

Marine Campedel Oudot. Étude du modèle ``sinusoïdes et bruit'' pour le traitement de la parole. Estimation robuste de l'enveloppe spectrale. Thèse, Ecole Nationale Supérieure des Télécommunications, Paris, France, November 1998.

G. Peeters. Analyse-Synthèse des sons musicaux par la méthode PSOLA. Agelonde (France), May 1998.

G. Peeters and X. Rodet. Sinusoidal versus Non-Sinusoidal Signal Characterisation. Barcelona, November 1998.

G. Peeters and X. Rodet. Non-Stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum. Orlando, November 1999.

G. Peeters and X. Rodet. SINOLA: A New Analysis/Synthesis Method using Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum. In Proceedings of the International Computer Music Conference (ICMC), Beijing, October 1999.

Miller Puckette. Combining Event and Signal Processing in the MAX Graphical Programming Environment. Computer Music Journal, 15(3):68--77, Winter 1991. Available online (footnote 34).

Miller Puckette. FTS: A Real-Time Monitor for Multiprocessor Music Synthesis. Computer Music Journal, 15(3):58--67, Winter 1991. Available online (footnote 35).

W. J. Pielemeier and G. H. Wakefield. A High Resolution Time--Frequency Representation for Musical Instrument Signals. J. Acoust. Soc. Am., 99(4):2382--2396, 1996.

S. R. Quackenbush, T. P. Barnwell, and M. A. Clements. Objective Measures of Speech Quality. Prentice-Hall, Englewood Cliffs, NJ, 1988.

James Rumbaugh, Michael Blaha, William Premerlani, Frederick Eddy, and William Lorensen. Object-Oriented Modeling and Design. Prentice--Hall, Englewood Cliffs, NJ, 1991.

Xavier Rodet and Philippe Depalle. A new additive synthesis method using inverse Fourier transform and spectral envelopes. In Proceedings of the International Computer Music Conference (ICMC), October 1992.

Xavier Rodet, Philippe Depalle, and Guillermo García. New Possibilities in Sound Analysis and Synthesis. In ISMA, 1995. Available online (footnote 36), PostScript (footnote 37).

X. Rodet, Ph. Depalle, and G. Poirot. Speech Analysis and Synthesis Methods Based on Spectral Envelopes and Voiced/Unvoiced Functions. In European Conf. on Speech Tech., 1987.

Xavier Rodet, Philippe Depalle, and G. Poirot. Speech Analysis and Synthesis Methods Based on Spectral Envelopes and Voiced/Unvoiced Functions. In European Conference on Speech Tech., September 1987.

Xavier Rodet and Dominique François. XSPECT: Introduction, January 1996. Available online (footnote 38).

Xavier Rodet, Dominique François, and Guillaume Levy. Xspect: a New Motif Signal Visualisation, Analysis and Editing Program. In Proceedings of the International Computer Music Conference (ICMC), August 1996. Available online (footnote 39).

Stuart Rosen and Peter Howell. Signals and Systems for Speech and Hearing. Academic Press, London, 1991.

X. Rodet and A. Lefèvre. The Diphone Program: New Features, new Synthesis Methods and Experience of Musical Use. In Proc. ICMC, Thessaloniki, 1997.

Xavier Rodet and Adrien Lefèvre. The Diphone Program: New Features, new Synthesis Methods and Experience of Musical Use. In Proceedings of the International Computer Music Conference (ICMC), Thessaloniki, Greece, September 1997. Abstract (footnote 40), PostScript (footnote 41).

Xavier Rodet and Adrien Lefèvre. The Diphone Program: New Features, new Synthesis Methods and Experience of Musical Use. In Proceedings of the International Computer Music Conference (ICMC), Thessaloniki, Greece, September 1997.

J.C. Risset and M.V. Mathews. Analysis of musical-instrument tones. Physics Today, 22(2):23--30, February 1969.

Curtis Roads. The Computer Music Tutorial. MIT Press, 1996.

Tony Robinson. Speech Analysis, 1998. Online tutorial (footnote 42).

Thierry Rochebois. Méthodes d'analyse synthèse et représentations optimales des sons musicaux basées sur la réduction de données spectrales. PhD thesis, Université Paris XI, December 1997.

X. Rodet. Time-Domain Formant-Wave-Function Synthesis. Computer Music Journal, Fall 1984.

Xavier Rodet. Time-Domain Formant-Wave-Function Synthesis. Computer Music Journal, 8(3):9--14, Fall 1984. reprinted from [Sim80].

X. Rodet. Musical Sound Signals Analysis/Synthesis: Sinusoidal+Residual and Elementary Waveform Models. In Proc. IEEE Time--Frequency/Time--Scale Workshop, 1997.

Xavier Rodet. Musical Sound Signals Analysis/Synthesis: Sinusoidal+Residual and Elementary Waveform Models. In Proceedings of the IEEE Time--Frequency and Time--Scale Workshop (TFTS), August 1997. Abstract (footnote 43), PostScript (footnote 44).

Xavier Rodet. The Additive Analysis--Synthesis Package, 1997. Available online (footnote 45).

X. Rodet, Y. Potard, and J.-B. Barrière. The Chant--Project: From the Synthesis of the Singing Voice to Synthesis in General. Computer Music Journal, Fall 1984.

Xavier Rodet, Yves Potard, and Jean-Baptiste Barrière. The Chant--Project: From the Synthesis of the Singing Voice to Synthesis in General. Computer Music Journal, 8(3):15--31, Fall 1984.

Xavier Rodet, Yves Potard, and Jean-Baptiste Barrière. CHANT: de la synthèse de la voix chantée à la synthèse en général. Rapports de recherche IRCAM, 1985. Available online (footnote 46).

X. Rodet and D. Schwarz. Spectral Envelopes and Additive+Residual Analysis-Synthesis. In J. Beauchamp, ed. The Sound of Music. Springer, N.Y., to be published.

Xavier Rodet and Diemo Schwarz. Spectral Envelopes and Additive+Residual Analysis-Synthesis. In J. Beauchamp, ed. The Sound of Music. Springer, New York, to be published 2000.

Y. Sagisaka. Speech synthesis by rule using an optimal selection of non-uniform synthesis units. In Proc. of the Int'l Conf. on Acoustics, Speech, and Signal Processing, page 679, 1988.

X. Serra, J. Bonada, P. Herrera, and R. Loureiro. Integrating Complementary Spectral Models in the Design of a Musical Synthesizer. In Proc. ICMC, 1997.

X. Serra, J. Bonada, P. Herrera, and R. Loureiro. Integrating Complementary Spectral Models in the Design of a Musical Synthesizer. In Proceedings of the International Computer Music Conference, Thessaloniki, 1997.

X. Serra, J. Bonada, P. Herrera, and R. Loureiro. Integrating Complementary Spectral Models in the Design of a Musical Synthesizer. In Proc. ICMC, Thessaloniki, 1997.

Xavier Serra, Jordi Bonada, Perfecto Herrera, and Ramon Loureiro. Integrating complementary spectral models in the design of a musical synthesizer. In Proceedings of the International Computer Music Conference, 1997.

S. Sutton, R. Cole, J. de Villiers, J. Schalkwyk, P. Vermeulen, M. Macon, Y. Yan, E. Kaiser, B. Rundle, K. Shobaki, P. Hosom, A. Kain, J. Wouters, D. Massaro, and M. Cohen. Universal Speech Tools: the CSLU Toolkit. In Proc. of International Conference on Spoken Language Processing, November 1998. www [CSLU99].

D. Schwarz. Spectral Envelopes in Sound Analysis and Synthesis. Diplomarbeit Nr. 1622, Universität Stuttgart, Fakultät Informatik, Stuttgart, Germany, 1998.

D. Schwarz. Spectral Envelopes in Sound Analysis and Synthesis. Diplomarbeit, Universität Stuttgart, Informatik, 1998.

Diemo Schwarz. Spectral Envelopes in Sound Analysis and Synthesis. Diplomarbeit Nr. 1622, Universität Stuttgart, Fakultät Informatik, Stuttgart, Germany, June 1998.

Ann K. Syrdal, Alistair Conkie, and Yannis Stylianou. Exploration of acoustic correlates in speaker selection for concatenative synthesis. In Proc. ICSLP98, 1998. www [ATT99].

Yannis Stylianou, Thierry Dutoit, and Juergen Schroeter. Diphone concatenation using a harmonic plus noise model of speech. In Proc. Eurospeech '97, pages 613--616, Rhodes, Greece, September 1997. www [TCTS99] Electronic version: tcts/*.

J. C. Simon, editor. Spoken Language Generation and Understanding. D. Reidel Publishing Company, Dordrecht, Holland, 1980.

Y. Sagisaka and N. Kaiki. Optimization of Intonation Control Using Statistical F0 Resetting Characteristics. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, volume 2, pages 49--52, 1992.

Y. Stylianou, J. Laroche, and E. Moulines. High Quality Speech Modification based on a Harmonic+Noise Model. In Proc. EUROSPEECH, 1995.

Patrick Susini, Stephen McAdams, and Suzanne Winsberg. Caractérisation perceptive des bruits de véhicules. In Actes du 4ème Congrès Français d'Acoustique, Marseille, April 1997. Société Française d'Acoustique.

Rational Software. Unified modeling language, version 1.1. Online documentation (footnote 47), September 1997.

Ian Sommerville. Software engineering. International computer science series. Addison--Wesley, Wokingham [et al.], 2nd edition, 1985.

F.K. Soong and A.E. Rosenberg. On the use of instantaneous and transitional spectral information in speaker recognition. In IEEE Transactions on Acoustics, Speech and Signal Processing, volume 36, pages 871--879, 1988.

X. Serra and J. Smith. Spectral Modeling Synthesis: a Sound Analysis/Synthesis System Based on a Deterministic plus Stochastic Decomposition. Computer Music Journal, 14(4):12--24, 1990.

Ann K. Syrdal, Yannis G. Stylianou, Laurie F. Garrison, Alistair Conkie, and Juergen Schroeter. TD-PSOLA versus harmonic plus noise model in diphone based speech synthesis. In Proc. ICASSP98, pages 273--276, 1998. www [ATT99].

Richard Sproat, Paul Taylor, Michael Tanenblatt, and Amy Isard. A markup language for text-to-speech synthesis. In Proc. Eurospeech '97, pages 1747--1750, Rhodes, Greece, September 1997. www [CSTR99] Electronic version: cstr/Sproat_1997_a.*.

Y. Stylianou. Decomposition of speech signals into a deterministic and a stochastic part. In Proc. ICSLP '96, volume 2, pages 1213--1216, Philadelphia, PA, October 1996.

Yannis Stylianou. Concatenative Speech Synthesis using a Harmonic plus Noise Model. In The 3rd ESCA/COCOSDA Workshop on Speech Synthesis, Jenolan Caves, Australia, November 1998. www [ATT99].

Yannis Stylianou. Removing Phase Mismatches in Concatenative Speech Synthesis. In The 3rd ESCA/COCOSDA Workshop on Speech Synthesis, Jenolan Caves, Australia, November 1998. www [ATT99].

Diemo Schwarz and Matthew Wright. Extensions and Applications of the SDIF Sound Description Interchange Format. In Proceedings of the International Computer Music Conference, Berlin, August 2000.

Clemens Szyperski. Component Software: Beyond Object-Oriented Programming. ACM Press and Addison-Wesley, New York, NY, 1998.

Keith A. Teague, Walter Andrews, and Buddy Walls. Enhanced Modeling of Discrete Spectral Amplitudes. In IEEE Workshop on Speech Coding, Pocono Manor, September 1997.

Paul Taylor. The Festival Speech Architecture. Web page, 1999. www [CSTR99].

TCTS (Circuit Theory and Signal Processing) Lab, Faculté Polytechnique de Mons. WWW page, 1999.

D. Thom, H. Purnhagen, and the MPEG Audio Subgroup. MPEG Audio FAQ Version 9. WWW page, October 1998. International Organisation for Standardisation, Organisation Internationale de Normalisation, Coding of Moving Pictures and Audio, ISO/IEC JTC1/SC29/WG11, N2431.

C. Tuerk and T. Robinson. Speech synthesis using artificial neural networks trained on cepstral coefficients. In Proc. EUROSPEECH, pages 1713--1716.

C. Traber. F0 Generation with a Database of Natural F0 Patterns and with a Neural Network. In G. Bailly and C. Benoît, editors, Talking Machines: Theories, Models, and Designs, pages 287--304. North Holland, 1992.

Michael Unser, Akram Aldroubi, and Murray Eden. B--Spline Signal Processing: Part I---Theory. IEEE Transactions on Signal Processing, 41:821--833, 1993.

Ian A. Utting. Lecture Notes in Object-Oriented Software Engineering. University of Kent at Canterbury, Canterbury, UK, 1993.

Van van der Van, Dee de la La, and Don von der Von. The longest bibliographic reference, 1848.

Olivier van der Vrecken, Nicolas Pierret, Thierry Dutoit, Vincent Pagel, and Fabrice Malfrere. A simple and efficient algorithm for the compression of MBROLA segment databases. In Proc. Eurospeech '97, pages 421--424, Rhodes, Greece, September 1997.

Hermann L. von Helmholtz. Die Lehre von den Tonempfindungen: als physiologische Grundlage für die Theorie der Musik. Vieweg, Braunschweig, 6th edition, 1913.

Hermann L. von Helmholtz. On the Sensations of Tone as a Physiological Basis for the Theory of Music. Dover, New York, 1954. Original title: [vH13].

Hermann L. von Helmholtz. Die Lehre von den Tonempfindungen: als physiologische Grundlage für die Theorie der Musik. Georg Olms Verlag, Hildesheim, 1983.

Dominique Virolle. La Librairie CHANT: Manuel d'utilisation des fonctions en C, April 1997. Available online (footnote 48).

Dominique Virolle. Sound Description Interchange Format (SDIF), January 1998. Available online (footnote 49).

H. Valbret, E. Moulines, and J. P. Tubach. Voice transformation using PSOLA technique. Speech Communication, 11(2-3):189--194, June 1992.

R. von Sachs. Peak-insensitive non-parametric spectrum estimation. Journal of Time Series Analysis, 15:429--452, 1994.

J.P.H. van Santen, J. Hirschberg, J. Olive, and R. Sproat, editors. Progress in Speech Synthesis. Springer-Verlag, New York, 1996.

M. Wright et al. New Applications of the Sound Description Interchange Format. In Proc. ICMC, 1998.

M. Wright et al. Audio Applications of the Sound Description Interchange Format Standard. In AES 107th convention, 1999.

G. H. Wakefield. Time--Pitch Representations: Acoustic Signal Processing and Auditory Representations. In Proceedings of the IEEE Intl. Symp. on Time--Frequency/Time--Scale, Pittsburgh, 1998.

G. H. Wakefield. Time--Pitch Representations: Acoustic Signal Processing and Auditory Representations. In Proc. IEEE Intl. Symp. Time--Frequency/Time--Scale, Pittsburgh, 1998.

Matthew Wright, Amar Chaudhary, Adrian Freed, David Wessel, Xavier Rodet, Dominique Virolle, Rolf Woehrmann, and Xavier Serra. New Applications of the Sound Description Interchange Format. In Proceedings of the International Computer Music Conference, 1998.

M. Wright, A. Chaudhary, A. Freed, S. Khoury, and D. Wessel. Audio Applications of the Sound Description Interchange Format Standard. In AES 107th convention, 1999.

Matthew Wright, Amar Chaudhary, Adrian Freed, Sami Khoury, and David Wessel. Audio Applications of the Sound Description Interchange Format Standard. In AES 107th convention preprint, 1999.

M. Wright, A. Chaudhary, A. Freed, S. Khoury, A. Momeni, D. Schwarz, and D. Wessel. An XML-based SDIF Stream Relationships Language. In Proc. ICMC, Berlin, 2000.

Matthew Wright, Amar Chaudhary, Adrian Freed, Sami Khoury, Ali Momeni, Diemo Schwarz, and David Wessel. An XML-based SDIF Stream Relationships Language. In Proceedings of the International Computer Music Conference, Berlin, 2000.

W. J. Wang, W. N. Campbell, N. Iwahashi, and Y. Sagisaka. Tree-based unit selection for English speech synthesis. In Proc. of the Int'l Conf. on Acoustics, Speech, and Signal Processing, pages 191--194, 1993.

M. Wright, R. Dudas, S. Khoury, R. Wang, and D. Zicarelli. Supporting the Sound Description Interchange Format in the Max/MSP Environment. In Proc. ICMC, Beijing, 1999.

Matthew Wright, Richard Dudas, Sami Khoury, Raymond Wang, and David Zicarelli. Supporting the Sound Description Interchange Format in the Max/MSP Environment. In Proceedings of the International Computer Music Conference (ICMC), Beijing, October 1999.

J. Wouters and M. W. Macon. A perceptual evaluation of distance measures for concatenative speech synthesis. In Proc. of International Conference on Spoken Language Processing, November 1998. www [CSLU99].

Peter Wyngaard, Chris Rogers, and Philippe Depalle. UDI 2.1---A Unified DSP Interface, 1992. Available online (footnote 50).

M. Wright and E. Scheirer. Cross-Coding SDIF into MPEG-4 Structured Audio. In Proc. ICMC, Beijing, 1999.

Matthew Wright and Eric D. Scheirer. Cross-Coding SDIF into MPEG-4 Structured Audio. In Proceedings of the International Computer Music Conference (ICMC), Beijing, October 1999.

Marcelo M. Wanderley, Norbert Schnell, and Joseph Rovan. ESCHER---Modeling and Performing composed Instruments in real-time. In IEEE Systems, Man, and Cybernetics Conference, October 1998. To be published.

Jennifer Yuen and Andrew Horner. Hybrid Sampling-Wavetable Synthesis with Genetic Algorithms. 45(5):316--330.

Ping-Fai Yang and Yannis Stylianou. Real time voice alteration based on linear prediction. In Proc. ICSLP98, 1998. www [ATT99].

Eberhard Zwicker. Psychoakustik. Springer, 1982.
