Evaluation of Expressive Speech Synthesis

Objective

The objective of this subjective evaluation is to compare the naturalness of three versions of a speech synthesis system. The experiment is open to native French speakers only.

The experiment should take no more than 20 minutes.

By completing this short questionnaire, you are contributing to research on speech synthesis conducted by the Sound Analysis and Synthesis team at IRCAM.

Please use HEADPHONES if possible.

Thanks in advance!


Here are some natural utterances from the speaker whose voice is used for synthesis.

  • Original speaker:
speaker

In the following, you will compare the naturalness of pairs of speech utterances synthesized by different versions of a speech synthesizer.

1. For each row of the table, listen carefully to File 1 and File 2. Both utterances correspond to the same sentence, synthesized with different methods. The differences require careful listening, so please use headphones if you can.

2. Then give a preference score according to the following grading scale:

Much better +3
Better +2
Slightly better +1
About the same 0

Select the score:
  • to the left if you prefer File 1
  • to the right if you prefer File 2.
Pair   File 1   +3   +2   +1   0   +1   +2   +3   File 2
1 to 33 (one row per pair of utterances)
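
For readers interested in how answers on this two-sided scale might be analysed, the sketch below maps each response to a single signed score and averages the scores over pairs. The sign convention (negative favours File 1, positive favours File 2), the function name `signed_score`, and the sample responses are assumptions made for illustration; they are not taken from the actual experiment.

```python
from statistics import mean


def signed_score(side: str, grade: int) -> int:
    """Map one response on the two-sided scale to a signed integer.

    side  -- "left" (File 1 preferred) or "right" (File 2 preferred)
    grade -- 0 (about the same) up to 3 (much better)
    Assumed convention: negative favours File 1, positive favours File 2.
    """
    if side not in ("left", "right"):
        raise ValueError(f"unknown side: {side!r}")
    if grade not in (0, 1, 2, 3):
        raise ValueError(f"grade must be between 0 and 3, got {grade!r}")
    return -grade if side == "left" else grade


# Hypothetical responses for four pairs: (side selected, grade selected).
responses = [("left", 2), ("right", 1), ("right", 3), ("left", 0)]

scores = [signed_score(side, grade) for side, grade in responses]
mean_preference = mean(scores)

print(scores)           # signed score for each pair
print(mean_preference)  # overall preference; a positive value favours File 2
```

Under this convention, a mean close to 0 would suggest no clear preference between the two versions being compared.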

A few more questions:

  • Are you familiar with speech processing or speech synthesis? Yes | No
  • Did you use headphones? Yes | No
  • Your language? Native French speaker | French speaker | Other

Comments

Please verify that you have given a preference for every pair, then press the button to submit.

All recordings are the property of IRCAM.