Evaluation of expressivity and singing style modelisation for singing voice synthesis
(Not active anymore)
contact : luc.ardaillon@ircam.fr



Thank you for your time ! Please read carefully the following informations, even though you are used to doing such tests !

Objectives of this test

This listening test aims at evaluating the expressivity and modelisation of singing styles in singing voice synthesis, based on variations of pitch and phonemes durations.
We do not want to assess the overall quality of the synthesis.
However, the sounds below may contain artefacts which are not related to this modelisation (e.g. unatural timbre, unexpected noise, ...).
Please try to ignore these artefacts and focus only on the pitch variations (attacks, transitions between notes, vibrato, ...) and phonemes durations.

You will be presented 2 tests below. The first one aims at evaluating the expressivity (or liveliness, musicality) of singing voice synthesis using various settings.
The second one aims at evaluating the modelisation of the singing style.
Detailed explanations on the evaluation procedure are given below for each test.

General recommendations


Test I

For each pair of recordings below (each line) select one button depending on your preference about the expressivity of the two interpretations.
By the term "expressive", we mean an interpretation that sounds lively, with musical intentions, as opposed to a more mechanical or static interpretation.

... and the same on the other way.

There is no "correct" answer. It is only about your subjective preference.



PairFile1+3+2+10+1+2+3File2Prob
1
PairFile1+3+2+10+1+2+3File2Prob
2
PairFile1+3+2+10+1+2+3File2Prob
3
PairFile1+3+2+10+1+2+3File2Prob
4
PairFile1+3+2+10+1+2+3File2Prob
5
PairFile1+3+2+10+1+2+3File2Prob
6
PairFile1+3+2+10+1+2+3File2Prob
7
PairFile1+3+2+10+1+2+3File2Prob
8
PairFile1+3+2+10+1+2+3File2Prob
9
PairFile1+3+2+10+1+2+3File2Prob
10



Test II

In this second test, you are asked to assess, among 2 different synthesis, which one presents a singing style which is the most similar to the style of a "Target style" from an original recording.
For each line, first listen to leftmost sound in the column "Target style" to get an idea of the main characteristic of this style.
Then, listen to the 2 other sounds in columns "File 1" and "File 2", and choose a button (similarly to Test I) according to whether you think that the target singing style is more similar to that of File 1 or File 2.
Please try to focus mainly on differences in the pitch variations (attacks, transitions between notes, vibrato, ...) and phonemes durations. (other features are not modeled here).


Synthesis:


PairTarget styleFile 1+3+2+10+1+2+3File 2Prob
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25



Once finished, you can reassess the comparisons as many times as you want.

Some information about you

What's your mother tongue ?
Are you
Female Male
Age years
How did you listen to the sounds ?
Headphones Earphones Loudspeakers
Do you play any musical instrument (including singing) ?
Yes No
What is your level in singing? (on a 0-5 scale: 0=do not sing; 5=professional singer)
Are you a professionnal involved in audio processing (e.g. sound engineer, researcher in audio processing) ?
Yes No
Are you familiar with listening tests ?
Yes No
Are you familiar with voice synthesis techniques ?
Yes No
(optional) Leave your '''e-mail''' address ...
(optional) ... if you want to leave a message:

Then

(even though you have not had time to evaluate all pairs, please send it anyway. A partial answer will still be useful.)

The sounds used in this test are under Copyright © 2017 Ircam, Institut de recherche et coordination acoustique/musique.