Evaluation of expressivity and singing style modelling for singing voice synthesis
(Not active anymore)
contact : luc.ardaillon@ircam.fr



Thank you for your time! Please read the following information carefully, even if you are used to taking such tests.

This listening test aims to evaluate the expressivity and the modelling of singing styles based on pitch and phoneme durations, for singing voice synthesis.
We do not want to assess the overall quality of the synthesis or the techniques used for processing the waveforms.
However, the sounds below may contain artefacts that are not related to this modelling (e.g. unnatural timbre, distortions, ...).
Please try to ignore these artefacts and focus only on the pitch variations and phoneme durations.

Two tests are presented below. The first one aims to evaluate the expressivity (or liveliness, musicality) of singing voice synthesis using various settings.
The second one aims to evaluate the modelling of the singing style.
Detailed explanations of the evaluation procedure are given below for each test.

Recommendations


Test I

For each pair of recordings below (one pair per line), select the button that reflects your preference regarding the expressivity of the two interpretations.
By "expressive", we mean an interpretation that sounds lively and conveys musical intentions, as opposed to a more mechanical or static interpretation.

... and the same the other way round.

There is no "correct" answer. It is only about your subjective preference.



Pair | File1 | +3 | +2 | +1 | 0 | +1 | +2 | +3 | File2 | Prob
1
2
3
4
5
6
7
8



Test II

For each line below, first listen to the two sounds in the "style A" and "style B" columns. Then listen to the third sound, in the "style X" column, and select "A" or "B" according to whether you think its expressivity and singing style are more similar to the sound in the "style A" or the "style B" column.
Please note that many parameters related to singing style, such as intensity, timbre, tempo, and variations of the score itself (rhythm and notes), are not modelled in the synthesis and thus should not be considered in this evaluation.
The score used for the synthesis is "stylistically neutral" and differs from the interpretations in the original recordings in the "style A" and "style B" columns, which have different rhythms and pitches; these differences should not be considered in the evaluation.
Your choice should be based only on the relative pitch variations (attacks, transitions, vibrato, ...) and phoneme durations.


Pair | style A | style B | style X | A | B | Prob
9
10
11
12
13
14
15
16



Once finished, you can reassess the comparisons as many times as you want.

Some information about you

What is your mother tongue?
Are you:
Female / Male
Age (years):
How did you listen to the sounds?
Headphones / Earphones / Loudspeakers
Do you play any musical instrument (including singing)?
Yes / No
What is your level in singing? (on a 0-5 scale: 0 = do not sing; 5 = professional singer)
Are you a professional involved in audio processing (e.g. sound engineer, researcher in audio processing)?
Yes / No
Are you familiar with listening tests?
Yes / No
Are you familiar with voice synthesis techniques?
Yes / No
(optional) Leave your e-mail address ...
(optional) ... if you want to leave a message:

Please check that all pairs have been evaluated, then

The sounds used in this test are under Copyright © 2016 Ircam, Institut de recherche et coordination acoustique/musique.