An auto-encoder for neural pitch transformations

This website contains supplementary material for our paper ``A bottleneck auto-encoder for F0 transformations on speech and singing voice'', currently under review for the Special Issue - Signal Processing with Convolutional Neural Networks of the MDPI journal Information.

Samples for Singing Voice

Below are samples from the test set of our singing dataset. Click on a sample to open a summary page for that sample, containing transpositions from different models.

Samples for Speech

Below are samples from the test set of our speech dataset. Click on a sample to open a summary page for that sample, containing transpositions from different models.

VCTK Female p361

10-205-1 , 10-205-2 , 10-205-3 , 10-205-4 , 10-205-5 , 10-205-6 , 10-205-7 , 10-205-8 ,

VCTK Female p362

10-207-1 , 10-207-2 , 10-207-3 , 10-207-4 , 10-207-5 , 10-207-6 , 10-207-7 , 10-207-8 ,

VCTK Male p374

10-213-1 , 10-213-2 , 10-213-3 , 10-213-4 , 10-213-5 , 10-213-6 , 10-213-7 , 10-213-8 ,

VCTK Male p376

10-215-1 , 10-215-2 , 10-215-3 , 10-215-4 , 10-215-5 , 10-215-6 , 10-215-7 , 10-215-8 ,

A bottleneck auto-encoder for F0 transformations on speech and singing voice

Samples for Singing Voice

Soprano

Alto

Counter Tenor

Tenor

Baritone

Byzantine

Childvoice

French Pop Female

JPop Male

JPop Female

unseen female

unseen male

Samples for Speech

VCTK Female p361

VCTK Female p362

VCTK Male p374

VCTK Male p376