Zhang, Y., Weiss, R. J., Zen, H., Wu, Y., Chen, Z., Skerry-Ryan, R. J., ... & Ramabhadran, B. (2019). Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning. arXiv preprint arXiv:1907.04448.
Length scale | 0.8 | 0.9 | 1.0 | 1.1 | 1.2 |
---|---|---|---|---|---|
Text1 | |||||
Text2 | |||||
Text3 | |||||
Text4 |
Length scale | 0.8 | 0.9 | 1.0 | 1.1 | 1.2 | |
---|---|---|---|---|---|---|
Text1 | CMUA-BDL (Trained male) |
|||||
LJSpeech (Trained female) |
||||||
VCTK-P226 (Unseen male) |
||||||
BC2013 (Unseen female) |
||||||
Text2 | CMUA-BDL (Trained male) |
|||||
LJSpeech (Trained female) |
||||||
VCTK-P226 (Unseen male) |
||||||
BC2013 (Unseen female) |
||||||
Text3 | CMUA-BDL (Trained male) |
|||||
LJSpeech (Trained female) |
||||||
VCTK-P226 (Unseen male) |
||||||
BC2013 (Unseen female) |
||||||
Text4 | CMUA-BDL (Trained male) |
|||||
LJSpeech (Trained female) |
||||||
VCTK-P226 (Unseen male) |
||||||
BC2013 (Unseen female) |
Length scale | 0.8 | 0.9 | 1.0 | 1.1 | 1.2 | |
---|---|---|---|---|---|---|
Text1 | VCTK-P360 (Trained male 1) |
|||||
LJSpeech (Trained female 2) |
||||||
VCTK-P226 (Trained male 2) |
||||||
VCTK-P240 (Trained female 2) |
||||||
Text2 | VCTK-P360 (Trained male 1) |
|||||
LJSpeech (Trained female 2) |
||||||
VCTK-P226 (Trained male 2) |
||||||
VCTK-P240 (Trained female 2) |
||||||
Text3 | VCTK-P360 (Trained male 1) |
|||||
LJSpeech (Trained female 2) |
||||||
VCTK-P226 (Trained male 2) |
||||||
VCTK-P240 (Trained female 2) |
||||||
Text4 | VCTK-P360 (Trained male 1) |
|||||
LJSpeech (Trained female 2) |
||||||
VCTK-P226 (Trained male 2) |
||||||
VCTK-P240 (Trained female 2) |
Length scale | 0.8 | 0.9 | 1.0 | 1.1 | 1.2 | |
---|---|---|---|---|---|---|
Text1 | CMUA-BDL (Male1) |
|||||
LJSpeech (Female1) |
||||||
CMUA-AWB (Male2) |
||||||
CMUA-CLB (Female2) |
||||||
Text2 | CMUA-BDL (Male1) |
|||||
LJSpeech (Female1) |
||||||
CMUA-AWB (Male2) |
||||||
CMUA-CLB (Female2) |
||||||
Text3 | CMUA-BDL (Male1) |
|||||
LJSpeech (Female1) |
||||||
CMUA-AWB (Male2) |
||||||
CMUA-CLB (Female2) |
||||||
Text4 | CMUA-BDL (Male1) |
|||||
LJSpeech (Female1) |
||||||
CMUA-AWB (Male2) |
||||||
CMUA-CLB (Female2) |
To | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LJSpeech (Female1) |
VCTK-P226 (Male1) |
VCTK-P360 (Male2) |
VCTK-P240 (Female2) |
|||||||||||
0.8 | 1.0 | 1.2 | 0.8 | 1.0 | 1.2 | 0.8 | 1.0 | 1.2 | 0.8 | 1.0 | 1.2 | |||
From | LJSpeech (Female1) |
|||||||||||||
VCTK-P226 (Male1) |
||||||||||||||
VCTK-P360 (Male2) |
||||||||||||||
VCTK-P240 (Female2) |