共 53 条
- [1] Transfer Learning, Style Control, and Speaker Reconstruction Loss for Zero-Shot Multilingual Multi-Speaker Text-to-Speech on Low-Resource Languages [J]. IEEE ACCESS, 2022, 10 : 5895 - 5911
- [2] Baevski A., 2021, Advances in Neural Information Processing Systems, P27826
- [3] Baevski A, 2020, ADV NEUR IN, V33
- [4] Cao YW, 2019, INT CONF ACOUST SPEE, P6935, DOI [10.1109/ICASSP.2019.8682927, 10.1109/icassp.2019.8682927]
- [5] Casanova E, 2022, PR MACH LEARN RES
- [6] SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model [J]. INTERSPEECH 2021, 2021, : 3645 - 3649
- [7] Cross-lingual, Multi-speaker Text-To-Speech Synthesis Using Neural Speaker Embedding [J]. INTERSPEECH 2019, 2019, : 2105 - 2109
- [8] Cho W., 2022, INTERSPEECH, P1
- [9] Choi BJ, 2022, ASIAPAC SIGN INFO PR, P1708, DOI 10.23919/APSIPAASC55919.2022.9979900