共 12 条
[1]
Baevski A, 2020, ADV NEUR IN, V33
[2]
Choi H.-S., 2021, ADV NEUR IN, V34, p16 251
[3]
Kameoka H, 2018, IEEE W SP LANG TECH, P266, DOI 10.1109/SLT.2018.8639535
[4]
Kaneko T, 2017, Arxiv, DOI [arXiv:1711.11293, 10.48550/ARXIV.1711.11293]
[5]
MASKCYCLEGAN-VC: LEARNING NON-PARALLEL VOICE CONVERSION WITH FILLING IN FRAMES
[J].
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021),
2021,
:5919-5923
[6]
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion
[J].
INTERSPEECH 2020,
2020,
:2017-2021
[7]
Kaneko T, 2019, INT CONF ACOUST SPEE, P6820, DOI [10.1109/icassp.2019.8682897, 10.1109/ICASSP.2019.8682897]
[8]
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
[J].
INTERSPEECH 2021,
2021,
:1349-1353
[9]
Mohammadi SH, 2014, IEEE W SP LANG TECH, P19, DOI 10.1109/SLT.2014.7078543
[10]
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
[J].
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING,
2007, 15 (08)
:2222-2235