35 entries in total
[1]
Ando A, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P4964, DOI 10.1109/ICASSP.2018.8461299
[2]
Baevski A., 2020, Advances in Neural Information Processing Systems
[3]
Deep Speaker Embeddings for Short-Duration Speaker Verification [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 1517-1521
[4]
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? [J]. INTERSPEECH 2022, 2022: 3699-3703
[5]
Chien C.-M., 2021, ICASSP
[7]
Cooper E, 2020, INT CONF ACOUST SPEE, P6184, DOI 10.1109/ICASSP40776.2020.9054535
[8]
Speaker adaptation in DNN-based speech synthesis using d-vectors [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 3404-3408
[9]
Phoneme Duration Modeling Using Speech Rhythm-Based Speaker Embeddings for Multi-Speaker Speech Synthesis [J]. INTERSPEECH 2021, 2021: 3141-3145
[10]
Gomathi D, 2012, 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, P694