共 27 条
- [1] Joint Encoder-Decoder Self-Supervised Pre-training for ASR [J]. INTERSPEECH 2022, 2022, : 3418 - 3422
- [2] Baevski A, 2020, ADV NEUR IN, V33
- [4] LARGE-SCALE SELF-SUPERVISED SPEECH REPRESENTATION LEARNING FOR AUTOMATIC SPEAKER VERIFICATION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6147 - 6151
- [5] Cosentino J, 2020, Arxiv, DOI arXiv:2005.11262
- [6] Listen only to me! How well can target speech extraction handle false alarms? [J]. INTERSPEECH 2022, 2022, : 216 - 220
- [7] Delcroix M, 2020, INT CONF ACOUST SPEE, P691, DOI [10.1109/icassp40776.2020.9054683, 10.1109/ICASSP40776.2020.9054683]
- [8] SpEx plus : A Complete Time Domain Speaker Extraction Network [J]. INTERSPEECH 2020, 2020, : 1406 - 1410
- [9] DPCCN: DENSELY-CONNECTED PYRAMID COMPLEX CONVOLUTIONAL NETWORK FOR ROBUST SPEECH SEPARATION AND EXTRACTION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7292 - 7296