38 items in total
- [1] BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR. INTERSPEECH 2023, 2023, pp. 3487-3491
- [2] Continuous Streaming Multi-Talker ASR with Dual-Path Transducers. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 7317-7321
- [3] Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. INTERSPEECH 2022, 2022, pp. 521-525
- [4] Endpoint Detection for Streaming End-to-End Multi-Talker ASR. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 7312-7316
- [5] Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR. INTERSPEECH 2020, 2020, pp. 3097-3101
- [6] Streaming Multi-talker Speech Recognition with Joint Speaker Identification. INTERSPEECH 2021, 2021, pp. 1782-1786
- [8] Token-level Adaptive Training for Neural Machine Translation. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020, pp. 1035-1046
- [9] Recognizing Multi-talker Speech with Permutation Invariant Training. 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017), Vols 1-6: Situated Interaction, 2017, pp. 2456-2460
- [10] Knowledge Distillation for End-to-End Monaural Multi-talker ASR System. INTERSPEECH 2019, 2019, pp. 2633-2637