共 19 条
- [1] Bain M, 2023, WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
- [2] Bain Max, 2023, INTERSPEECH 2023
- [3] Bredin H., 2023, P INTERSPEECH 2023
- [4] Bredin H, 2020, INT CONF ACOUST SPEE, P7124, DOI [10.1109/icassp40776.2020.9052974, 10.1109/ICASSP40776.2020.9052974]
- [5] Chan W, 2016, INT CONF ACOUST SPEE, P4960, DOI 10.1109/ICASSP.2016.7472621
- [6] OVERLAP-AWARE LOW-LATENCY ONLINE SPEAKER DIARIZATION BASED ON END-TO-END LOCAL SEGMENTATION [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1139 - 1146
- [7] Dehak N, 2009, INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, P1527
- [8] ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification [J]. INTERSPEECH 2020, 2020, : 3830 - 3834
- [9] Dong LH, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P5884, DOI 10.1109/ICASSP.2018.8462506
- [10] Joint Speech Recognition and Speaker Diarization via Sequence Transduction [J]. INTERSPEECH 2019, 2019, : 396 - 400