共 38 条
- [1] Bullock L, 2020, INT CONF ACOUST SPEE, P7114, DOI [10.1109/ICASSP40776.2020.9053096, 10.1109/icassp40776.2020.9053096]
- [2] Carletta J, 2005, LECT NOTES COMPUT SC, V3869, P28
- [3] Chen Z, 2020, INT CONF ACOUST SPEE, P7284, DOI [10.1109/ICASSP40776.2020.9053426, 10.1109/icassp40776.2020.9053426]
- [4] Chung JS, 2018, INTERSPEECH, P1086
- [5] Diez M., 2018, P OD, P147
- [6] End-to-End Neural Speaker Diarization with Permutation-Free Objectives [J]. INTERSPEECH 2019, 2019, : 4300 - 4304
- [7] Fujita Y, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P296, DOI [10.1109/ASRU46091.2019.9003959, 10.1109/asru46091.2019.9003959]
- [8] Gao Shanghua, 2019, IEEE T PAMI
- [9] Garcia-Romero D, 2017, INT CONF ACOUST SPEE, P4930, DOI 10.1109/ICASSP.2017.7953094
- [10] Conformer: Convolution-augmented Transformer for Speech Recognition [J]. INTERSPEECH 2020, 2020, : 5036 - 5040