28 entries in total
- [11] End-to-End Neural Speaker Diarization with Permutation-Free Objectives [J]. INTERSPEECH 2019, 2019: 4300-4304
- [12] Fujita Y, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P296, DOI 10.1109/ASRU46091.2019.9003959
- [13] End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors [J]. INTERSPEECH 2020, 2020: 269-273
- [14] End-to-End Speaker Diarization as Post-Processing [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 7188-7192
- [15] Integrating End-to-End Neural and Clustering-Based Diarization: Getting the Best of Both Worlds [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 7198-7202
- [16] Landini Federico, 2020, ICASSP 2020 2020 IEE
- [17] Landini Federico, COMPUT SPEECH LANG, V71, P2022
- [18] Lin, 2019, ARXIV PREPRINT ARXIV
- [19] VoxCeleb: a large-scale speaker identification dataset [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 2616-2620
- [20] Efficient use of overlap information in speaker diarization [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007: 683-686