42 entries in total
- [2] Afouras Triantafyllos, 2020, LNCS, DOI 10.1007/978-3-030-58523-5_13
- [3] End-to-End Active Speaker Detection [J]. COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 126 - 143
- [4] Alcazar Juan Leon, 2020, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, P12465
- [5] Killing Two Birds with One Stone: Efficient and Robust Training of Face Recognition CNNs by Partial FC [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022 : 4032 - 4041
- [7] Bronkhorst AW, 2000, ACUSTICA, V86, P117
- [8] Chen Z., 2023, INTERSPEECH
- [10] Who said that?: Audio-visual speaker diarisation of real-world meetings [J]. INTERSPEECH 2019, 2019 : 371 - 375