共 50 条
- [1] Target Active Speaker Detection with Audio-visual Cues [J]. INTERSPEECH 2023, 2023, : 3152 - 3156
- [2] RETHINKING AUDIO-VISUAL SYNCHRONIZATION FOR ACTIVE SPEAKER DETECTION [J]. 2022 IEEE 32ND INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2022,
- [3] Active Speaker Detection Using Audio-Visual Sensor Array [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2014, : 480 - 484
- [4] Active Speaker Detection with Audio-Visual Co-training [J]. ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, : 312 - 316
- [6] AVA-AVD: Audio-Visual Speaker Diarization in the Wild [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3838 - 3847
- [7] Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection [J]. IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2023, 4 : 225 - 232
- [8] Active Speaker Detection Using Audio, Visual, and Depth Modalities: A Survey [J]. IEEE ACCESS, 2024, 12 : 96617 - 96634
- [9] WASD: A Wilder Active Speaker Detection Dataset [J]. IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2025, 7 (01): : 61 - 70
- [10] BEST OF BOTH WORLDS: MULTI-TASK AUDIO-VISUAL AUTOMATIC SPEECH RECOGNITION AND ACTIVE SPEAKER DETECTION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6047 - 6051