共 50 条
- [11] How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1173 - 1183
- [12] Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2023, 4 : 225 - 232
- [13] Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 10534 - 10542
- [14] Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3927 - 3935
- [15] Active Speaker Detection Using Audio, Visual, and Depth Modalities: A Survey IEEE ACCESS, 2024, 12 : 96617 - 96634
- [16] Audio-Visual Synchronisation for Speaker Diarisation 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2662 - +
- [17] WASD: A Wilder Active Speaker Detection Dataset IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2025, 7 (01): : 61 - 70
- [18] BEST OF BOTH WORLDS: MULTI-TASK AUDIO-VISUAL AUTOMATIC SPEECH RECOGNITION AND ACTIVE SPEAKER DETECTION 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6047 - 6051
- [19] E-Talk: Accelerating Active Speaker Detection with Audio-Visual Fusion and Edge-Cloud Computing 2023 20TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING, SECON, 2023,
- [20] Speaker position detection system using audio-visual information FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 1999, 35 (02): : 212 - 220