共 50 条
- [21] Semantic Audio-Visual Navigation 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15511 - 15520
- [22] Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1456 - 1463
- [23] MTCAM: A Novel Weakly-Supervised Audio-Visual Saliency Prediction Model With Multi-Modal Transformer IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1756 - 1771
- [25] Audio-visual event detection based on mining of semantic audio-visual labels STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2004, 2004, 5307 : 292 - 299
- [28] Multi-modal fusion learning through biosignal, audio, and visual content for detection of mental stress Neural Computing and Applications, 2023, 35 : 24435 - 24454
- [29] Multi-modal fusion learning through biosignal, audio, and visual content for detection of mental stress NEURAL COMPUTING & APPLICATIONS, 2023, 35 (34): : 24435 - 24454
- [30] Multi-Modal Anomaly Detection by Using Audio and Visual Cues IEEE ACCESS, 2021, 9 : 30587 - 30603