共 50 条
- [1] DEEP MULTIMODAL LEARNING FOR AUDIO-VISUAL SPEECH RECOGNITION 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2130 - 2134
- [2] An audio-visual corpus for multimodal automatic speech recognition Journal of Intelligent Information Systems, 2017, 49 : 167 - 192
- [5] Indonesian Audio-Visual Speech Corpus for Multimodal Automatic Speech Recognition 2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 381 - 385
- [6] Multimodal Learning Using 3D Audio-Visual Data or Audio-Visual Speech Recognition 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 40 - 43
- [7] AFT-SAM: Adaptive Fusion Transformer with a Sparse Attention Mechanism for Audio-Visual Speech Recognition APPLIED SCIENCES-BASEL, 2025, 15 (01):
- [9] Audio-Visual Action Recognition Using Transformer Fusion Network APPLIED SCIENCES-BASEL, 2024, 14 (03):