共 50 条
- [11] Streaming Audio-Visual Speech Recognition with Alignment Regularization INTERSPEECH 2023, 2023, : 1598 - 1602
- [12] Noisy Speech Recognition Based on Combined Audio-Visual Classifiers MULTIMODAL PATTERN RECOGNITION OF SOCIAL SIGNALS IN HUMAN-COMPUTER-INTERACTION, 2015, 8869 : 43 - 53
- [15] Multi-pose lipreading and audio-visual speech recognition EURASIP Journal on Advances in Signal Processing, 2012
- [16] Speech enhancement and recognition in meetings with an audio-visual sensor array IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2257 - 2269
- [17] Transfer Learning from Audio-Visual Grounding to Speech Recognition INTERSPEECH 2019, 2019, : 3242 - 3246
- [19] Audio-Visual Speech Recognition Using A Two-Step Feature Fusion Strategy 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1896 - 1903
- [20] Audio-visual Integration for Robust Speech Recognition Using Maximum Weighted Stream Posteriors INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 869 - 872