共 50 条
- [41] Comparison between different feature extraction techniques for audio-visual speech recognition Journal on Multimodal User Interfaces, 2007, 1 : 7 - 20
- [42] AUDIO-VISUAL SPEECH RECOGNITION INCORPORATING FACIAL DEPTH INFORMATION CAPTURED BY THE KINECT 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2714 - 2717
- [45] Matrix-MCE Based Fuzzy Neural Network for Speech Recognition 11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 546 - 550
- [46] Audio-visual Integration for Robust Speech Recognition Using Maximum Weighted Stream Posteriors INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 869 - 872
- [47] MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation INTERSPEECH 2023, 2023, : 4064 - 4068
- [50] Statistical multimodal integration for audio-visual speech processing IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 854 - 866