50 items in total
- [31] Face-to-talk: Audio-visual speech detection for robust speech recognition in noisy environment. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D(03): 505-513
- [32] THE NEW DELFT UNIVERSITY OF TECHNOLOGY DATA CORPUS FOR AUDIO-VISUAL SPEECH RECOGNITION. EUROMEDIA'2009, 2009: 63-69
- [33] Comparison between different feature extraction techniques for audio-visual speech recognition. Journal on Multimodal User Interfaces, 2007, 1: 7-20
- [34] Audio-Visual Speech Recognition Scheme Based on Wavelets and Random Forests Classification. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2015, 2015, 9423: 567-574
- [35] A robust visual feature extraction based BTSM-LDA for audio-visual speech recognition. 2007 SECOND INTERNATIONAL CONFERENCE IN COMMUNICATIONS AND NETWORKING IN CHINA, VOLS 1 AND 2, 2007: 1044-+
- [37] Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008: 2241-2244
- [39] MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation. INTERSPEECH 2023, 2023: 4064-4068
- [40] SPEAKER-TARGETED AUDIO-VISUAL SPEECH RECOGNITION USING A HYBRID CTC/ATTENTION MODEL WITH INTERFERENCE LOSS. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 251-255