50 results in total
- [22] Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022: 10534-10542
- [23] Separation of audio-visual speech sources: A new approach exploiting the audio-visual coherence of speech stimuli. Sodoyer, D. (sodoyer@icp.inpg.fr). Hindawi Publishing Corporation, 2002
- [24] Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli. EURASIP Journal on Advances in Signal Processing, 2002
- [26] Bayesian separation of audio-visual speech sources. 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol V, Proceedings: Design and Implementation of Signal Processing Systems, Industry Technology Tracks, Machine Learning for Signal Processing, Multimedia Signal Processing, Signal Processing for Education, 2004: 657-660
- [27] A Robust Audio-visual Speech Recognition Using Audio-visual Voice Activity Detection. 11th Annual Conference of the International Speech Communication Association 2010 (INTERSPEECH 2010), Vols 3 and 4, 2010: 2702+
- [28] Multi-stream asynchrony modeling for audio-visual speech recognition. ISM 2007: Ninth IEEE International Symposium on Multimedia, Proceedings, 2007: 37-44
- [29] Audio-Visual Multi-Talker Speech Recognition in A Cocktail Party. INTERSPEECH 2021, 2021: 3021-3025
- [30] Speaker independent audio-visual speech recognition. 2000 IEEE International Conference on Multimedia and Expo, Proceedings Vols I-III, 2000: 1073-1076