共 50 条
[31]
FEATURE SPACE VIDEO STREAM CONSISTENCY ESTIMATION FOR DYNAMIC STREAM WEIGHTING IN AUDIO-VISUAL SPEECH RECOGNITION
[J].
2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5,
2008,
:1316-1319
[32]
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
[J].
INTERSPEECH 2022,
2022,
:2833-2837
[34]
System for Producing Subtitles to Internet Audio-Visual Documents
[J].
2015 38TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP),
2015,
[36]
Connectionism based audio-visual speech recognition method
[J].
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition),
2024, 54 (10)
:2984-2993
[37]
An Audio-Visual Attention System for Online Association Learning
[J].
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5,
2009,
:2127-2130
[39]
Streaming Audio-Visual Speech Recognition with Alignment Regularization
[J].
INTERSPEECH 2023,
2023,
:1598-1602
[40]
Optimality and Limitations of Audio-Visual Integration for Cognitive Systems
[J].
FRONTIERS IN ROBOTICS AND AI,
2020, 7