共 50 条
- [31] MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation INTERSPEECH 2023, 2023, : 4064 - 4068
- [32] Multifactor fusion for audio-visual speaker recognition LECTURE NOTES IN SIGNAL SCIENCE, INTERNET AND EDUCATION (SSIP'07/MIV'07/DIWEB'07), 2007, : 70 - +
- [34] My lips are concealed: Audio-visual speech enhancement through obstructions INTERSPEECH 2019, 2019, : 4295 - 4299
- [35] Statistical multimodal integration for audio-visual speech processing IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 854 - 866
- [38] Multi-Task Joint Learning for Embedding Aware Audio-Visual Speech Enhancement 2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 255 - 259
- [39] Improving Visual Speech Enhancement Network by Learning Audio-visual Affinity with Multi-head Attention INTERSPEECH 2022, 2022, : 971 - 975
- [40] Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04):