共 50 条
- [1] End-to-end audio-visual speech recognition for overlapping speech INTERSPEECH 2021, 2021, : 3016 - 3020
- [2] An Improved End-to-End Audio-Visual Speech Recognition Model INTERSPEECH 2023, 2023, : 3093 - 3097
- [3] MODALITY ATTENTION FOR END-TO-END AUDIO-VISUAL SPEECH RECOGNITION 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6565 - 6569
- [4] FUSING INFORMATION STREAMS IN END-TO-END AUDIO-VISUAL SPEECH RECOGNITION 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3430 - 3434
- [5] Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition INTERSPEECH 2019, 2019, : 4090 - 4094
- [6] Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition INTERSPEECH 2022, 2022, : 2838 - 2842
- [8] End-to-End Bloody Video Recognition by Audio-Visual Feature Fusion PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 501 - 510
- [10] END-TO-END MULTI-PERSON AUDIO/VISUAL AUTOMATIC SPEECH RECOGNITION 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6994 - 6998