共 50 条
- [1] End-to-end audio-visual speech recognition for overlapping speech INTERSPEECH 2021, 2021, : 3016 - 3020
- [2] END-TO-END AUDIO-VISUAL SPEECH RECOGNITION WITH CONFORMERS 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7613 - 7617
- [3] An Improved End-to-End Audio-Visual Speech Recognition Model INTERSPEECH 2023, 2023, : 3093 - 3097
- [4] MODALITY ATTENTION FOR END-TO-END AUDIO-VISUAL SPEECH RECOGNITION 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6565 - 6569
- [6] End-to-End Multi-Person Pose Estimation with Transformers 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11059 - 11068
- [7] FUSING INFORMATION STREAMS IN END-TO-END AUDIO-VISUAL SPEECH RECOGNITION 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3430 - 3434
- [9] END-TO-END VISUAL SPEECH RECOGNITION WITH LSTMS 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2592 - 2596
- [10] Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition INTERSPEECH 2019, 2019, : 4090 - 4094