共 3 条
- [2] Audio-visual speech translation with automatic LIP synchronization and face tracking based on 3-D head model ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2002, 2
- [3] COMPRESSING TRANSFORMER-BASED ASR MODEL BY TASK-DRIVEN LOSS AND ATTENTION-BASED MULTI-LEVEL FEATURE DISTILLATION ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2022, 2022-May : 7992 - 7996