共 46 条
[2]
My lips are concealed: Audio-visual speech enhancement through obstructions
[J].
INTERSPEECH 2019,
2019,
:4295-4299
[3]
Afouras T, 2018, INTERSPEECH, P3244
[4]
BEST OF BOTH WORLDS: MULTI-TASK AUDIO-VISUAL AUTOMATIC SPEECH RECOGNITION AND ACTIVE SPEAKER DETECTION
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2022,
:6047-6051
[5]
Chiu CC, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P4774, DOI 10.1109/ICASSP.2018.8462105
[6]
Choi H.-S., 2019, INT C LEARN REPR
[7]
Lip Reading Sentences in the Wild
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3444-3450
[9]
Dong LH, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P5884, DOI 10.1109/ICASSP.2018.8462506
[10]
Dosovitskiy A., 2020, PROC INT C LEARNING