34 entries in total
[1]
Afouras T., 2018, arXiv preprint arXiv:1809.00496
[3]
My lips are concealed: Audio-visual speech enhancement through obstructions [J]. INTERSPEECH 2019, 2019: 4295-4299
[4]
[Anonymous], CVPR
[5]
ViViT: A Video Vision Transformer [J]. 2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021), 2021: 6816-6826
[6]
Assael Yannis M, 2016, arXiv preprint arXiv:1611.01599
[7]
Bahdanau D, 2016, arXiv preprint arXiv:1409.0473
[8]
Bertasius G., 2021, arXiv
[9]
Braga Otavio, 2021, ICASSP
[10]
Braga Otavio, 2020, ICASSP