共 58 条
[1]
Afouras T., 2019, IEEE PAMI
[2]
Afouras T, 2018, Arxiv, DOI arXiv:1809.00496
[3]
My lips are concealed: Audio-visual speech enhancement through obstructions
[J].
INTERSPEECH 2019,
2019,
:4295-4299
[4]
Afouras T, 2018, INTERSPEECH, P3244
[6]
Look, Listen and Learn
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:609-617
[7]
Barzelay Z., 2007, 2007 IEEE C COMP VIS
[8]
Cross-Modal Supervision for Learning Active Speaker Detection in Video
[J].
COMPUTER VISION - ECCV 2016, PT V,
2016, 9909
:285-301
[9]
Chatfield K, 2014, Arxiv, DOI arXiv:1405.3531
[10]
Chen Ting, 2020, ICML