57 entries in total
[1]
Afouras T., 2018, arXiv preprint arXiv:1809.00496
[3]
Self-supervised Learning of Audio-Visual Objects from Video [J]. COMPUTER VISION - ECCV 2020, PT XVIII, 2020, 12363: 208-224
[4]
My lips are concealed: Audio-visual speech enhancement through obstructions [J]. INTERSPEECH 2019, 2019: 4295-4299
[5]
Afouras T, 2018, INTERSPEECH, P3244
[6]
[Anonymous], 2018, COMP VIS ECCV 2018 W, DOI 10.1163/9789004385580002
[7]
[Anonymous], 2007, CVPR
[8]
[Anonymous], 2016, INT CONF ACOUST SPEE
[9]
Bahdanau D, 2016, INT CONF ACOUST SPEE, P4945, DOI 10.1109/ICASSP.2016.7472618
[10]
Bregman A. S., 1994, Auditory Scene Analysis: The Perceptual Organization of Sound