共 291 条
[3]
Afouras T, 2022, Arxiv, DOI [arXiv:2104.06401, 10.48550/ARXIV.2104.06401, DOI 10.48550/ARXIV.2104.06401]
[4]
Self-supervised object detection from audio-visual correspondence
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2022,
:10565-10576
[5]
Afouras T, 2018, Arxiv, DOI arXiv:1809.00496
[6]
Self-supervised Learning of Audio-Visual Objects from Video
[J].
COMPUTER VISION - ECCV 2020, PT XVIII,
2020, 12363
:208-224
[7]
Audio-Visual Face Reenactment
[J].
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV),
2023,
:5167-5176
[8]
Audio-Visual Multimedia Quality Assessment A Comprehensive Survey
[J].
IEEE ACCESS,
2017, 5
:21090-21117
[9]
[Anonymous], 2004, Proceedings of the 6th international conference on Multimodal interfaces
[10]
[Anonymous], 2000, Advances in Neural Information Processing Systems