共 120 条
- [2] Akbari H., 2021, P NIPS, P24206
- [3] Audio Visual Scene-Aware Dialog [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7550 - 7559
- [4] Alayrac JB, 2022, ADV NEUR IN
- [5] Alfasly S, 2022, P IEEE CVF C COMP VI, P20208
- [6] [Anonymous], 2022, P IEEE CVF C COMP VI, DOI DOI 10.1002/CPE.7048
- [7] VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2425 - 2433
- [8] Look, Listen and Learn [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 609 - 617
- [9] Arandjelovic Relja, 2018, P EUR C COMP VIS ECC, P435