51 entries in total
- [2] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 6077-6086
- [3] Ba L.J., 2016, arXiv
- [6] Chen YC, 2019, AEBMR ADV ECON, V106, P104, DOI 10.1007/978-3-030-58577-8_7
- [8] Du Yunhao, 2022, 2022 IEEE INT C MULT, P1, DOI 10.1109/ICME52920.2022.9859880
- [9] Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 6087-6096