共 54 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
[Anonymous], 2022, INT C MACH LEARN
[3]
[Anonymous], 2015, P 24 ACM INT C INF K
[4]
[Anonymous], 2012, P 50 ANN M ASS COMPU
[5]
Bollacker K, 2008, Proceedings of SIGMOD, SIGMOD '08, P1247
[6]
Bordes A., 2013, P ADV NEUR INF PROC, P2787, DOI DOI 10.5555/2999792.2999923
[7]
Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:9959-9968
[8]
Cross-modal Ambiguity Learning for Multimodal Fake News Detection
[J].
PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22),
2022,
:2897-2905
[9]
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]
Deng Mingkai, 2022, ARXIV220512548