共 50 条
[1]
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:2631-2639
[2]
Burks A.W., 1954, MATH TABLES OTHER AI, V8, P53, DOI DOI 10.1090/S0025-5718-1954-0061484-4
[3]
Dense and Low-Rank Gaussian CRFs Using Deep Embeddings
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:5113-5122
[4]
See-Through-Text Grouping for Referring Image Segmentation
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:7453-7462
[5]
Chen L-C, 2015, ARXIV
[7]
Chen Yi-Wen, 2019, BMVC
[8]
Graph-Based Global Reasoning Networks
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:433-442
[9]
Visual Grounding via Accumulated Attention
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:7746-7755
[10]
Duta I. C., 2020, ARXIV