共 43 条
[2]
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:2631-2639
[3]
Chen D., 2014, P 2014 C EMP METH NA, P740, DOI [10.3115/v1/D14-1082, DOI 10.3115/V1/D14-1082]
[4]
See-Through-Text Grouping for Referring Image Segmentation
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:7453-7462
[5]
Chen LC, 2016, Arxiv, DOI arXiv:1412.7062
[6]
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[8]
Chen YW, 2019, Arxiv, DOI arXiv:1910.04748
[9]
Graph-Based Global Reasoning Networks
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:433-442