30 entries in total
[1]
[Anonymous], 2010, JMLR WORKSHOP C P
[2]
Chen L, 2021, AAAI CONF ARTIF INTE, V35, P1036
[3]
Multi-Modal Dynamic Graph Transformer for Visual Grounding [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 15513-15522
[4]
Chen XP, 2018, Arxiv, DOI arXiv:1812.03426
[7]
TransVG: End-to-End Visual Grounding with Transformers [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 1749-1759
[8]
Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, DOI 10.48550/ARXIV.1810.04805]
[9]
Du Y, 2022, Arxiv, DOI arXiv:2105.04281
[10]
Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 16883-16892