77 entries in total
[61]
Few-Shot Visual Grounding for Natural Human-Robot Interaction [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC), 2021: 50-55
[62]
Vedantam R, 2015, PROC CVPR IEEE, P4566, DOI 10.1109/CVPR.2015.7299087
[63]
Wang Liwei, 2017, NEURAL INFORM PROCES, V2017
[64]
Wang Wei, 2021, ACM INT C MULT
[65]
Weakly-supervised Visual Grounding of Phrases with Linguistic Structures [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 5253-5262
[66]
Xu K, 2015, PR MACH LEARN RES, V37, P2048
[67]
Yang Xun, 2020, ACM INT C MULT
[68]
Yu Haonan, 2013, ANN M ASS COMP LING, V1
[69]
Zaheer Manzil, 2017, INT C MACH LEARN, V8
[70]
Neural Motifs: Scene Graph Parsing with Global Context [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 5831-5840