共 45 条
[31]
Comprehension-guided referring expressions
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3125-3134
[32]
Generation and Comprehension of Unambiguous Object Descriptions
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:11-20
[33]
Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries
[J].
COMPUTER VISION - ECCV 2018, PT XI,
2018, 11215
:656-672
[34]
Learning Deconvolution Network for Semantic Segmentation
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:1520-1528
[35]
Qin XL, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), P1, DOI [10.1109/ATNAC.2017.8215431, 10.1109/ICPHM.2017.7998297]
[36]
Zero-Shot Grounding of Objects from Natural Language Queries
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:4693-4702
[38]
A Fast and Accurate One-Stage Approach to Visual Grounding
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:4682-4692
[39]
Cross-Modal Self-Attention Network for Referring Image Segmentation
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:10494-10503
[40]
MAttNet: Modular Attention Network for Referring Expression Comprehension
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:1307-1315