38 items in total
[11]
Hu Zhiwei, 2020, CVPR
[12]
Referring Image Segmentation via Cross-Modal Progressive Comprehension. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 10485-10494
[13]
Linguistic Structure Guided Context Modeling for Referring Image Segmentation. COMPUTER VISION - ECCV 2020, PT X, 2020, 12355: 59-75
[14]
Jing Yongcheng, 2021, CVPR
[15]
Kazemzadeh S., 2014, P 2014 C EMPIRICAL M, P787
[16]
Li RC, 2020, IEEE POSITION LOCAT, P798, DOI 10.1109/PLANS46316.2020.9109908
[17]
Lin HB, 2022, PROCEEDINGS OF THE FIFTH FACT EXTRACTION AND VERIFICATION WORKSHOP (FEVER 2022), P6
[18]
Recurrent Multimodal Interaction for Referring Image Segmentation. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 1280-1289
[19]
Learning to Assemble Neural Module Tree Networks for Visual Grounding. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 4672-4681