共 52 条
[31]
Generation and Comprehension of Unambiguous Object Descriptions
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:11-20
[32]
Modeling Context Between Objects for Referring Expression Understanding
[J].
COMPUTER VISION - ECCV 2016, PT IV,
2016, 9908
:792-807
[33]
Pennington J., 2014, Proceedings of the Empiricial Methods in Natural Language Processing EMNLP 2014, DOI 10.3115/v1/D14-1162
[34]
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:724-732
[35]
Pont-Tuset J, 2017, ARXIV
[37]
Reiter Ehud., 1992, Proceedings of the 14th conference on Computational linguistics-Volume 1, V1, P232
[39]
Seo S., 2020, ECCV
[40]
Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556