40 entries in total
[21]
Modeling Context Between Objects for Referring Expression Understanding [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908: 792-807
[22]
Spithourakis GP, 2018, Arxiv, DOI arXiv:1805.08154
[23]
Perez E, 2018, AAAI CONF ARTIF INTE, P3942
[24]
Qi P, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, P101
[25]
Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020: 4171-4180
[26]
Redmon J., 2018, arXiv, DOI 10.48550/arXiv.1804.02767
[27]
Speer R, 2017, AAAI CONF ARTIF INTE, P4444
[28]
Multisensor Fusion and Explicit Semantic Preserving-Based Deep Hashing for Cross-Modal Remote Sensing Image Retrieval [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[29]
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020: 28-36
[30]
Wolf T, 2020, Arxiv, DOI arXiv:1910.03771