共 40 条
- [21] Modeling Context Between Objects for Referring Expression Understanding [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 792 - 807
- [22] Spithourakis GP, 2018, Arxiv, DOI arXiv:1805.08154
- [23] Perez E, 2018, AAAI CONF ARTIF INTE, P3942
- [24] Qi P, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, P101
- [25] Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4171 - 4180
- [26] Redmon J., 2018, arXiv
- [27] Speer R, 2017, AAAI CONF ARTIF INTE, P4444
- [28] Multisensor Fusion and Explicit Semantic Preserving-Based Deep Hashing for Cross-Modal Remote Sensing Image Retrieval [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
- [29] Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 28 - 36
- [30] Wolf T, 2020, Arxiv, DOI [arXiv:1910.03771, DOI 10.48550/ARXIV.1910.03771]