49 entries in total
- [1] G3RAPHGROUND: Graph-based Language Grounding [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 4280 - 4289
- [2] Cai Han, 2019, INT C LEARN REPR
- [3] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
- [4] Self-Adaptive Network Pruning [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 175 - 186
- [5] Chen L, 2021, AAAI CONF ARTIF INTE, V35, P1036
- [6] You Look Twice: GaterNet for Dynamic Filter Selection in CNNs [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 9164 - 9172
- [7] TransVG: End-to-End Visual Grounding with Transformers [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 1749 - 1759
- [8] Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
- [9] Bejnordi BE, 2020, Arxiv, DOI arXiv:1907.06627
- [10] Escalante H. J., 2010, The segmented and annotated IAPR TC-12 benchmark