共 60 条
- [1] Andreas J., 2016, NAACL, P1545
- [2] [Anonymous], 2014, P EMNLP
- [3] [Anonymous], 2010, P ECCV
- [4] [Anonymous], P BMVC2018 C
- [5] See-Through-Text Grouping for Referring Image Segmentation [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7453 - 7462
- [6] Chen J, 2020, P IEEE CVF C COMP VI, P9901
- [7] Dang K., 2018, ARXIV180708430
- [8] TALL: Temporal Activity Localization via Language Query [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5277 - 5285
- [9] Actor and Action Video Segmentation from a Sentence [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5958 - 5966
- [10] ActionVLAD: Learning spatio-temporal aggregation for action classification [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3165 - 3174