38 entries in total
- [1] [Anonymous], P 3 INT C LEARN REPR
- [2] Cao M, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P9810
- [3] Chen JY, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P162
- [4] Chen Ting, 2019, 25 AMERICAS C INFORM
- [5] Learning Spatiotemporal Features with 3D Convolutional Networks [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 4489-4497
- [6] Gao J., 2017, P IEEE INT C COMPUTE
- [7] Gao JL, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P3978
- [8] He DL, 2019, AAAI CONF ARTIF INTE, P8393
- [9] Localizing Moments in Video with Natural Language [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 5804-5813
- [10] Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention [J]. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019: 217-225