52 entries in total
[11] DAPs: Deep Action Proposals for Action Understanding [J]. COMPUTER VISION - ECCV 2016, PT III, 2016, 9907: 768-784
[12] TALL: Temporal Activity Localization via Language Query [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 5277-5285
[13] Gao MF, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P1481
[14] MAC: Mining Activity Concepts for Language-based Temporal Localization [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019: 245-253
[15] Momentum Contrast for Unsupervised Visual Representation Learning [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 9726-9735
[16] Deep Residual Learning for Image Recognition [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016: 770-778
[17] Localizing Moments in Video with Natural Language [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 5804-5813
[18] Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 7179-7188
[19] Jiang Bin, 2019, P 2019 INT C MULT RE, P217, DOI 10.1145/3323873.3325019