共 33 条
[1]
Heilbron FC, 2015, PROC CVPR IEEE, P961, DOI 10.1109/CVPR.2015.7298698
[2]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4724-4733
[3]
Rethinking the Faster R-CNN Architecture for Temporal Action Localization
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:1130-1139
[4]
Dosovitskiy A., 2021, P 9 INT C LEARN REPR
[5]
Multi-modal Transformer for Video Retrieval
[J].
COMPUTER VISION - ECCV 2020, PT IV,
2020, 12349
:214-229
[6]
He B., 2022, IEEE C COMPUTER VISI
[7]
Cross-modal Consensus Network forWeakly Supervised Temporal Action Localization
[J].
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021,
2021,
:1591-1599
[8]
Huang L., 2022, IEEE C COMPUTER VISI
[10]
Islam A, 2021, AAAI CONF ARTIF INTE, V35, P1637