共 49 条
[1]
Weakly-Supervised Alignment of Video With Text
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:4462-4470
[2]
SST: Single-Stream Temporal Action Proposals
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:6373-6382
[3]
Heilbron FC, 2015, PROC CVPR IEEE, P961, DOI 10.1109/CVPR.2015.7298698
[4]
Chen L, 2020, AAAI CONF ARTIF INTE, V34, P10551
[5]
Chen Ting, 2019, PMLR
[6]
An Empirical Study of Training Self-Supervised Vision Transformers
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:9620-9629
[8]
Learning Spatiotemporal Features with 3D Convolutional Networks
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:4489-4497
[9]
Feng Y., 2018, PROC EUR C COMPUT VI
[10]
Spatio-temporal Video Re-localization by Warp LSTM
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:1288-1297