共 60 条
- [1] ViViT: A Video Vision Transformer [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6816 - 6826
- [2] Ba Lei Jimmy, 2016, arXiv
- [3] Soft-NMS - Improving Object Detection With One Line of Code [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
- [4] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
- [5] Rethinking the Faster R-CNN Architecture for Temporal Action Localization [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1130 - 1139
- [7] Chen G, 2022, AAAI CONF ARTIF INTE, P248
- [8] Dosovitskiy A., 2021, P INT C LEARN REPR, P1, DOI DOI 10.48550/ARXIV.2010.11929
- [9] TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3648 - 3656
- [10] Gao Jiyang, 2017, BMVC