56 entries in total
[1]
[Anonymous], 2022, P IEEE CVF C COMP VI, DOI 10.1109/ICPSASIA55496.2022.9949880
[2]
ViViT: A Video Vision Transformer. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 6816-6826
[3]
Bai Song, 2021, arXiv preprint arXiv:2107.05790
[4]
Bertasius G, 2021, Proceedings of Machine Learning Research, V139
[5]
End-to-End Object Detection with Transformers. COMPUTER VISION - ECCV 2020, PT I, 2020, 12346: 213-229
[6]
Emerging Properties in Self-Supervised Vision Transformers. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 9630-9640
[7]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 4724-4733
[8]
Chen Tianlong, 2021, Advances in Neural Information Processing Systems, V34
[9]
Chen Xinlei, 2021, arXiv preprint arXiv:2104.02057
[10]
Graph-Based Global Reasoning Networks. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 433-442