共 75 条
[1]
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition
[J].
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV),
2023,
:3319-3328
[2]
ViViT: A Video Vision Transformer
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:6816-6826
[4]
Bahsoon R, 2017, SOFTWARE ARCHITECTURE FOR BIG DATA AND THE CLOUD, P1, DOI 10.1016/B978-0-12-805467-3.00001-6
[5]
Attention Augmented Convolutional Networks
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:3285-3294
[6]
Bertasius G, 2021, PR MACH LEARN RES, V139
[7]
Efficient Video Classification Using Fewer Frames
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:354-363
[9]
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
[J].
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022),
2022,
:786-797
[10]
Cipriani Giulio, 2021, Advances in Italian Mechanism Science. Proceedings of the 3rd International Conference of IFToMM Italy. Mechanisms and Machine Science (MMS 91), P260, DOI 10.1007/978-3-030-55807-9_30