共 71 条
[1]
Aharon N, 2022, Arxiv, DOI [arXiv:2206.14651, DOI 10.48550/ARXIV.2206.14651]
[2]
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3425-3434
[3]
Boundary Content Graph Neural Network for Temporal Action Proposal Generation
[J].
COMPUTER VISION - ECCV 2020, PT XXVIII,
2020, 12373
:121-137
[5]
End-to-End Object Detection with Transformers
[J].
COMPUTER VISION - ECCV 2020, PT I,
2020, 12346
:213-229
[6]
Chang S., 2021, arXiv
[7]
Chappa Naga V. S. Raviteja, 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P5158, DOI 10.1109/CVPRW59228.2023.00544
[8]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[9]
Non-Local Neural Networks with Grouped Bilinear Attentional Transforms
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:11801-11810
[10]
TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking
[J].
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV),
2023,
:4859-4869