41 entries in total
[1]
MiniROAD: Minimal RNN Framework for Online Action Detection
[J].
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023),
2023,
:10307-10316
[2]
ViViT: A Video Vision Transformer
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:6816-6826
[3]
Azorín-López J, 2013, IEEE IJCNN
[5]
Bao H., 2021, arXiv
[7]
Predicting Human-Object Interactions in Egocentric Videos
[J].
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN),
2022,
[8]
Interaction Estimation in Egocentric Videos via Simultaneous Hand-Object Recognition
[J].
16TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2021),
2022, 1401
:439-448
[9]
VINDLU: A Recipe for Effective Video-and-Language Pretraining
[J].
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2023,
:10739-10750