共 122 条
[41]
Gabeur Valentin, 2020, P ECCV, V5
[42]
Video Action Transformer Network
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:244-253
[43]
Goyal Priya, 2017, CORR
[44]
The "something something" video database for learning and evaluating visual common sense
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:5843-5851
[45]
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6047-6056
[46]
Guo M.-H., 2020, ARXIV201209688
[47]
Hang Zhang, 2020, RESNEST SPLIT ATTENT
[48]
Hanin B, 2018, ADV NEUR IN, V31
[49]
He K., 2015, C COMPUTER VISION PA, DOI DOI 10.1109/CVPR.2016.90
[50]
He K., 2017, PROC IEEE INT C COMP, P2961