共 43 条
[11]
Learning Spatiotemporal Features with 3D Convolutional Networks
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:4489-4497
[12]
Convolutional Two-Stream Network Fusion for Video Action Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:1933-1941
[15]
The "something something" video database for learning and evaluating visual common sense
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:5843-5851
[16]
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6047-6056
[18]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778
[19]
Towards understanding action recognition
[J].
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2013,
:3192-3199