共 24 条
[1]
Gatys LA, 2015, Arxiv, DOI [arXiv:1508.06576, DOI 10.48550/ARXIV.1508.06576]
[2]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4724-4733
[3]
MARS: Motion-Augmented RGB Stream for Action Recognition
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:7874-7883
[4]
Deep Temporal Linear Encoding Networks
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:1541-1550
[5]
Learning Spatiotemporal Features with 3D Convolutional Networks
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:4489-4497
[6]
Convolutional Two-Stream Network Fusion for Video Action Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:1933-1941
[7]
ActionVLAD: Learning spatio-temporal aggregation for action classification
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3165-3174
[8]
The "something something" video database for learning and evaluating visual common sense
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:5843-5851
[9]
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6546-6555
[10]
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
[J].
COMPUTER VISION - ECCV 2016, PT II,
2016, 9906
:694-711