共 45 条
[1]
[Anonymous], 2014, ECCV
[2]
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[3]
Object Detection in Video with Spatiotemporal Sampling Networks
[J].
COMPUTER VISION - ECCV 2018, PT XII,
2018, 11216
:342-357
[4]
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:4042-4050
[5]
Cho Kyunghyun, 2014, C EMPIRICAL METHODS, P1724
[6]
Dai J., 2016, ADV NEURAL INFORM PR, P379, DOI DOI 10.1109/CVPR.2017.690
[7]
Deformable Convolutional Networks
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:764-773
[8]
FlowNet: Learning Optical Flow with Convolutional Networks
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2758-2766
[9]
Spatiotemporal Multiplier Networks for Video Action Recognition
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:7445-7454
[10]
Girshick R., 2015, P IEEE INT C COMPUTE, DOI [DOI 10.1109/ICCV.2015.169, 10.1109/ICCV.2015.169]