共 13 条
[1]
Ba Jimmy, 2014, Advances in Neural Information Processing Systems, P2654
[2]
Less Is More: Picking Informative Frames for Video Captioning
[J].
COMPUTER VISION - ECCV 2018, PT XIII,
2018, 11217
:367-384
[3]
Chen Z, 2018, PREPRINT
[4]
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[5]
Hinton GE, 2015, ARXIV
[6]
Li Q., 2017, CVPR, DOI DOI 10.1109/CVPR.2017.776
[7]
Microsoft COCO: Common Objects in Context
[J].
COMPUTER VISION - ECCV 2014, PT V,
2014, 8693
:740-755
[8]
Stacked Hourglass Networks for Human Pose Estimation
[J].
COMPUTER VISION - ECCV 2016, PT VIII,
2016, 9912
:483-499
[9]
Towards Accurate Multi-person Pose Estimation in the Wild
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3711-3719
[10]
Romero N., 2014, ABS14126550 ARXIV