共 23 条
[1]
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:12479-12488
[2]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[3]
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:4724-4733
[4]
Chen J, 2020, P ACM INT C MULT MM, P4605
[5]
Chen JW, 2019, AAAI CONF ARTIF INTE, P8167
[6]
Chen SX, 2019, AAAI CONF ARTIF INTE, P8191
[8]
A Novel Image Captioning Method Based on Generative Adversarial Networks
[J].
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV,
2019, 11730
:281-292
[10]
Hou JY, 2020, AAAI CONF ARTIF INTE, V34, P10973