共 64 条
[23]
Entangled Transformer for Image Captioning
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:8927-8936
[25]
Li Q, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P1338
[27]
Recurrent Topic-Transition GAN for Visual Paragraph Generation
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:3382-3391
[28]
Microsoft COCO: Common Objects in Context
[J].
COMPUTER VISION - ECCV 2014, PT V,
2014, 8693
:740-755
[29]
Feature Pyramid Networks for Object Detection
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:936-944
[30]
Leveraging Visual Question Answering for Image-Caption Ranking
[J].
COMPUTER VISION - ECCV 2016, PT II,
2016, 9906
:261-277