共 93 条
[41]
Li BY, 2019, AAAI CONF ARTIF INTE, P8577
[42]
Entangled Transformer for Image Captioning
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:8927-8936
[43]
Li X., 2020, P EUR C COMP VIS, P121
[45]
Lin C.-Y., 2004, TEXT SUMMARIZATION B, P74
[46]
Focal Loss for Dense Object Detection
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:2999-3007
[47]
Microsoft COCO: Common Objects in Context
[J].
COMPUTER VISION - ECCV 2014, PT V,
2014, 8693
:740-755
[48]
Liu F., 2019, ARXIV190506139
[49]
Liu FL, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P5095
[50]
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3242-3250