共 20 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
Meshed-Memory Transformer for Image Captioning
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:10575-10584
[3]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778
[4]
Herdade S, 2019, ADV NEUR IN, V32
[5]
Boost image captioning with knowledge reasoning
[J].
MACHINE LEARNING,
2020, 109 (12)
:2313-2332
[6]
Attention on Attention for Image Captioning
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:4633-4642
[7]
Ji JY, 2021, AAAI CONF ARTIF INTE, V35, P1655
[9]
Entangled Transformer for Image Captioning
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:8927-8936
[10]
Luo YP, 2021, AAAI CONF ARTIF INTE, V35, P2286