66 entries in total
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 6077-6086
[2]
[Anonymous], 2020, P IEEE C COMP VIS PA, DOI 10.1109/BIBM49941.2020.9313406
[3]
[Anonymous], 2015, Microsoft COCO captions: Data collection and evaluation server
[4]
[Anonymous], 2020, ARXIV200607733
[7]
End-to-End Object Detection with Transformers [J]. COMPUTER VISION - ECCV 2020, PT I, 2020, 12346: 213-229
[8]
Caron M, 2020, ADV NEUR IN, V33
[9]
Chen T., 2020, ICML
[10]
Webly Supervised Learning of Convolutional Networks [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 1431-1439