共 96 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
SPICE: Semantic Propositional Image Caption Evaluation
[J].
COMPUTER VISION - ECCV 2016, PT V,
2016, 9909
:382-398
[3]
[Anonymous], 2010, Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
[4]
Probabilistic Debiasing of Scene Graphs
[J].
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2023,
:10429-10438
[5]
Breese J.S., 2013, CORR
[6]
Counterfactual Critic Multi-Agent Training for Scene Graph Generation
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:4612-4622
[7]
Knowledge-Embedded Routing Network for Scene Graph Generation
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:6156-6164
[8]
Destruction and Construction Learning for Fine-grained Image Recognition
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:5152-5161
[9]
Cohn-Gordon R., 2018, P 2018 C N AM CHAPT, V2, P439, DOI DOI 10.18653/V1/N18-2070