共 57 条
- [1] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6077 - 6086
- [2] SPICE: Semantic Propositional Image Caption Evaluation [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 382 - 398
- [3] Visual Explanations for DNNs with Contextual Importance [J]. EXPLAINABLE AND TRANSPARENT AI AND MULTI-AGENT SYSTEMS, EXTRAAMAS 2021, 2021, 12688 : 83 - 96
- [4] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
- [5] Bengio Y., 2009, ICML
- [6] Chen L, 2020, PROC CVPR IEEE, P10797, DOI 10.1109/CVPR42600.2020.01081
- [7] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6298 - 6306
- [8] Chen XL, 2015, Arxiv, DOI arXiv:1504.00325
- [9] Cheng Xu, 2019, 2019 IEEE International Conference on Unmanned Systems and Artificial Intelligence (ICUSAI), P172, DOI 10.1109/ICUSAI47366.2019.9124779
- [10] What does BERT look at? An Analysis of BERT's Attention [J]. BLACKBOXNLP WORKSHOP ON ANALYZING AND INTERPRETING NEURAL NETWORKS FOR NLP AT ACL 2019, 2019, : 276 - 286