58 entries in total
- [1] Abacha A. B., 2019, CLEF2019 WORKING NOT, V2, P1
- [2] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018 : 6077 - 6086
- [3] VQA: Visual Question Answering [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 : 2425 - 2433
- [5] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering [J]. IEEE ACCESS, 2020, 8 : 35662 - 35671
- [6] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
- [7] Multiple Meta-model Quantifying for Medical Visual Question Answering [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT V, 2021, 12905 : 64 - 74
- [9] Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018 : 6087 - 6096
- [10] Finn C, 2017, PR MACH LEARN RES, V70