50 items in total
- [31] Erasing-based Attention Learning for Visual Question Answering. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019: 1175-1183
- [33] Counting Attention Based on Classification Confidence for Visual Question Answering. 2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019: 1173-1179
- [34] Multimodal Bi-direction Guided Attention Networks for Visual Question Answering. Neural Processing Letters, 2023, 55: 11921-11943
- [35] Question Modifiers in Visual Question Answering. LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022: 1472-1479
- [36] Co-Attention Network With Question Type for Visual Question Answering. IEEE ACCESS, 2019, 7: 40771-40781
- [37] MUTAN: Multimodal Tucker Fusion for Visual Question Answering. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 2631-2639
- [38] Multimodal Knowledge Reasoning for Enhanced Visual Question Answering. 2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022: 224-230
- [39] ICDAR 2021 Competition on Document Visual Question Answering. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824: 635-649
- [40] MUREL: Multimodal Relational Reasoning for Visual Question Answering. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 1989-1998