50 entries in total
- [21] Video Question Answering by Frame Attention. ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
- [22] Video Question Answering with Procedural Programs. COMPUTER VISION-ECCV 2024, PT XXXVIII, 2025, 15096: 315-332
- [23] Invariant Grounding for Video Question Answering. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 2918-2927
- [24] BERT Representations for Video Question Answering. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020: 1545-1554
- [26] Open-Ended Medical Visual Question Answering Through Prefix Tuning of Language Models. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT V, 2023, 14224: 726-736
- [27] BVQA: Connecting Language and Vision Through Multimodal Attention for Open-Ended Question Answering. IEEE ACCESS, 2025, 13: 27570-27586
- [28] Structured Attentions for Visual Question Answering. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 1300-1309