38 items in total
- [2] Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023: 3078-3089
- [3] Knowledge-Constrained Answer Generation for Open-Ended Video Question Answering. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023: 8141-8149
- [4] Open-Ended Multi-Modal Relational Reasoning for Video Question Answering. 2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN, 2023: 363-369
- [5] Coarse to Fine Frame Selection for Online Open-ended Video Question Answering. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023: 353-361
- [6] BVQA: Connecting Language and Vision Through Multimodal Attention for Open-Ended Question Answering. IEEE ACCESS, 2025, 13: 27570-27586
- [7] Open-Ended Medical Visual Question Answering Through Prefix Tuning of Language Models. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT V, 2023, 14224: 726-736
- [9] Large Language Models are Temporal and Causal Reasoners for Video Question Answering. 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023: 4300-4316
- [10] Open-Ended Long-form Video Question Answering via Adaptive Hierarchical Reinforced Networks. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018: 3683-3689