50 entries in total
- [31] Remember and forget: video and text fusion for video question answering. Multimedia Tools and Applications, 2018, 77: 29269-29282
- [32] Learning Question-Guided Video Representation for Multi-Turn Video Question Answering. 20th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2019), 2019: 215-225
- [33] Equivariant and Invariant Grounding for Video Question Answering. Proceedings of the 30th ACM International Conference on Multimedia, MM 2022, 2022: 4714-4722
- [34] TVQA: Localized, Compositional Video Question Answering. 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), 2018: 1369-1379
- [35] Hierarchical Relational Attention for Video Question Answering. 2018 25th IEEE International Conference on Image Processing (ICIP), 2018: 599-603
- [36] Research Progress of Video Question Answering Technologies. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61(03): 639-673
- [37] VQuAD: Video Question Answering Diagnostic Dataset. 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW 2022), 2022: 282-291
- [39] CSA-BERT: Video Question Answering. 2023 IEEE Statistical Signal Processing Workshop, SSP, 2023: 532-536
- [40] Uncovering the Temporal Context for Video Question Answering. International Journal of Computer Vision, 2017, 124: 409-421