Knowledge enhancement and scene understanding for knowledge-based visual question answering

Cited by: 0
Authors
Zhenqiang Su
Gang Gou
Affiliations
[1] Guizhou University, State Key Laboratory of Public Big Data
[2] Guizhou University, College of Computer Science and Technology
Source
Knowledge and Information Systems | 2024 / Vol. 66
Keywords
Visual question answering; Feature fusion; Knowledge enhancement; Knowledge discovery; Scene understanding
DOI
Not available
Abstract
Knowledge-based visual question answering requires attending not only to the visual content of images but also to relevant external knowledge that supports reasoning about the question and its answer. Because knowledge retrieval relies on more than visual information alone, the semantics of the question should not be overlooked. To better combine visual and knowledge information, this paper first proposes a question-based semantic retrieval strategy that compensates for knowledge that image-based retrieval fails to supply. Second, image captions are added to help the model achieve better scene understanding. Finally, modal knowledge is represented and accumulated through triplets. Experimental results on the OK-VQA dataset show that the proposed method improves on the two baseline methods by 4.24% and 1.90%, respectively, demonstrating its effectiveness.
Pages: 2193-2208
Page count: 15