Learning to Answer Questions from Image Using Convolutional Neural Network

被引:0
|
作者
Ma, Lin [1 ]
Lu, Zhengdong [1 ]
Li, Hang [1 ]
机构
[1] Huawei Technol, Noahs Ark Lab, Shenzhen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose to employ the convolutional neural network (CNN) for the image question answering (QA) task. Our proposed CNN provides an end-to-end framework with convolutional architectures for learning not only the image and question representations, but also their inter-modal interactions to produce the answer. More specifically, our model consists of three CNNs: one image CNN to encode the image content, one sentence CNN to compose the words of the question, and one multimodal convolution layer to learn their joint representation for the classification in the space of candidate answer words. We demonstrate the efficacy of our proposed model on the DAQUAR and COCO-QA datasets, which are two benchmark datasets for image QA, with the performances significantly outperforming the state-of-the-art.
引用
收藏
页码:3567 / 3573
页数:7
相关论文
共 50 条
  • [31] Text image refocusing by using the convolutional neural network
    Wang, Kangkang
    Wang, Keyan
    Li, Yunsong
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2018, 45 (04): : 80 - 85
  • [32] GRAYSCALE IMAGE COLORIZATION USING A CONVOLUTIONAL NEURAL NETWORK
    Jwa, Minje
    Kang, Myungjoo
    JOURNAL OF THE KOREAN SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS, 2021, 25 (02) : 26 - 38
  • [33] Endoscopic Image Colorization Using Convolutional Neural Network
    Jiang, HuiPeng
    Tang, SongYuan
    Li, Yating
    Ai, Danni
    Song, Hong
    Yang, Jian
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (ICBCB 2019), 2019, : 162 - 166
  • [34] Pathology Image Classification Using Convolutional Neural Network
    Li, Qunxian
    2015 2ND INTERNATIONAL CONFERENCE ON EDUCATION AND EDUCATION RESEARCH (EER 2015), PT 5, 2015, 9 : 331 - 335
  • [35] Advertisement Image Classification Using Convolutional Neural Network
    An Tien Vo
    Hai Son Tran
    Thai Hoang Le
    2017 9TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2017), 2017, : 197 - 202
  • [36] Image Distortion Detection using Convolutional Neural Network
    Ahn, Namhyuk
    Kang, Byungkon
    Sohn, Kyung-Ah
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 220 - 225
  • [37] Image Classification using small Convolutional Neural Network
    Tripathi, Shyava
    Kumar, Rishi
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 483 - 487
  • [38] SAR Image Despeckling Using a Convolutional Neural Network
    Wang, Puyang
    Zhang, He
    Patel, Vishal M.
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (12) : 1763 - 1767
  • [39] Image Captioning using Convolutional Neural Networks and Recurrent Neural Network
    Calvin, Rachel
    Suresh, Shravya
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [40] An Enhanced Convolutional Neural Network Model for Answer Selection
    Guo, Jiahui
    Yue, Bin
    Xu, Guandong
    Yang, Zhenglu
    Wei, Jin-Mao
    WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 789 - 790