Learning to Answer Questions from Image Using Convolutional Neural Network

被引:0
|
作者
Ma, Lin [1 ]
Lu, Zhengdong [1 ]
Li, Hang [1 ]
机构
[1] Huawei Technol, Noahs Ark Lab, Shenzhen, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose to employ the convolutional neural network (CNN) for the image question answering (QA) task. Our proposed CNN provides an end-to-end framework with convolutional architectures for learning not only the image and question representations, but also their inter-modal interactions to produce the answer. More specifically, our model consists of three CNNs: one image CNN to encode the image content, one sentence CNN to compose the words of the question, and one multimodal convolution layer to learn their joint representation for the classification in the space of candidate answer words. We demonstrate the efficacy of our proposed model on the DAQUAR and COCO-QA datasets, which are two benchmark datasets for image QA, with the performances significantly outperforming the state-of-the-art.
引用
收藏
页码:3567 / 3573
页数:7
相关论文
共 50 条
  • [41] Experiments with Convolutional Neural Network Models for Answer Selection
    Rao, Jinfeng
    He, Hua
    Lin, Jimmy
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1217 - 1220
  • [42] Image-Based Learning to Measure Traffic Density Using a Deep Convolutional Neural Network
    Chung, Jiyong
    Sohn, Keemin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (05) : 1670 - 1675
  • [43] Medical image enhancement algorithms using deep learning-based convolutional neural network
    C. Ghandour
    Walid El-Shafai
    S. El-Rabaie
    Journal of Optics, 2023, 52 : 1931 - 1941
  • [44] Medical image enhancement algorithms using deep learning-based convolutional neural network
    Ghandour, C.
    El-Shafai, Walid
    El-Rabaie, S.
    JOURNAL OF OPTICS-INDIA, 2023, 52 (04): : 1931 - 1941
  • [45] Reverse Image Search Using Deep Unsupervised Generative Learning and Deep Convolutional Neural Network
    Kiran, Aqsa
    Qureshi, Shahzad Ahmad
    Khan, Asifullah
    Mahmood, Sajid
    Idrees, Muhammad
    Saeed, Aqsa
    Assam, Muhammad
    Refaai, Mohamad Reda A.
    Mohamed, Abdullah
    APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [46] Image retrieval method based on metric learning for convolutional neural network
    Wang, Jieyuan
    Qian, Ying
    Ye, Qingqing
    Wang, Biao
    2017 2ND INTERNATIONAL SEMINAR ON ADVANCES IN MATERIALS SCIENCE AND ENGINEERING, 2017, 231
  • [47] Convolutional neural network based on an extreme learning machine for image classification
    Park, Youngmin
    Yang, Hyun S.
    NEUROCOMPUTING, 2019, 339 : 66 - 76
  • [48] Sign Language Learning System with Image Sampling and Convolutional Neural Network
    Ji, Yangho
    Kim, Sunmok
    Lee, Ki-Baek
    2017 FIRST IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC), 2017, : 371 - 375
  • [49] Weather Image Recognition Based on Convolutional Neural Network and Transfer Learning
    Gao, Zunhai
    Qiu, Yuzhan
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024, : 631 - 638
  • [50] A Convolutional Neural Network Image Classification Based on Extreme Learning Machine
    Wang, Shasha
    Liu, Daohua
    Yang, Zhipeng
    Feng, Chen
    Yao, Ruiling
    IAENG International Journal of Computer Science, 2021, 48 (03): : 1 - 5