Visual Dialog with Multi-turn Attentional Memory Network

被引:2
|
作者
Kong, Dejiang [1 ]
Wu, Fei [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I | 2018年 / 11164卷
基金
中国国家自然科学基金;
关键词
Visual dialog; Memory network; Multi-turn attention;
D O I
10.1007/978-3-030-00776-8_56
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual dialog is a task of answering a question given an input image, a historical dialog about the image and often requires to retrieve visual and textual facts about the question. This problem is different from visual question answering (VQA), which only relies on visual grounding estimated from an image and question pair, while visual dialog task requires interactions among a question, an input image and a historical dialog. Most methods rely on one-turn attention network to obtain facts w.r.t. a question. However, the information transition phenomenon which exists in these facts restricts these methods to retrieve all relevant information. In this paper, we propose a multi-turn attentional memory network for visual dialog. Firstly, we propose a attentional memory network that maintains image regions and historical dialog in two memory banks and attends the question to be answered to both the visual and textual banks to obtain multi-model facts. Further, considering the information transition phenomenon, we design a multi-turn attention architecture which attend to memory banks multiple turns to retrieve more facts in order to produce a better answer. We evaluate the proposed model in on VisDial v0.9 dataset and the experimental results prove the effectiveness of the proposed model.
引用
收藏
页码:611 / 621
页数:11
相关论文
共 50 条
  • [31] Dynamic memory network with spatial-temporal feature fusion for visual tracking
    Zhang, Hongchao
    Bao, Hua
    Lu, Yixiang
    Zhang, Dexiang
    Xun, Lina
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
  • [32] END-TO-END DYNAMIC QUERY MEMORY NETWORK FOR ENTITY-VALUE INDEPENDENT TASK-ORIENTED DIALOG
    Wu, Chien-Sheng
    Madotto, Andrea
    Winata, Genta Indra
    Fung, Pascale
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6154 - 6158
  • [33] Dense Attention Memory Network for Multi-modal emotion recognition
    Ma, Gailing
    Guo, Xiao
    2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 48 - 53
  • [34] Sarcasm Detection with Sentiment Semantics Enhanced Multi-level Memory Network
    Ren, Lu
    Xu, Bo
    Lin, Hongfei
    Liu, Xikai
    Yang, Liang
    NEUROCOMPUTING, 2020, 401 : 320 - 326
  • [35] Multi-Object Tracking and Segmentation with a Space-Time Memory Network
    Miah, Mehdi
    Bilodeau, Guillaume-Alexandre
    Saunier, Nicolas
    2023 20TH CONFERENCE ON ROBOTS AND VISION, CRV, 2023, : 184 - 193
  • [36] Brain-inspired memory network for visual tracking with recurrent meta-learning updater
    Zhang, Huanlong
    Song, Peipei
    Fu, Weiqiang
    Wang, Xin
    Zhong, Bineng
    Wang, Yanfeng
    DIGITAL SIGNAL PROCESSING, 2025, 162
  • [37] Memory Network With Pixel-Level Spatio-Temporal Learning for Visual Object Tracking
    Zhou, Zechu
    Zhou, Xinyu
    Chen, Zhaoyu
    Guo, Pinxue
    Liu, Qian-Yu
    Zhang, Wenqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6897 - 6911
  • [38] Multi-Modal Memory Enhancement Attention Network for Image-Text Matching
    Ji, Zhong
    Lin, Zhigang
    Wang, Haoran
    He, Yuqing
    IEEE ACCESS, 2020, 8 : 38438 - 38447
  • [39] A Multi-Frequency Memory Network for Short-Term Electricity Load Forecasting
    Li, Yulin
    Gu, Yiming
    Guo, Tuo
    Kang, Jiachen
    Xu, Bingrong
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 161 - 165
  • [40] Memory network with hierarchical multi-head attention for aspect-based sentiment analysis
    Yuzhong Chen
    Tianhao Zhuang
    Kun Guo
    Applied Intelligence, 2021, 51 : 4287 - 4304