Variational Memory Encoder-Decoder

被引:0
|
作者
Hung Le [1 ]
Truyen Tran [1 ]
Thin Nguyen [1 ]
Venkatesh, Svetha [1 ]
机构
[1] Deakin Univ, Appl AI Inst, Geelong, Vic, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Introducing variability while maintaining coherence is a core task in learning to generate utterances in conversation. Standard neural encoder-decoder models and their extensions using conditional variational autoencoder often result in either trivial or digressive responses. To overcome this, we explore a novel approach that injects variability into neural encoder-decoder via the use of external memory as a mixture model, namely Variational Memory Encoder-Decoder (VMED). By associating each memory read with a mode in the latent mixture distribution at each timestep, our model can capture the variability observed in sequential data such as natural conversations. We empirically compare the proposed model against other recent approaches on various conversational datasets. The results show that VMED consistently achieves significant improvement over others in both metric-based and qualitative evaluations.
引用
收藏
页数:11
相关论文
共 50 条
  • [11] Understanding Geometry of Encoder-Decoder CNNs
    Ye, Jong Chul
    Sung, Woon Kyoung
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [12] A variational encoder-decoder approach to precise spectroscopic age estimation for large Galactic surveys
    Leung, Henry W.
    Bovy, Jo
    Mackereth, J. Ted
    Miglio, Andrea
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, 522 (03) : 4577 - 4597
  • [13] Weakly-Supervised Video Summarization Using Variational Encoder-Decoder and Web Prior
    Cai, Sijia
    Zuo, Wangmeng
    Davis, Larry S.
    Zhang, Lei
    COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 193 - 210
  • [14] Feedforward Sequential Memory Networks based Encoder-Decoder Model for Machine Translation
    Hou, Junfeng
    Zhang, Shiliang
    Dai, Lirong
    Jiang, Hui
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 622 - 625
  • [15] Investigation on the Encoder-Decoder Application for Mesh Generation
    Mameli, Marco
    Balloni, Emanuele
    Mancini, Adriano
    Frontoni, Emanuele
    Zingaretti, Primo
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT II, 2024, 14496 : 387 - 400
  • [16] Development of Secure Encoder-Decoder for JPEG Images
    Hamissa, Ghada
    Abd Elkader, Hatem
    Sarhan, Amany
    Fahmy, Mahmoud
    ICCES'2010: THE 2010 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS, 2010, : 189 - 194
  • [17] Encoder-decoder network with RMP for tongue segmentation
    Kusakunniran, Worapan
    Borwarnginn, Punyanuch
    Karnjanapreechakorn, Sarattha
    Thongkanchorn, Kittikhun
    Ritthipravat, Panrasee
    Tuakta, Pimchanok
    Benjapornlert, Paitoon
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (05) : 1193 - 1207
  • [18] Encoder-decoder multimodal speaker change detection
    Jung, Jee-weon
    Seo, Soonshin
    Heo, Hee-Soo
    Kim, Geonmin
    Kim, You Jin
    Kwon, Young-ki
    Lee, Minjae
    Lee, Bong-Jin
    INTERSPEECH 2023, 2023, : 5311 - 5315
  • [19] Understanding How Encoder-Decoder Architectures Attend
    Aitken, Kyle
    Ramasesh, Vinay V.
    Cao, Yuan
    Maheswaranathan, Niru
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [20] DOM Refinement with neural Encoder-Decoder Networks
    Metzger, Nando
    PFG-JOURNAL OF PHOTOGRAMMETRY REMOTE SENSING AND GEOINFORMATION SCIENCE, 2020, 88 (3-4): : 362 - 363