HSAN: A HIERARCHICAL SELF-ATTENTION NETWORK FOR MULTI-TURN DIALOGUE GENERATION

被引:9
作者
Kong, Yawei [1 ,2 ]
Zhang, Lu [1 ,2 ]
Ma, Can [2 ]
Cao, Cong [2 ]
机构
[1] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
Dialogue generation; Hierarchical network; Self attention;
D O I
10.1109/ICASSP39728.2021.9413753
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the multi-turn dialogue system, response generation is not only related to the sentences in context but also relies on the words in each utterance. Although there are lots of methods that pay attention to model words and utterances, there still exist problems such as tending to generate common responses. In this paper, we propose a hierarchical self-attention network, named HSAN, which attends to the important words and utterances in context simultaneously. Firstly, we use the hierarchical encoder to update the word and utterance representations with their position information respectively. Secondly, the response representations are updated by the mask self-attention module in the decoder. Finally, the relevance between utterances and response is computed by another self-attention module and used for the next response decoding process. In terms of automatic metrics and human judgements, experimental results show that HSAN significantly outperforms all baselines on two common public datasets.
引用
收藏
页码:7433 / 7437
页数:5
相关论文
共 16 条
  • [1] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [2] Danescu-Niculescu-Mizil C., 2011, P 2 WORKSH COGN LING, P76
  • [3] The measurement of textual coherence with latent semantic analysis
    Foltz, PW
    Kintsch, W
    Landauer, TK
    [J]. DISCOURSE PROCESSES, 1998, 25 (2-3) : 285 - 307
  • [4] Forgues Gabriel, 2014, NIPS MODERN MACHINE, V2
  • [5] Li J, 2016, NAACL HLT, P110, DOI 10.18653/v1/N16-1014
  • [6] Lowe R., 2015, SIGDIAL, P285, DOI [DOI 10.18653/V1/W15-4640, 10.18653/v1/W15-4640]
  • [7] Park, 2018, ARXIV180403424
  • [8] AliMe Chat: A Sequence to Sequence and Rerank based Chatbot Engine
    Qiu, Minghui
    Li, Feng-Lin
    Wang, Siyu
    Gao, Xing
    Chen, Yan
    Zhao, Weipeng
    Chen, Haiqing
    Huang, Jun
    Chu, Wei
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 498 - 503
  • [9] Rus Vasile, 2012, Proceedings of the Seventh Workshop on Building Educational Applications Using NLP, P157
  • [10] Serban IV, 2017, AAAI CONF ARTIF INTE, P3295