ATNet: Answering Cloze-Style Questions via Intra-attention and Inter-attention

Cited by: 2
Authors
Fu, Chengzhen [1]
Li, Yuntao [1]
Zhang, Yan [1]
Affiliation
[1] Peking Univ, Dept Machine Intelligence, Beijing, Peoples R China
Source
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT II | 2019 / Vol. 11440
Keywords
Question answering; Intra-attention; Inter-attention
DOI
10.1007/978-3-030-16145-3_19
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a novel framework, named ATNet, for answering cloze-style questions over documents. In the encoder phase, our model projects all contextual embeddings into multiple latent semantic spaces, with the representations in each space attending to a specific aspect of semantics. Long-term dependencies across the whole document are captured by the intra-attention module. A gate is produced to control how much of the retrieved dependency information is fused and how much of the previous token embedding is exposed. Then, in the interaction phase, the context is aligned with the query across the different semantic spaces to aggregate information. Specifically, we compute inter-attention based on a sophisticated feature set. Experiments and ablation studies demonstrate the effectiveness of ATNet.
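The gated fusion described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's equations, so everything concrete here is an assumption made for illustration: scaled dot-product scoring, a single scalar sigmoid gate per token, and the hypothetical names `gated_intra_attention`, `gate_w`, and `gate_b`. The projection into multiple semantic spaces and the inter-attention step are omitted.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def gated_intra_attention(tokens, gate_w, gate_b=0.0):
    """Fuse each token with a self-attended summary of the document.

    tokens : list of d-dimensional token embeddings (lists of floats)
    gate_w : hypothetical gate weights of length 2*d (assumption)
    gate_b : hypothetical gate bias (assumption)
    """
    d = len(tokens[0])
    scale = math.sqrt(d)
    fused = []
    for x in tokens:
        # Attend from this token to every token in the document.
        weights = softmax([dot(x, y) / scale for y in tokens])
        context = [sum(w * y[k] for w, y in zip(weights, tokens))
                   for k in range(d)]
        # Scalar gate computed from the token and its retrieved context.
        g = 1.0 / (1.0 + math.exp(-(dot(gate_w, x + context) + gate_b)))
        # g near 1 exposes the retrieved context; g near 0 keeps the token.
        fused.append([g * c + (1.0 - g) * xi for c, xi in zip(context, x)])
    return fused


if __name__ == "__main__":
    doc = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(gated_intra_attention(doc, gate_w=[0.1, 0.1, 0.1, 0.1]))
```

With a strongly negative gate bias the output reduces to the original token embeddings, and with a strongly positive bias it reduces to the attended context, which is the trade-off the gate is meant to control.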
Pages: 242-252
Page count: 11