Siamese Network cooperating with Multi-head Attention for semantic sentence matching

被引:4
作者
Yuan, Zhao [1 ]
Jun, Sun [1 ]
机构
[1] Jiangnan Univ, Sch Internet Things Engn, Wuxi, Jiangsu, Peoples R China
来源
2020 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2020) | 2020年
基金
国家重点研发计划;
关键词
Siamese network; Multi-Head Attention; Semantic Matching;
D O I
10.1109/DCABES50732.2020.00068
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
To compare a pair of sentences is a fundamental technology in many NLP tasks. According to the difference between the pair of sentence, we divide semantic sentence matching into two situations: Situation A is that the pair of sentences are worded with a context relationship, Situation B is that two are equal in semantics. Models for Situation A works in Situation B too, so prior deep work mostly model each sentence's representation considering the interaction of the other sentence simultaneously. However, models designed for Situation A bring redundant information for Situation B. In this paper, for sentence pairs with equivalence, we present a deep architecture with comparison-interaction separated to match two sentences, which based on Siamese network for comparison and multi-head attention for interaction information between sentence pairs. Experimental results on the latest Chinese sentence matching datasets outline the effectiveness of our approach.
引用
收藏
页码:235 / 238
页数:4
相关论文
共 17 条
  • [1] [Anonymous], 2016, COLING
  • [2] Ant Financial, ANT FIN ART COMP
  • [3] Ba J. L., 2016, P ADV NEUR INF PROC
  • [4] Enhanced LSTM for Natural Language Inference
    Chen, Qian
    Zhu, Xiaodan
    Ling, Zhenhua
    Wei, Si
    Jiang, Hui
    Inkpen, Diana
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1657 - 1668
  • [5] Dozat T., 2016, INT C LEARN REPR WOR, P1
  • [6] Glorot X., 2011, INT C ARTIF INTELLIG, P315
  • [7] Ioffe S, 2015, PR MACH LEARN RES, V37, P448
  • [8] Junyi S., JIEBA
  • [9] Meng YX, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P3242
  • [10] Mikolov T, 2013, Efficient estimation of word representations in vector space