Context-Aware Interaction Network for Question Matching

被引:0
作者
Hu, Zhe [1 ]
Fu, Zuohui [2 ]
Yin, Yu [3 ]
de Melo, Gerard [4 ]
机构
[1] Baidu Inc, Beijing, Peoples R China
[2] Rutgers State Univ, New Brunswick, NJ USA
[3] Northeastern Univ, Boston, MA 02115 USA
[4] Univ Potsdam, HPI, Potsdam, Germany
来源
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Impressive milestones have been achieved in text matching by adopting a cross-attention mechanism to capture pertinent semantic connections between two sentence representations. However, regular cross-attention focuses on word-level links between the two input sequences, neglecting the importance of contextual information. We propose a context-aware interaction network (COIN) to properly align two sequences and infer their semantic relationship. Specifically, each interaction block includes (1) a context-aware cross-attention mechanism to effectively integrate contextual information when aligning two sequences, and (2) a gate fusion layer to flexibly interpolate aligned representations. We apply multiple stacked interaction blocks to produce alignments at different levels and gradually refine the attention results. Experiments on two question matching datasets and detailed analyses demonstrate the effectiveness of our model.
引用
收藏
页码:3846 / 3853
页数:8
相关论文
共 25 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], 2018, P 27 INT C COMPUTATI
  • [3] Chen Lu, 2020, P 58 ANN M ASS COMP, P6152
  • [4] Enhanced LSTM for Natural Language Inference
    Chen, Qian
    Zhu, Xiaodan
    Ling, Zhenhua
    Wei, Si
    Jiang, Hui
    Inkpen, Diana
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1657 - 1668
  • [5] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [6] Gao Y, 2016, PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), P1442, DOI 10.1109/WCICA.2016.7578677
  • [7] Glorot X., 2011, JMLR WORKSHOP C P, P315
  • [8] Gong Y., 2018, 6 INT C LEARNING REP
  • [9] Hu Z., 2020, P 6 WORKSH NOIS US G, P34, DOI DOI 10.18653/V1/2020.WNUT-1.6
  • [10] Liu MT, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P2652