Character-Aware Convolutional Neural Networks for Paraphrase Identification

被引:1
作者
Huang, Jiangping [1 ]
Ji, Donghong [1 ]
Yao, Shuxin [2 ]
Huang, Wenzhi [1 ]
机构
[1] Wuhan Univ, Comp Sch, Wuhan 430072, Peoples R China
[2] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
来源
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II | 2016年 / 9948卷
关键词
CNN; Sentence model; Paraphrase identification; Twitter;
D O I
10.1007/978-3-319-46672-9_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Network (CNN) have been successfully used for many natural language processing applications. In this paper, we propose a novel CNN model for sentence-level paraphrase identification. We learn the sentence representations using character-aware convolutional neural network that relies on character-level input and gives sentence-level representation. Our model adopts both random and one-hot initialized methods for character representation and trained with two paraphrase identification corpora including news and social media sentences. A comparison between the results of our approach and the typical systems participating in challenge on the news sentence, suggest that our model obtains a comparative performance with these baselines. The experimental result with tweets corpus shows that the proposed model has a significant performance than baselines. The results also suggest that character inputs are effective for modeling sentences.
引用
收藏
页码:177 / 184
页数:8
相关论文
共 21 条
  • [1] [Anonymous], 2014, Advances in Neural Information Processing Systems
  • [2] [Anonymous], 2015, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
  • [3] [Anonymous], 2014, P C EMP METH NAT LAN, DOI [DOI 10.3115/V1/D14-1005, 10.3115/v1/D14-1005]
  • [4] [Anonymous], 2004, P 20 INT C COMP LING
  • [5] [Anonymous], AAAI
  • [6] Bicici E., 2015, SemEval-2015, page, P56
  • [7] Blacoe W., 2012, P 2012 JOINT C EMPIR, P546, DOI 10.5555/2390948.2391011
  • [8] Cheng J., 2015, PROC C EMPIRICAL MET, P1531, DOI DOI 10.18653/V1/D15-1177
  • [9] Collobert R, 2011, J MACH LEARN RES, V12, P2493
  • [10] Das Dipanjan, 2009, P JOINT C ANN M ASS, V1, P468