Chinese Short Text Classification Based on Interactive Attention Mechanism

被引:0
作者
Bian, Qinyu [1 ,2 ,3 ]
Rao, Yuan [1 ,2 ,3 ]
Wang, Leipeng [1 ,2 ,3 ]
Yang, Fan [1 ,2 ,3 ]
Dong, Shipeng [1 ,2 ,3 ]
机构
[1] Xi An Jiao Tong Univ, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
[2] Xi An Jiao Tong Univ, Software Sch, Lab Social Intelligence & Complex Data Proc, Xian 710049, Peoples R China
[3] Xi An Jiao Tong Univ, Shannxi Joint Key Lab Artifact Intelligence, Sublab, Xian 710049, Peoples R China
来源
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN SIGNAL PROCESSING AND ARTIFICIAL INTELLIGENCE, ASPAI' 2020 | 2020年
关键词
Short text classification; Semantic enhancement; Interactive attention mechanism;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the explosive growth of social media, millions of Chinese short texts are produced every day. However, these short texts is exactly short, so the machine can not extract features accurately from short texts for classification learning. In this paper, we propose an effective semantic enhancement model that not only combines character-level features, word-level features and sentence-level features as additional features for Chinese short text, but also removes noise by introducing Interactive Attention Mechanism to measure the semantic similarity between the i-th character and short text representation. Experiments on the open dataset Chinese news headlines classification task (190,000 news headlines/18 tags) show that our model achieves 84.53 %, 84.32 % and 84.36 % in macro-averaged precision, recall, F1, which boosts 1.26 % F1, 1.22 % recall and 1.3 % precision and achieves the state-of-the art performance.
引用
收藏
页码:119 / 123
页数:5
相关论文
共 11 条
  • [1] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
  • [2] Chen J., 2019, PREPRINT
  • [3] Chung J., 2014, P NIPS 2014 WORKSH D
  • [4] Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, DOI 10.48550/ARXIV.1810.04805]
  • [5] Dong X., 2018, Journal of Chinese Information Processing, V32, P73
  • [6] Jingwen Li, 2018, 2018 IEEE 4th International Conference on Computer and Communications (ICCC). Proceedings, P2344, DOI 10.1109/CompComm.2018.8780948
  • [7] Mikolov T, 2013, Arxiv, DOI [arXiv:1301.3781, DOI 10.48550/ARXIV.1301.3781]
  • [8] Overview of the NLPCC 2017 Shared Task: Chinese News Headline Categorization
    Qiu, Xipeng
    Gong, Jingjing
    Huang, Xuanjing
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 948 - 953
  • [9] Wang J, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2915
  • [10] A Semantic Representation Enhancement Method for Chinese News Headline Classification
    Yin, Zhongbo
    Tang, Jintao
    Ru, Chengsen
    Luo, Wei
    Luo, Zhunchen
    Ma, Xiaolei
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 318 - 328