A Fusion Model-Based Label Embedding and Self-Interaction Attention for Text Classification

被引:29
作者
Dong, Yanru [1 ]
Liu, Peiyu [1 ]
Zhu, Zhenfang [2 ]
Wang, Qicai [1 ]
Zhang, Qiuyue [1 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Peoples R China
[2] Shandong Jiaotong Univ, Sch Informat Sci & Elect Engn, Jinan 250357, Peoples R China
基金
国家教育部科学基金资助;
关键词
Text categorization; Semantics; Feature extraction; Natural language processing; Bit error rate; Task analysis; Neural networks; Text classification; text representations; label embedding;
D O I
10.1109/ACCESS.2019.2954985
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification is a pivotal task in NLP (Natural Language Processing), which has received widespread attention recently. Most of the existing methods leverage the power of deep learning to improve the performance of models. However, these models ignore the interaction information between all the sentences in a text when generating the current text representation, which results in a partial semantics loss. Labels play a central role in text classification. And the attention learned from text-label in the joint space of labels and words is not leveraged, leaving enough room for further improvement. In this paper, we propose a text classification method based on Self-Interaction attention mechanism and label embedding. Firstly, our method introduce BERT (Bidirectional Encoder Representation from Transformers) to extract text features. Then Self-Interaction attention mechanism is employed to obtain text representations containing more comprehensive semantics. Moreover, we focus on the embedding of labels and words in the joint space to achieve the dual-label embedding, which further leverages the attention learned from text-label. Finally, the texts are classified by the classifier according to the weighted labels representations. The experimental results show that our method outperforms other state-of-the-art methods in terms of classification accuracy.
引用
收藏
页码:30548 / 30559
页数:12
相关论文
共 49 条
  • [11] E. Peters M, 2018, LONG PAPERS, DOI [10.18653/v1/N18-1202, DOI 10.18653/V1/N18-1202]
  • [12] Gehring J, 2017, PR MACH LEARN RES, V70
  • [13] Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    Graves, A
    Schmidhuber, J
    [J]. NEURAL NETWORKS, 2005, 18 (5-6) : 602 - 610
  • [14] Multi Label Classification Methods for Green Computing and Application for Mobile Medical Recommendations
    Guo, Li
    Jin, Bo
    Yu, Ruiyun
    Yao, Cuili
    Sun, Chonglin
    Huang, Degen
    [J]. IEEE ACCESS, 2016, 4 : 3201 - 3209
  • [15] Convolutional Recurrent Deep Learning Model for Sentence Classification
    Hassan, Abdalraouf
    Mahmood, Ausif
    [J]. IEEE ACCESS, 2018, 6 : 13949 - 13957
  • [16] Ifrim G., 2008, Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, P354, DOI 10.1145/1401890.1401936
  • [17] Ikonomakis M., 2005, WSEAS Transactions on Computers, V4, P966
  • [18] Joachims T, 1998, P 10 EUR C MACH LEAR, P137, DOI DOI 10.1007/BFB0026683
  • [19] MIMIC-III, a freely accessible critical care database
    Johnson, Alistair E. W.
    Pollard, Tom J.
    Shen, Lu
    Lehman, Li-wei H.
    Feng, Mengling
    Ghassemi, Mohammad
    Moody, Benjamin
    Szolovits, Peter
    Celi, Leo Anthony
    Mark, Roger G.
    [J]. SCIENTIFIC DATA, 2016, 3
  • [20] Joulin Armand, 2017, P 15 C EUR CHAPT ASS, V2, P427, DOI DOI 10.18653/V1/E17-2068