Hierarchical multi-attention networks for document classification

被引:28
作者
Huang, Yingren [1 ,2 ]
Chen, Jiaojiao [2 ]
Zheng, Shaomin [2 ]
Xue, Yun [2 ]
Hu, Xiaohui [2 ]
机构
[1] Guangdong Univ Foreign Studies, Lab Language Engn & Comp, Guangzhou, Guangdong, Peoples R China
[2] South China Normal Univ, Guangdong Prov Key Lab Quantum Engn & Quantum Mat, Sch Phys & Telecommun Engn, Guangzhou 510006, Peoples R China
关键词
Document classification; Hierarchical network; Bi-GRU; Attention mechanism; SYSTEM;
D O I
10.1007/s13042-020-01260-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research of document classification is ongoing to employ the attention based-deep learning algorithms and achieves impressive results. Owing to the complexity of the document, classical models, as well as single attention mechanism, fail to meet the demand of high-accuracy classification. This paper proposes a method that classifies the document via the hierarchical multi-attention networks, which describes the document from the word-sentence level and the sentence-document level. Further, different attention strategies are performed on different levels, which enables accurate assigning of the attention weight. Specifically, the soft attention mechanism is applied to the word-sentence level while the CNN-attention to the sentence-document level. Due to the distinctiveness of the model, the proposed method delivers the highest accuracy compared to other state-of-the-art methods. In addition, the attention weight visualization outcomes present the effectiveness of attention mechanism in distinguishing the importance.
引用
收藏
页码:1639 / 1647
页数:9
相关论文
共 29 条
  • [1] [Anonymous], 2016, WAY OUT ODYSSEY ANAL
  • [2] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [3] Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
  • [4] Chambers A., 2013, THESIS IRVINE U CALI
  • [5] Cho, 2016, LEARNING PHRASE REPR
  • [6] Cho K., 2014, ARXIV, DOI [10.3115/v1/w14-4012, DOI 10.3115/V1/W14-4012]
  • [7] Core DB, 2012, APPL TEXT CLASSIFICA
  • [8] DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
  • [9] 2-9
  • [10] Convolution-Based Neural Attention With Applications to Sentiment Classification
    Du, Jiachen
    Gui, Lin
    He, Yulan
    Xu, Ruifeng
    Wang, Xuan
    [J]. IEEE ACCESS, 2019, 7 : 27983 - 27992