Hierarchical multi-attention networks for document classification

被引:1
作者
Yingren Huang
Jiaojiao Chen
Shaomin Zheng
Yun Xue
Xiaohui Hu
机构
[1] Guangdong University of Foreign Studies,Laboratory of Language Engineering and Computing
[2] South China Normal University,Guangdong Provincial Key Laboratory of Quantum Engineering and Quantum Materials, School of Physics and Telecommunication Engineering
来源
International Journal of Machine Learning and Cybernetics | 2021年 / 12卷
关键词
Document classification; Hierarchical network; Bi-GRU; Attention mechanism;
D O I
暂无
中图分类号
学科分类号
摘要
Research of document classification is ongoing to employ the attention based-deep learning algorithms and achieves impressive results. Owing to the complexity of the document, classical models, as well as single attention mechanism, fail to meet the demand of high-accuracy classification. This paper proposes a method that classifies the document via the hierarchical multi-attention networks, which describes the document from the word-sentence level and the sentence-document level. Further, different attention strategies are performed on different levels, which enables accurate assigning of the attention weight. Specifically, the soft attention mechanism is applied to the word-sentence level while the CNN-attention to the sentence-document level. Due to the distinctiveness of the model, the proposed method delivers the highest accuracy compared to other state-of-the-art methods. In addition, the attention weight visualization outcomes present the effectiveness of attention mechanism in distinguishing the importance.
引用
收藏
页码:1639 / 1647
页数:8
相关论文
共 26 条
[1]  
Deerwester S(1990)Indexing by latent semantic analysis J Am Soc Inf Sci 41 391-407
[2]  
Dumais ST(2008)Retrieval TiI Opin Min Sentiment Anal 2 1-135
[3]  
Furnas GW(2018)Convolutional recurrent deep learning model for sentence classification IEEE Access 6 13949-13957
[4]  
Landauer TK(2010)Dobnikar AJIToS, Man, cybernetics PC. Distributed text classification with an ensemble kernel-based learning approach IEEE Trans Syst Man Cybern 40 287-297
[5]  
Harshman R(2019)Sentiment analysis using embedding from language model and multi-scale convolutional neural network Comput Appl 40 651-657
[6]  
Pang B(1993)Approximation of dynamical systems by continuous time recurrent neural networks Neural Netw 6 801-806
[7]  
Lee LJF(2018)Functional and contextual attention-based LSTM for service recommendation in Mashup creation IEEE Trans Parallel Distrib Syst 30 1077-1090
[8]  
Hassan A(2018)An intrusion detection system using a deep neural network with gated recurrent units IEEE Access 6 48697-48707
[9]  
Mahmood A(2019)Convolution-based neural attention with applications to sentiment classification IEEE Access 7 27983-27992
[10]  
Silva C(undefined)undefined undefined undefined undefined-undefined