Hierarchical multi-attention networks for document classification

被引:29
作者
Huang, Yingren [1 ,2 ]
Chen, Jiaojiao [2 ]
Zheng, Shaomin [2 ]
Xue, Yun [2 ]
Hu, Xiaohui [2 ]
机构
[1] Guangdong Univ Foreign Studies, Lab Language Engn & Comp, Guangzhou, Guangdong, Peoples R China
[2] South China Normal Univ, Guangdong Prov Key Lab Quantum Engn & Quantum Mat, Sch Phys & Telecommun Engn, Guangzhou 510006, Peoples R China
关键词
Document classification; Hierarchical network; Bi-GRU; Attention mechanism; SYSTEM;
D O I
10.1007/s13042-020-01260-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research of document classification is ongoing to employ the attention based-deep learning algorithms and achieves impressive results. Owing to the complexity of the document, classical models, as well as single attention mechanism, fail to meet the demand of high-accuracy classification. This paper proposes a method that classifies the document via the hierarchical multi-attention networks, which describes the document from the word-sentence level and the sentence-document level. Further, different attention strategies are performed on different levels, which enables accurate assigning of the attention weight. Specifically, the soft attention mechanism is applied to the word-sentence level while the CNN-attention to the sentence-document level. Due to the distinctiveness of the model, the proposed method delivers the highest accuracy compared to other state-of-the-art methods. In addition, the attention weight visualization outcomes present the effectiveness of attention mechanism in distinguishing the importance.
引用
收藏
页码:1639 / 1647
页数:9
相关论文
共 29 条
[1]  
[Anonymous], 2016, WAY OUT ODYSSEY ANAL
[2]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[3]  
Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
[4]  
Chambers A., 2013, THESIS IRVINE U CALI
[5]  
Cho, 2016, LEARNING PHRASE REPR
[6]  
Cho K., 2014, P 8 WORKSH SYNT SEM, P103
[7]  
Core DB, 2012, APPL TEXT CLASSIFICA
[8]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
[9]  
2-9
[10]   Convolution-Based Neural Attention With Applications to Sentiment Classification [J].
Du, Jiachen ;
Gui, Lin ;
He, Yulan ;
Xu, Ruifeng ;
Wang, Xuan .
IEEE ACCESS, 2019, 7 :27983-27992