Text Classification Based on Label Data Augmentation and Graph Neural Network

被引:0
作者
Sun, Guoying [1 ]
Cheng, Yanan [1 ]
Kong, Ke [1 ]
Zhang, Zhaoxin [1 ]
Zhao, Dong [1 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin 150006, Peoples R China
关键词
Vectors; Feature extraction; Semantics; Context modeling; Data augmentation; Attention mechanisms; Adaptation models; Long short term memory; Informatics; Data models; Data sparsity problem; label data augmentation (LDA); text classification; uneven text length problem;
D O I
10.1109/TII.2025.3537607
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although graph neural networks based methods can solve the uneven text length problem of text classification datasets, they are difficult to address the data sparsity problem of short texts. Although some researchers try to reduce the sparsity of the graph by adding labels to its structure, most of them only treat labels as node features other than words and documents, which is not sufficient to construct denser matrices. To address the above problems, three label data augmentation strategies are proposed to build a dense graph, and the attention mechanisms are used to update node features. In addition, a node feature updating method that simultaneously uses global and local weights is proposed. Multiple comparative experiments on five benchmark datasets demonstrate that the method proposed in this article is optimal and the accuracy and micro-F1 have improved by at least 0.012 on four benchmark datasets.
引用
收藏
页码:3966 / 3975
页数:10
相关论文
共 27 条
[1]   Graph Fusion Network for Text Classification [J].
Dai, Yong ;
Shou, Linjun ;
Gong, Ming ;
Xia, Xiaolin ;
Kang, Zhao ;
Xu, Zenglin ;
Jiang, Daxin .
KNOWLEDGE-BASED SYSTEMS, 2022, 236
[2]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
[3]  
Harel-Canada F, 2022, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), P1771
[4]  
Huang Y.-H., 2022, P INT C COMP LING, P1163
[5]   Electrical Fault Diagnosis From Text Data: A Supervised Sentence Embedding Combined With Imbalanced Classification [J].
Jing, Xiao ;
Wu, Zhiang ;
Zhang, Lu ;
Li, Zhe ;
Mu, Dejun .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (03) :3064-3073
[6]  
Kim Yoon, 2014, P 2014 C EMPIRICAL M
[7]  
Kipf T. N., 2017, ARXIV
[8]  
Lan Z., 2019, ARXIV
[9]  
Lee K., 2018, P 2019 ANN C N AM CH
[10]   Semi-Supervised Mixture Learning for Graph Neural Networks With Neighbor Dependence [J].
Liu, Kai ;
Liu, Hongbo ;
Wang, Tao ;
Hu, Guoqiang ;
Ward, Tomas E. E. ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) :12528-12539