Zero-sample text classification algorithm based on BERT and graph convolutional neural network

被引:0
作者
Qiao Y. [1 ]
Li Y. [1 ]
Zhou L. [2 ]
Shang X. [2 ]
机构
[1] School of Computer Science and Software Engineering, Southwest Petroleum University, Sichuan, Chengdu
[2] PetroChina Changqing Oilfield Company Oil Production Plant NO.7, Shaanxi, Xi'an
关键词
Attention mechanism; Baseline model; BERT model; Graph convolutional neural network; Text classification;
D O I
10.2478/amns-2024-1560
中图分类号
学科分类号
摘要
In this study, we undertake a comprehensive examination of zero-shot text classification and its associated implications. We propose the adoption of the BERT model as a method for text feature representation. Subsequently, we utilize the Pointwise Mutual Information (PMI) metric to adjust the weight values within a graph convolutional neural network, thereby facilitating the construction of a text graph. Additionally, we incorporate an attention mechanism to transform this text graph, enabling it to represent the output labels of zero-shot text classification effectively. The experimental environment is set up, and the comparison and ablation experiments of the text classification model based on BERT and graph convolutional neural network with the baseline models are carried out in several different types of datasets, and the parameter settings of λ are adjusted according to the experimental results, and the convergence of the BERT model is compared to test the robustness of the model performance and the classification effect. When λ was set to 0.60, the model achieved the best results in each dataset. When the task is set to 5-way-5-shot, the convergence rate of the model for the Snippets dataset using the penultimate layer of features can reach 74%-80% of the training accuracy at the 5,000th step. The training accuracy gradually flattens out in the first 10,000 steps, and the model achieves classification accuracy in all four learning scenarios, with good stability. © 2024 Ying Qiao et al., published by Sciendo.
引用
收藏
相关论文
共 50 条
[31]   Text Classification with Attention Gated Graph Neural Network [J].
Deng, Zhaoyang ;
Sun, Chenxiang ;
Zhong, Guoqiang ;
Mao, Yuxu .
COGNITIVE COMPUTATION, 2022, 14 (04) :1464-1473
[32]   Convolutional Neural Network Algorithm-Based Novel Automatic Text Classification Framework for Construction Accident Reports [J].
Luo, Xixi ;
Li, Xinchun ;
Song, Xuefeng ;
Liu, Quanlong .
JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2023, 149 (12)
[33]   Text Classification Based on Word2vec and Convolutional Neural Network [J].
Li, Lin ;
Xiao, Linlong ;
Jin, Wenzhen ;
Zhu, Hong ;
Yang, Guocai .
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 :450-460
[34]   A Short Text Classification Method Based on Convolutional Neural Network and Semantic Extension [J].
Wang, Haitao ;
Tian, Keke ;
Wu, Zhengjiang ;
Wang, Lei .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) :367-375
[35]   Graph Convolutional Network Based on Multi-Head Pooling for Short Text Classification [J].
Zhao, Hongyu ;
Xie, Jiazhi ;
Wang, Hongbin .
IEEE ACCESS, 2022, 10 :11947-11956
[36]   VGCN-BERT: Augmenting BERT with Graph Embedding for Text Classification [J].
Lu, Zhibin ;
Du, Pan ;
Nie, Jian-Yun .
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2020, PT I, 2020, 12035 :369-382
[37]   Knowledge-enhanced graph convolutional neural networks for text classification [J].
Wang T. ;
Zhu X.-F. ;
Tang G. .
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (02) :322-328
[38]   Variable-Depth Convolutional Neural Network for Text Classification [J].
Ch, Ka-Hou ;
Im, Sio-Kei ;
Ke, Wei .
NEURAL INFORMATION PROCESSING, ICONIP 2020, PT V, 2021, 1333 :685-692
[39]   Convolutional Neural Network with Contextualized Word Embedding for Text Classification [J].
Fan, Gaoyang ;
Zhu, Cui ;
Zhu, Wenjun .
2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
[40]   A Neural Network Based Text Classification with Attention Mechanism [J].
Lu SiChen .
PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, :333-338