Text classification problems via BERT embedding method and graph convolutional neural network

被引:3
作者
Loc Tran [1 ]
Lam Pham [2 ,3 ]
Tuan Tran [1 ]
An Mai [3 ,4 ]
机构
[1] Paris Saclay Univ, EPHE, CHArt Lab, Paris, France
[2] Univ Informat Technol, HCMC, Ho Chi Minh City, Vietnam
[3] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[4] Int Univ, Dept Comp Sci & Engn, Ho Chi Minh City, Vietnam
来源
2021 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2021) | 2021年
关键词
text classification; graph convolutional neural network; BERT; classical machine learning model;
D O I
10.1109/ATC52653.2021.9598337
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a hybrid technique of combining the BERT embedding method and the graph convolutional neural network. This combination is then employed to solve the text classification problem. Initially, we apply the BERT embedding method to the whole corpus in order to transform all the texts into numerical vectors. Then, the graph convolutional neural network will be applied to these numerical vectors to classify these texts into their appropriate classes. Especially, in our approach, we need only a few labeled texts for the model training. For the illustration, in this paper, we use the BBC news and the IMDB movie reviews datasets to perform our experiments, showing that the performance of the graph convolutional neural network model is better than the performances of the combination of the BERT embedding method with other classical machine learning models.
引用
收藏
页码:260 / 264
页数:5
相关论文
共 19 条
[1]  
Akbik A, 2019, NAACL HLT 2019: THE 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, P54
[2]  
Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107
[3]  
Dang N. T. V., ARXIV PREPRINT ARXIV, P2021
[4]   Target-Dependent Sentiment Classification With BERT [J].
Gao, Zhengjie ;
Feng, Ao ;
Song, Xinyu ;
Wu, Xi .
IEEE ACCESS, 2019, 7 :154290-154299
[5]  
Guo ZJ, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P241
[6]  
Ikonomakis M., 2005, WSEAS Transactions on Computers, V4, P966
[7]  
Kipf T. N., 2016, ARXIV
[8]  
Larson S, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P1311
[9]  
Man Ngo, 2020, 2020 7th NAFOSTED Conference on Information and Computer Science (NICS), P114, DOI 10.1109/NICS51282.2020.9335914
[10]  
Pranckevicius T, 2017, BALT J MOD COMPUT, V5, P221, DOI 10.22364/bjmc.2017.5.2.05