TextConvoNet: a convolutional neural network based architecture for text classification

被引:60
作者
Soni, Sanskar [1 ]
Chouhan, Satyendra Singh [1 ]
Rathore, Santosh Singh [2 ]
机构
[1] MNIT Jaipur, Dept Comp Sci & Engn, Jaipur 302017, Rajasthan, India
[2] ABV IIITM Gwalior, Dept Comp Sci & Engn, Gwalior 474015, India
关键词
Text classification; Convolution neural network (CNN); Multi-dimensional convolution; Deep learning;
D O I
10.1007/s10489-022-04221-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents, TextConvoNet, a novel Convolutional Neural Network (CNN) based architecture for binary and multi-class text classification problems. Most of the existing CNN-based models use one-dimensional convolving filters, where each filter specializes in extracting n-grams features of a particular input word embeddings (Sentence Matrix). These features can be termed as intra-sentence n-gram features. To the best of our knowledge, all the existing CNN models for text classification are based on the aforementioned concept. The presented TextConvoNet not only extracts the intra-sentence n-gram features but also captures the inter-sentence n-gram features in input text data. It uses an alternative approach for input matrix representation and applies a two-dimensional multi-scale convolutional operation on the input. We perform an experimental study on five binary and multi-class classification datasets and evaluate the performance of the TextConvoNet for text classification. The results are evaluated using eight performance measures, accuracy, precision, recall, f1-score, specificity, gmean1, gmean2, and Mathews correlation coefficient (MCC). Furthermore, we extensively compared presented TextConvoNet with machine learning, deep learning, and attention-based models. The experimental results evidenced that the presented TextConvoNet outperformed and yielded better performance than the other used models for text classification purposes.
引用
收藏
页码:14249 / 14268
页数:20
相关论文
共 58 条
[1]  
Adhikari A, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4046
[2]   Exploring deep learning approaches for Urdu text classification in product manufacturing [J].
Akhter, Muhammad Pervez ;
Jiangbin, Zheng ;
Naqvi, Irfan Raza ;
Abdelmajeed, Mohammed ;
Fayyaz, Muhammad .
ENTERPRISE INFORMATION SYSTEMS, 2022, 16 (02) :223-248
[3]   Document-Level Text Classification Using Single-Layer Multisize Filters Convolutional Neural Network [J].
Akhter, Muhammad Pervez ;
Jiangbin, Zheng ;
Naqvi, Irfan Raza ;
Abdelmajeed, Mohammed ;
Mehmood, Atif ;
Sadiq, Muhammad Tariq .
IEEE ACCESS, 2020, 8 :42689-42707
[4]   Arabic Text Classification Using Convolutional Neural Network and Genetic Algorithms [J].
Alsaleh, Deem ;
Larabi-Marie-Sainte, Souad .
IEEE ACCESS, 2021, 9 (09) :91670-91685
[5]  
Bozarth Lia, 2020, P INT AAAI C WEB SOC, V14, P60, DOI 10.1609/icwsm.v14i1.7279
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Chen T., 2015, R PACKAGE VERSION 04, V1, P1
[8]   BerConvoNet: A deep learning framework for fake news classification [J].
Choudhary, Monika ;
Chouhan, Satyendra Singh ;
Pilli, S. Emmanuel ;
Vipparthi, Santosh Kumar .
APPLIED SOFT COMPUTING, 2021, 110
[9]  
Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107
[10]   Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classi fi cation [J].
Deng, Jianfeng ;
Cheng, Lianglun ;
Wang, Zhuowei .
COMPUTER SPEECH AND LANGUAGE, 2021, 68