Text Classification of Mixed Model Based on Deep Learning

被引:1
作者
Lee, Sang-Hwa [1 ]
机构
[1] Seowon Univ, Dept Webtoon Contents, 377-3 Musimseo Ro, Cheongju 28674, Chungcheongbuk, South Korea
来源
TEHNICKI GLASNIK-TECHNICAL JOURNAL | 2023年 / 17卷 / 03期
关键词
classification; deep confidence network; deep learning; sparse automatic encoder; softmax; FEATURE-SELECTION;
D O I
10.31803/tg-20221228180808
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
At present, deep learning has been widely used many fields, but the research on text classification is still relatively few. This paper makes full use of the good learning characteristics of deep learning, proposes a hybrid model based on deep learning, and designs a text classifier based on the hybrid model. This hybrid model uses two common deep learning models, sparse automatic encoder and deep confidence network, to mix. The hybrid model is mainly composed of three parts, the first two layers are constructed by sparse automatic encoder, the middle layer is a three-layer depth Convolutional Neural Network (CNN), and finally Softmax regression is used as the classification layer. In order to test the classification performance of the classifier based on deep learning hybrid model, relevant experiments were conducted on English data set 20Newsgroup and Chinese data set Fudan University Chinese Corpus. In the English text classification experiment, the classifier based on deep learning hybrid model is used to classify, and a high classification accuracy rate is obtained. In order to further verify the superiority of its performance, a comparative experiment with naive Bayes classifier, K-Nearest Neighbor (KNN) classifier and Support Vector Machine (SVM) classifier demonstrates that the classification effect of the classifier based on deep learning hybrid model is better than that of naive Bayes classifier, KNN classifier and support vector machine classifier. In the experiment of Chinese text classification, the Chinese corpus of Fudan University is tested, and a good classification effect is obtained. The influence of different parameter settings on the classification accuracy is discussed.
引用
收藏
页码:367 / 374
页数:8
相关论文
共 27 条
[1]  
Barak F., 2021, INT J HYBRID INNOVAT, V1, P63, DOI [10.21742/ijhit.2653-309X.2021.1.2.04, DOI 10.21742/IJHIT.2653-309X.2021.1.2.04]
[2]  
Chayangkoon N., 2019, INT J CONTROL AUTOMA, V12, P1, DOI [10.33832/ijca.2019.12.2.01, DOI 10.33832/IJCA.2019.12.2.01]
[3]   Adversarial Multi-Criteria Learning for Chinese Word Segmentation [J].
Chen, Xinchi ;
Shi, Zhan ;
Qiu, Xipeng ;
Huang, Xuanjing .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1193-1203
[4]   Performance Evaluation of Filter-based Feature Selection Techniques in Classifying Portable Executable Files [J].
Darshan, S. L. Shiva ;
Jaidhar, C. D. .
6TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS, 2018, 125 :346-356
[5]   The classification of construction waste material using a deep convolutional neural network [J].
Davis, Peter ;
Aziz, Fayeem ;
Newaz, Mohammad Tanvi ;
Sher, Willy ;
Simon, Laura .
AUTOMATION IN CONSTRUCTION, 2021, 122
[6]   基于互信息改进算法的新词发现对中文分词系统改进 [J].
杜丽萍 ;
李晓戈 ;
于根 ;
刘春丽 ;
刘睿 .
北京大学学报(自然科学版), 2016, 52 (01) :35-40
[7]   Differential evolution for feature selection: a fuzzy wrapper-filter approach [J].
Hancer, Emrah .
SOFT COMPUTING, 2019, 23 (13) :5233-5248
[8]  
[胡婕 Hu Jie], 2017, [小型微型计算机系统, Journal of Chinese Computer Systems], V38, P522
[9]  
HYON KIM DAE, 2020, [Asia-pacific Journal of Convergent Research Interchange, 아시아태평양융합연구교류논문지], V6, P13, DOI 10.21742/apjcri.2020.04.02
[10]  
Jang Sung-Bong, 2021, [Asia-pacific Journal of Convergent Research Interchange, 아시아태평양융합연구교류논문지], V7, P1, DOI 10.47116/apjcri.2021.12.01