Feature selection based on long short term memory for text classification

被引:1
作者
Hong, Ming [1 ]
Wang, Heyong [1 ]
机构
[1] South China Univ Technol, Dept Elect Business, Guangzhou, Peoples R China
关键词
Text classification; Feature selection; Deep learning; Long short term memory; BIDIRECTIONAL LSTM; OPTIMIZATION ALGORITHM; ATTENTION MECHANISM; NAIVE BAYES; INFORMATION; PERFORMANCE; NETWORK; PREDICTION; FREQUENCY; EFFICIENT;
D O I
10.1007/s11042-023-16990-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The selection of discriminative terms from large quantity of terms in text documents is helpful for achieving better accuracy of text classification. To focus on the task of selecting discriminative terms from text, a deep learning based feature selection method is proposed. The method is developed by using the long short term memory (LSTM) network. A deep network based on LSTM is trained in unsupervised manner to extracted deep features from bag-of-words term frequency vectors. The deep features are integrated with term frequencies to evaluate the effectiveness of terms. The proposed method extends the limitation of term frequency information by applying deep features for feature selection. Experiments in nine public datasets demonstrate better performance of our method in selecting discriminative terms than comparative methods.
引用
收藏
页码:44333 / 44378
页数:46
相关论文
共 121 条
[31]   UBIS: Unigram Bigram Importance Score for Feature Selection from Short Text [J].
Garg, Muskan .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
[32]   Hybrid feature selection based on enhanced genetic algorithm for text categorization [J].
Ghareb, Abdullah Saeed ;
Abu Bakar, Azuraliza ;
Hamdan, Abdul Razak .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 49 :31-47
[33]   DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction [J].
Guo, Yanbu ;
Li, Weihua ;
Wang, Bingyi ;
Liu, Huiqing ;
Zhou, Dongming .
BMC BIOINFORMATICS, 2019, 20 (1)
[34]   Supervised Hebb rule based feature selection for text classification [J].
Heyong, Wang ;
Ming, Hong .
INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (01) :167-191
[35]   A novel binary farmland fertility algorithm for feature selection in analysis of the text psychology [J].
Hosseinalipour, Ali ;
Gharehchopogh, Farhad Soleimanian ;
Masdari, Mohammad ;
Khademi, Ali .
APPLIED INTELLIGENCE, 2021, 51 (07) :4824-4859
[36]   A phenology-based spectral and temporal feature selection method for crop mapping from satellite time series [J].
Hu, Qiong ;
Sulla-Menashe, Damien ;
Xu, Baodong ;
Yin, He ;
Tang, Huajun ;
Yang, Peng ;
Wu, Wenbin .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2019, 80 :218-229
[37]   Bi-LSTM Model to Increase Accuracy in Text Classification: Combining Word2vec CNN and Attention Mechanism [J].
Jang, Beakcheol ;
Kim, Myeonghwi ;
Harerimana, Gaspard ;
Kang, Sang-ug ;
Kim, Jong Wook .
APPLIED SCIENCES-BASEL, 2020, 10 (17)
[38]   A two-stage Markov blanket based feature selection algorithm for text classification [J].
Javed, Kashif ;
Maruf, Sameen ;
Babri, Haroon A. .
NEUROCOMPUTING, 2015, 157 :91-104
[39]   Enhancements of Attention-Based Bidirectional LSTM for Hybrid Automatic Text Summarization [J].
Jiang, Jiawen ;
Zhang, Haiyang ;
Dai, Chenxu ;
Zhao, Qingjuan ;
Feng, Hao ;
Ji, Zhanlin ;
Ganchev, Ivan .
IEEE ACCESS, 2021, 9 :123660-123671
[40]   Chi-square Statistics Feature Selection Based on Term Frequency and Distribution for Text Categorization [J].
Jin, Chuanxin ;
Ma, Tinghuai ;
Hou, Rongtao ;
Tang, Meili ;
Tian, Yuan ;
Al-Dhelaan, Abdullah ;
Al-Rodhaan, Mznah .
IETE JOURNAL OF RESEARCH, 2015, 61 (04) :351-362