Feature selection based on long short term memory for text classification

Cited by: 1
Authors
Hong, Ming [1]
Wang, Heyong [1]
Affiliations
[1] South China Univ Technol, Dept Elect Business, Guangzhou, Peoples R China
Keywords
Text classification; Feature selection; Deep learning; Long short term memory; BIDIRECTIONAL LSTM; OPTIMIZATION ALGORITHM; ATTENTION MECHANISM; NAIVE BAYES; INFORMATION; PERFORMANCE; NETWORK; PREDICTION; FREQUENCY; EFFICIENT;
DOI
10.1007/s11042-023-16990-7
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Selecting discriminative terms from the large number of terms in text documents helps achieve better text classification accuracy. To address the task of selecting discriminative terms from text, a deep learning based feature selection method is proposed. The method is built on the long short term memory (LSTM) network. A deep network based on LSTM is trained in an unsupervised manner to extract deep features from bag-of-words term frequency vectors. The deep features are integrated with term frequencies to evaluate the effectiveness of terms. The proposed method goes beyond the limitation of term frequency information by applying deep features for feature selection. Experiments on nine public datasets demonstrate that our method selects discriminative terms better than the compared methods.
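The abstract does not give the network architecture or the scoring formula, so the following is only a minimal sketch of the input side of such a pipeline: building bag-of-words term frequency vectors and ranking terms with a simple frequency-based filter. The function names (`term_frequency_vectors`, `select_terms`), the toy documents, and the scoring rule (spread of per-class mean term frequency) are assumptions made for illustration and are not taken from the paper. The LSTM-based deep feature step described in the abstract is deliberately not reproduced here.

```python
from collections import Counter


def term_frequency_vectors(docs):
    """Build a sorted vocabulary and one bag-of-words term frequency vector per document."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


def select_terms(vocab, vectors, labels, k):
    """Rank terms by a stand-in score (assumption, not the paper's score):
    the gap between the highest and lowest per-class mean term frequency."""
    classes = sorted(set(labels))
    scores = {}
    for j, term in enumerate(vocab):
        class_means = []
        for c in classes:
            column = [vec[j] for vec, y in zip(vectors, labels) if y == c]
            class_means.append(sum(column) / len(column))
        scores[term] = max(class_means) - min(class_means)
    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(vocab, key=lambda t: (-scores[t], t))
    return ranked[:k], scores


# Toy corpus (hypothetical data, for illustration only).
docs = [
    "cheap pills buy now",
    "buy cheap watches now",
    "meeting agenda for monday",
    "monday project meeting notes",
]
labels = ["spam", "spam", "ham", "ham"]

vocab, vectors = term_frequency_vectors(docs)
top_terms, scores = select_terms(vocab, vectors, labels, k=3)
print(top_terms)
```

A frequency-only filter like this one is the kind of baseline whose limitation the abstract refers to: it sees only term counts, whereas the proposed method additionally uses deep features learned from the term frequency vectors.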
Pages: 44333-44378
Page count: 46