Feature Selection based on Supervised Topic Modeling for Boosting-Based Multi-Label Text Categorization

被引:0
|
作者
Al-Salemi, Bassam [1 ]
Ayob, Masri [1 ]
Noah, Shahrul Azman Mohd [1 ]
Ab Aziz, Mohd Juzaiddin [1 ]
机构
[1] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi, Malaysia
来源
PROCEEDINGS OF THE 2017 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI'17) | 2017年
关键词
AdaBoost.MH; feature selection; text categorization; supervised topic modeling; Latent Dirichlet Allocation; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The text representation model Bag-Of-Words is a simple and typical model which uses the single words as elements to represent the texts in the feature space. However, using the single words as features will produce a high dimensional feature space, which result in the learning computational cost, particularly for ensemble learning algorithms, such as the boosting algorithm AdaBoost.MH. The straightforward solution of this matter can be managed by using a feature selection method capable of reducing the features space effectively. This work describes how to utilize the supervised topic model Labeled Latent Dirichlet Allocation for feature selection, as well accelerating AdaBoost.MH learning for multi-label text categorization. The experimental results on three benchmarks demonstrated that using Labeled Latent Dirichlet Allocation for feature selection improves and accelerates AdaBoost.MH and exceeds the performance of three existing methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Feature ranking for enhancing boosting-based multi-label text categorization
    Al-Salemi, Bassam
    Ayob, Masri
    Noah, Shahrul Azman Mohd
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 113 : 531 - 543
  • [2] Boosting algorithms with topic modeling for multi-label text categorization: A comparative empirical study
    Al-Salemi, Bassam
    Ab Aziz, Mohd. Juzaiddin
    Noah, Shahrul Azman
    JOURNAL OF INFORMATION SCIENCE, 2015, 41 (05) : 732 - 746
  • [3] A Feature Selection Method for Multi-Label Text Based on Feature Importance
    Zhang, Lu
    Duan, Qingling
    APPLIED SCIENCES-BASEL, 2019, 9 (04):
  • [4] Deep label relevance and label ambiguity based multi-label feature selection for text classification
    Verma, Gurudatta
    Sahu, Tirath Prasad
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 148
  • [5] Weakly supervised multi-label feature selection based on shared subspace
    Shi, Rongyi
    Tan, Anhui
    Shi, Suwei
    Wang, Jin
    Gu, Shenming
    Wu, Weizhi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, : 2885 - 2903
  • [6] Multi-label feature selection based on the division of label topics
    Zhang, Ping
    Gao, Wanfu
    Hu, Juncheng
    Li, Yonghao
    INFORMATION SCIENCES, 2021, 553 : 129 - 153
  • [7] Multi-Label Feature Selection Based on Min-Relevance Label
    Gao, Wanfu
    Pan, Hanlin
    IEEE ACCESS, 2023, 11 : 410 - 420
  • [8] Multi-label feature selection based on label correlations and feature redundancy
    Fan, Yuling
    Chen, Baihua
    Huang, Weiqin
    Liu, Jinghua
    Weng, Wei
    Lan, Weiyao
    KNOWLEDGE-BASED SYSTEMS, 2022, 241
  • [9] Toward embedding-based multi-label feature selection with label and feature collaboration
    Dai, Liang
    Zhang, Jia
    Du, Guodong
    Li, Candong
    Wei, Rong
    Li, Shaozi
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (06) : 4643 - 4665
  • [10] Sparse semi-supervised multi-label feature selection based on latent representation
    Zhao, Xue
    Li, Qiaoyan
    Xing, Zhiwei
    Yang, Xiaofei
    Dai, Xuezhen
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 5139 - 5151