Unsupervised Feature Selection Technique Based on Harmony Search Algorithm for Improving the Text Clustering

被引:0
|
作者
Abualigah, Laith Mohammad [1 ]
Khader, Ahamad Tajudin [1 ]
Al-Betar, Mohammed Azmi [2 ]
机构
[1] USM, Sch Comp Sci, George Town 11800, Malaysia
[2] Al Huson Univ Coll, Dept Informat Technol, Irbid, Jordan
关键词
Unsupervised Feature Selection; Harmony Search Algorithm; K-mean Text Clustering; Informative features; Sparse features; DIMENSION REDUCTION;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The increasing amount of text information on the Internet web pages affects the clustering analysis. The text clustering is a favorable analysis technique used for partitioning a massive amount of information into clusters. Hence, the major problem that affects the text clustering technique is the presence uninformative and sparse features in text documents. The feature selection (FS) is an important unsupervised technique used to eliminate uninformative features to encourage the text clustering technique. Recently, the meta-heuristic algorithms are successfully applied to solve several optimization problems. In this paper, we proposed the harmony search (HS) algorithm to solve the feature selection problem (FSHSTC). The proposed method is used to enhance the text clustering (TC) technique by obtaining a new subset of informative or useful features. Experiments were applied using four benchmark text datasets. The results show that the proposed FSHSTC is improved the performance of the k-mean clustering algorithm measured by F-measure and Accuracy.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Text Categorization Based on Clustering Feature Selection
    Zhou, Xiaofei
    Hu, Yue
    Guo, Li
    2ND INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2014, 2014, 31 : 398 - 405
  • [22] Automatic Unsupervised Feature Selection using Gravitational Search Algorithm
    Kumar, Vijay
    Chhabra, Jitender Kumar
    Kumar, Dinesh
    IETE JOURNAL OF RESEARCH, 2015, 61 (01) : 22 - 31
  • [23] Feature selection in unsupervised context: Clustering based approach
    Klepaczko, A
    Materka, A
    Computer Recognition Systems, Proceedings, 2005, : 219 - 226
  • [24] Spectral Clustering Based Unsupervised Feature Selection Algorithms
    Xie J.-Y.
    Ding L.-J.
    Wang M.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1009 - 1024
  • [25] Unsupervised feature selection using clustering ensembles and population based incremental learning algorithm
    Hong, Yi
    Kwong, Sam
    Chang, Yuchou
    Ren, Qingsheng
    PATTERN RECOGNITION, 2008, 41 (09) : 2742 - 2756
  • [26] A PCA Based Unsupervised Feature Selection Algorithm
    Luo, Yihui
    Xiong, Shuchu
    Wang, Sichuan
    SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 299 - 302
  • [27] Feature Selection for Colon Cancer Detection Using K-Means Clustering and Modified Harmony Search Algorithm
    Bae, Jin Hee
    Kim, Minwoo
    Lim, J. S.
    Geem, Zong Woo
    MATHEMATICS, 2021, 9 (05)
  • [28] Introducing clustering based population in Binary Gravitational Search Algorithm for Feature Selection
    Guha, Ritam
    Ghosh, Manosij
    Chakrabarti, Akash
    Sarkar, Ram
    Mirjalili, Seyedali
    APPLIED SOFT COMPUTING, 2020, 93
  • [29] Application of Harmony Search Algorithm on Clustering
    Amiri, Babak
    Hossain, Liaquat
    Mosavi, Seyyed Esmaeil
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS 1 AND 2, 2010, : 460 - +
  • [30] Improving binary crow search algorithm for feature selection
    Alnaish, Zakaria A. Hamed A.
    Algamal, Zakariya Yahya
    JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)