Unsupervised Feature Selection Technique Based on Harmony Search Algorithm for Improving the Text Clustering

被引:0
|
作者
Abualigah, Laith Mohammad [1 ]
Khader, Ahamad Tajudin [1 ]
Al-Betar, Mohammed Azmi [2 ]
机构
[1] USM, Sch Comp Sci, George Town 11800, Malaysia
[2] Al Huson Univ Coll, Dept Informat Technol, Irbid, Jordan
关键词
Unsupervised Feature Selection; Harmony Search Algorithm; K-mean Text Clustering; Informative features; Sparse features; DIMENSION REDUCTION;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The increasing amount of text information on the Internet web pages affects the clustering analysis. The text clustering is a favorable analysis technique used for partitioning a massive amount of information into clusters. Hence, the major problem that affects the text clustering technique is the presence uninformative and sparse features in text documents. The feature selection (FS) is an important unsupervised technique used to eliminate uninformative features to encourage the text clustering technique. Recently, the meta-heuristic algorithms are successfully applied to solve several optimization problems. In this paper, we proposed the harmony search (HS) algorithm to solve the feature selection problem (FSHSTC). The proposed method is used to enhance the text clustering (TC) technique by obtaining a new subset of informative or useful features. Experiments were applied using four benchmark text datasets. The results show that the proposed FSHSTC is improved the performance of the k-mean clustering algorithm measured by F-measure and Accuracy.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Unsupervised Feature Selection Technique Based on Genetic Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [2] A harmony search algorithm for clustering with feature selection
    Cobos, Carlos
    Leon, Elizabeth
    Mendoza, Martha
    REVISTA FACULTAD DE INGENIERIA-UNIVERSIDAD DE ANTIOQUIA, 2010, (55): : 153 - 164
  • [3] Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (11): : 4773 - 4795
  • [4] Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering
    Laith Mohammad Abualigah
    Ahamad Tajudin Khader
    The Journal of Supercomputing, 2017, 73 : 4773 - 4795
  • [5] An Unsupervised Attribute Clustering Algorithm for Unsupervised Feature Selection
    Zhou, Pei-Yuan
    Chan, Keith C. C.
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 710 - 716
  • [6] Text stream clustering algorithm based on adaptive feature selection
    Gong, Linghui
    Zeng, Jianping
    Zhang, Shiyong
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 1393 - 1399
  • [7] A new unsupervised feature selection method for text clustering based on genetic algorithms
    Pirooz Shamsinejadbabki
    Mohammad Saraee
    Journal of Intelligent Information Systems, 2012, 38 : 669 - 684
  • [8] A new unsupervised feature selection method for text clustering based on genetic algorithms
    Shamsinejadbabki, Pirooz
    Saraee, Mohammad
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2012, 38 (03) : 669 - 684
  • [9] Harmony Search Algorithm for Feature Selection in Face Recognition
    Kumar, Dinesh
    Shrutika
    COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 554 - 559
  • [10] A comparative study on unsupervised feature selection methods for text clustering
    Liu, LY
    Kang, JC
    Yu, J
    Wang, ZL
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 597 - 601