A text clustering algorithm based on find of density peaks

Cited by: 3
Authors
Liu, Peiyu [1]
Liu, Yingying [2]
Hou, Xiuyan [2]
Li, Qingqing [2]
Zhu, Zhenfang [3]
Affiliations
[1] Shandong Yingcai Univ, Jinan, Peoples R China
[2] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
[3] Shandong Jiaotong Univ, Sch Informat Sci & Elect Engn, Jinan, Peoples R China
Keywords
Density; Text clustering; Feature term; Vector distance; Similarity
DOI
10.1109/ITME.2015.103
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Text clustering is one of the core issues in text mining and information retrieval. Clustering algorithms fall into four categories: partitional, hierarchical, density-based, and intelligent clustering algorithms. At present, many of them cannot meet the speed and self-adaptation demands of text clustering. This paper therefore proposes a text clustering algorithm based on finding density peaks. The algorithm computes text distance and density from the similarity of text vectors. SVM is used to represent each text and obtain the vector mapping for the similarity calculation. Next, for each text, the algorithm computes its local density and its distance from points of higher density, removes noise points, and selects the cluster centers. Each remaining point is assigned to the cluster represented by its nearest cluster center. Several sets of comparative experiments indicate that the density-based text clustering is reliable and robust.
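The steps named in the abstract (similarity-based distance, local density, distance to points of higher density, center selection, nearest-center assignment) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the bag-of-words term counts, the cosine distance, the cutoff `d_c`, the fixed `n_clusters`, and the density-times-distance score used to pick centers are all assumptions made for the example, and the noise-removal step is omitted.

```python
import math
from collections import Counter


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two term-count Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
        math.sqrt(sum(v * v for v in b.values()))
    return 1.0 - dot / norm if norm else 1.0


def density_peaks_cluster(texts, d_c=0.5, n_clusters=2):
    """Cluster texts by density peaks; d_c and n_clusters are assumed parameters."""
    # Assumed representation: whitespace-tokenised term counts per text.
    vecs = [Counter(t.lower().split()) for t in texts]
    n = len(vecs)
    dist = [[cosine_distance(vecs[i], vecs[j]) for j in range(n)]
            for i in range(n)]

    # Local density: number of other texts closer than the cutoff d_c.
    rho = [sum(1 for j in range(n) if j != i and dist[i][j] < d_c)
           for i in range(n)]

    # Visit texts from highest to lowest density (ties broken by index).
    order = sorted(range(n), key=lambda i: (-rho[i], i))

    # delta: distance to the nearest text visited earlier (higher density).
    # The densest text gets its largest distance to any other text.
    delta = [0.0] * n
    for rank, i in enumerate(order):
        higher = order[:rank]
        delta[i] = min(dist[i][k] for k in higher) if higher else max(dist[i])

    # Assumed selection rule: centers have the largest rho * delta.
    centers = sorted(order, key=lambda i: -rho[i] * delta[i])[:n_clusters]

    # Assign every text to the cluster of its nearest center.
    labels = [min(range(len(centers)), key=lambda c: dist[i][centers[c]])
              for i in range(n)]
    return labels, centers


docs = [
    "cat dog pet animal",
    "cat dog pet",
    "dog pet animal",
    "stock market trade price",
    "stock market price",
    "market trade price",
]
labels, centers = density_peaks_cluster(docs)
print(labels, centers)
```

In this toy input the two topic groups share no terms, so their cosine distance is 1 and each group forms its own density peak. On real corpora the cutoff and the number of centers would need tuning or an automatic selection rule.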
Pages: 348-352
Page count: 5