Text clustering based on kernel KNN clustering algorithm

被引:0
|
作者
Xiong, Hao [1 ]
Sun, Sheng [1 ]
Feng, Yunfang [1 ]
机构
[1] Computer School, Hubei Polytechnic University, Huangshi 435003, Hubei, China
关键词
Attribute selection - Collection of documents - Document Clustering - Higher-dimensional - K-nearest neighbors - Kernel methods - Nonlinear functions - Text Clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Document clustering is a popular tool for automatically organizing a large collection of documents. In this paper, we propose a Kernel-based K-Nearest Neighbor (KKNNC) clustering algorithm based on the KNN method. Our algorithm maps samples into a higher-dimensional feature space using a nonlinear function before clustering, then in kernel space divides them linearly. We also propose a new attribute selection method-ATS??algorithm, which can select important terms in documents. Our algorithm first uses ATS to eliminate redundant attributes in data sets, then gives each of the selective attributes a weight value according to the relationship between these attributes. The experimental results show that our algorithm is effective in the text clustering task. © 2013 by CESER Publications.
引用
收藏
页码:69 / 75
相关论文
共 50 条
  • [1] An Improved KNN Text Classification Algorithm Based on Clustering
    Zhou Yong
    Li Youwen
    Xia Shixiong
    JOURNAL OF COMPUTERS, 2009, 4 (03) : 230 - 237
  • [2] A Clustering-Based KNN Improved Algorithm CLKNN for Text Classification
    Zhou, Lijuan
    Wang, Linshuang
    Ge, Xuebin
    Shi, Qian
    2010 2ND INTERNATIONAL ASIA CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS (CAR 2010), VOL 3, 2010, : 212 - 215
  • [3] A robust clustering algorithm based on the identification of core points and KNN kernel density estimation
    Zhou, Zhou
    Si, Gangquan
    Sun, Haodong
    Qu, Kai
    Hou, Weicheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
  • [4] Adaptive clustering algorithm based on kNN and density
    Shi, Bing
    Han, Lixin
    Yan, Hong
    PATTERN RECOGNITION LETTERS, 2018, 104 : 37 - 44
  • [5] Clustering ensemble based on the fuzzy KNN algorithm
    Weng, Fangfei
    Jiang, Qingshan
    Chen, Lifei
    Hong, Zhiling
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 1001 - +
  • [6] A new clustering algorithm based on KNN and DENCLUE
    Yu, XG
    Jian, Y
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 2033 - 2038
  • [7] A Kernel Clustering Algorithm Based on Diameters
    Costa, M. Fernanda P.
    Rocha, Ana Maria A. C.
    Fernandes, Edite M. G. P.
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, OL2A 2022, 2022, 1754 : 442 - 456
  • [8] RECOME: A new density-based clustering algorithm using relative KNN kernel density
    Geng, Yangli-ao
    Li, Qingyong
    Zheng, Rong
    Zhuang, Fuzhen
    He, Ruisi
    Xiong, Naixue
    INFORMATION SCIENCES, 2018, 436 : 13 - 30
  • [9] Kernel clustering algorithm
    Zhang, Li
    Zhou, Wei-Da
    Jiao, Li-Cheng
    Jisuanji Xuebao/Chinese Journal of Computers, 2002, 25 (06): : 587 - 590
  • [10] Graph Kernel Based Clustering Algorithm in MANETs
    Song, Ying
    Luo, Hongwei
    Pi, Shangchao
    Gui, Chao
    Sun, Baolin
    IEEE ACCESS, 2020, 8 : 107650 - 107660