Text clustering based on kernel KNN clustering algorithm

被引:0
|
作者
Xiong, Hao [1 ]
Sun, Sheng [1 ]
Feng, Yunfang [1 ]
机构
[1] Computer School, Hubei Polytechnic University, Huangshi 435003, Hubei, China
关键词
Attribute selection - Collection of documents - Document Clustering - Higher-dimensional - K-nearest neighbors - Kernel methods - Nonlinear functions - Text Clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Document clustering is a popular tool for automatically organizing a large collection of documents. In this paper, we propose a Kernel-based K-Nearest Neighbor (KKNNC) clustering algorithm based on the KNN method. Our algorithm maps samples into a higher-dimensional feature space using a nonlinear function before clustering, then in kernel space divides them linearly. We also propose a new attribute selection method-ATS??algorithm, which can select important terms in documents. Our algorithm first uses ATS to eliminate redundant attributes in data sets, then gives each of the selective attributes a weight value according to the relationship between these attributes. The experimental results show that our algorithm is effective in the text clustering task. © 2013 by CESER Publications.
引用
收藏
页码:69 / 75
相关论文
共 50 条
  • [21] Cosine kernel based density peaks clustering algorithm
    Wang, Jiayuan
    Lv, Li
    Wu, Runxiu
    Fan, Tanghuai
    Lee, Ivan
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2020, 12 (01) : 1 - 20
  • [22] A new algorithm for clustering based on kernel density estimation
    Matioli, L. C.
    Santos, S. R.
    Kleina, M.
    Leite, E. A.
    JOURNAL OF APPLIED STATISTICS, 2018, 45 (02) : 347 - 366
  • [23] Kernel Function Clustering Based on Ant Colony Algorithm
    Li, Jinjiang
    Fan, Hui
    Yuan, Da
    Zhang, Caiming
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 7, PROCEEDINGS, 2008, : 645 - +
  • [24] A new kernel-based algorithm for online clustering
    Boubacar, HA
    Lecoeuche, S
    ARTIFICIAL NEURAL NETWORKS: FORMAL MODELS AND THEIR APPLICATIONS - ICANN 2005, PT 2, PROCEEDINGS, 2005, 3697 : 583 - 588
  • [25] Ant colony clustering Algorithm based on kernel method
    Li, Jinjiang
    Fan, Hui
    Wang, Jinpeng
    Li, Yewei
    ICIC Express Letters, 2011, 5 (11): : 4183 - 4188
  • [26] Enhancement of Kernel Clustering Based on Pigeon Optimization Algorithm
    Thamer, Mathil K.
    Algamal, Zakariya Yahya
    Zine, Raoudha
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2023, 31 (SUPP01) : 121 - 133
  • [27] Kernel method-based fuzzy clustering algorithm
    Wu Zhongdong 1
    2. College of Information Engineering
    Journal of Systems Engineering and Electronics, 2005, (01) : 160 - 166
  • [28] A dynamic fuzzy clustering algorithm based on kernel methods
    Zhang, L. B.
    Zhou, C. G.
    Ma, M.
    Sun, C. T.
    Liu, M.
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 1653 - 1656
  • [29] Multiple Kernel Based Collaborative Fuzzy Clustering Algorithm
    Trong Hop Dang
    Long Thanh Ngo
    Pedrycz, Wiltold
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 : 585 - 594
  • [30] A novel hierarchical document clustering algorithm based on a kNN connection graph
    Zhu, Qiaoming
    Li, Junhui
    Zhou, Guodong
    Li, Peifeng
    Qian, Peide
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 120 - +