An efficient semi-supervised graph based clustering

被引:7
|
作者
Viet-Vu Vu [1 ]
机构
[1] Vietnam Natl Univ, Informat Technol Inst, 144 Xuan Thuy St, Hanoi, Vietnam
关键词
Semi-supervised clustering; seed; k-nearest neighbors graph; ALGORITHM; SELECTION; NEIGHBORS;
D O I
10.3233/IDA-163296
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is one of the most important tools in data mining and knowledge discovery from data. In recent years, semi-supervised clustering, that integrates side information (seeds or constraints) in the clustering process, has been known as a good strategy to boost clustering results. In this article, a new semi-supervised graph based clustering (SSGC) is presented. Using a graph of the k-nearest neighbors and a measure of local density for the similarity between vertex, SSGC integrates the seeds in the process of building clusters and hence can improve the quality of clustering. More over, SSGC can deal with noise, differential density of data, and uses only one parameter (i.e. the number of nearest neighbors). Experiments conducted on real data sets from UCI show that our method can produce good clustering results compared with the related techniques such as semi-supervised density based clustering (SSDBSCAN). Moreover, the computational cost of SSGC is much less than that of SSDBSCAN.
引用
收藏
页码:297 / 307
页数:11
相关论文
共 50 条
  • [41] SGAClust: Semi-supervised Graph Attraction Clustering of gene expression data
    Mandal, Koyel
    Sarmah, Rosy
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2022, 11 (01):
  • [42] Scalable Semi-Supervised Learning by Efficient Anchor Graph Regularization
    Wang, Meng
    Fu, Weijie
    Hao, Shijie
    Tao, Dacheng
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (07) : 1864 - 1877
  • [43] SGAClust: Semi-supervised Graph Attraction Clustering of gene expression data
    Koyel Mandal
    Rosy Sarmah
    Network Modeling Analysis in Health Informatics and Bioinformatics, 2022, 11
  • [44] Efficient dynamic graph construction for inductive semi-supervised learning
    Dornaika, F.
    Dahbi, R.
    Bosaghzadeh, A.
    Ruichek, Y.
    NEURAL NETWORKS, 2017, 94 : 192 - 203
  • [45] Semi-supervised dimensionality reduction based on composite graph
    Yu, Guoxian, 1600, Binary Information Press (10):
  • [46] Semi-supervised projected model-based clustering
    Guerra, Luis
    Bielza, Concha
    Robles, Victor
    Larranaga, Pedro
    DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (04) : 882 - 917
  • [47] Network anomaly detection based on semi-supervised clustering
    Wei Xiaotao
    Huang Houkuan
    Tian Shengfeng
    NEW ADVANCES IN SIMULATION, MODELLING AND OPTIMIZATION (SMO '07), 2007, : 440 - +
  • [48] Semi-supervised fuzzy clustering algorithm based on QPSO
    School of IoT Engineering, Jiangnan University, Wuxi 214122, China
    J. Inf. Comput. Sci., 1 (93-101):
  • [49] Selective Bayesian Classifier Based on Semi-supervised Clustering
    Cheng Yuhu
    Tong Yaoyao
    Wang Xuesong
    CHINESE JOURNAL OF ELECTRONICS, 2012, 21 (01): : 73 - 77
  • [50] Semi-supervised clustering for vigilance analysis based on EEG
    Shi, Li-Chen
    Yu, Hong
    Lu, Bao-Liang
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1518 - 1523