A versatile framework for attributed network clustering via K-nearest neighbor augmentation

被引:0
|
作者
Li, Yiran [1 ]
Guo, Gongyao [1 ]
Shi, Jieming [1 ]
Yang, Renchi [2 ]
Shen, Shiqi [3 ]
Li, Qing [1 ]
Luo, Jun [4 ]
机构
[1] Hong Kong Polytech Univ, Hung Hom, Hong Kong, Peoples R China
[2] Hong Kong Baptist Univ, Kowloon Tong, Hong Kong, Peoples R China
[3] WeChat Tencent, Beijing, Peoples R China
[4] Logist & Supply Chain MultiTech R&D Ctr, Pok Fu Lam, Hong Kong, Peoples R China
关键词
Clustering; Attributed Graph; Random Walks; KNN; GPU Computing; PAGERANK;
D O I
10.1007/s00778-024-00875-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Attributed networks containing entity-specific information in node attributes are ubiquitous in modeling social networks, e-commerce, bioinformatics, etc. Their inherent network topology ranges from simple graphs to hypergraphs with high-order interactions and multiplex graphs with separate layers. An important graph mining task is node clustering, aiming to partition the nodes of an attributed network into k disjoint clusters such that intra-cluster nodes are closely connected and share similar attributes, while inter-cluster nodes are far apart and dissimilar. It is highly challenging to capture multi-hop connections via nodes or attributes for effective clustering on multiple types of attributed networks. In this paper, we first present AHCKA as an efficient approach to attributed hypergraph clustering (AHC). AHCKA includes a carefully-crafted K-nearest neighbor augmentation strategy for the optimized exploitation of attribute information on hypergraphs, a joint hypergraph random walk model to devise an effective AHC objective, and an efficient solver with speedup techniques for the objective optimization. The proposed techniques are extensible to various types of attributed networks, and thus, we develop ANCKA as a versatile attributed network clustering framework, capable of attributed graph clustering, attributed multiplex graph clustering, and AHC. Moreover, we devise ANCKA-GPU with algorithmic designs tailored for GPU acceleration to boost efficiency. We have conducted extensive experiments to compare our methods with 19 competitors on 8 attributed hypergraphs, 16 competitors on 6 attributed graphs, and 16 competitors on 3 attributed multiplex graphs, all demonstrating the superb clustering quality and efficiency of our methods.
引用
收藏
页码:1913 / 1943
页数:31
相关论文
共 50 条
  • [1] KNNCC: An Algorithm for K-Nearest Neighbor Clique Clustering
    Qu Chao
    Yuan Ruifen
    Wei Xiaorui
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 1763 - 1766
  • [2] Fast agglomerative clustering using a k-nearest neighbor graph
    Franti, Pasi
    Virmajoki, Olli
    Hautamaki, Ville
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (11) : 1875 - 1881
  • [3] Connected K-nearest neighbor clustering algorithm for radar signal sorting
    Si W.
    Zhang Y.
    Deng Z.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (08): : 2463 - 2470
  • [4] An Enhanced K-Nearest Neighbor Algorithm Using Information Gain and Clustering
    Taneja, Shweta
    Gupta, Charu
    Goyal, Kratika
    Gureja, Dharna
    2014 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES (ACCT 2014), 2014, : 325 - 329
  • [5] Movie Recommender System Using K-Means Clustering AND K-Nearest Neighbor
    Ahuja, Rishabh
    Solanki, Arun
    Nayyar, Anand
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 263 - 268
  • [6] Searching k-Nearest Neighbor Trajectories on Road Networks
    Yuan, Pengcheng
    Zhao, Qinpei
    Rao, Weixiong
    Yuan, Mingxuan
    Zeng, Jia
    DATABASES THEORY AND APPLICATIONS, ADC 2017, 2017, 10538 : 85 - 97
  • [7] Map Reduce by K-Nearest Neighbor Joins
    Bethu, Srikanth
    Babu, B. Sankara
    Rao, S. Govinda
    Florence, R. Aruna
    2018 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC 2018), 2018, : 222 - 231
  • [8] Efficient Hand Movement Detection Using k-Means Clustering and k-Nearest Neighbor Algorithms
    Erhan Bergil
    Canan Oral
    Engin Ufuk Ergul
    Journal of Medical and Biological Engineering, 2021, 41 : 11 - 24
  • [9] Efficient Hand Movement Detection Using k-Means Clustering and k-Nearest Neighbor Algorithms
    Bergil, Erhan
    Oral, Canan
    Ergul, Engin Ufuk
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2021, 41 (01) : 11 - 24
  • [10] KNN-DBSCAN: Using k-nearest Neighbor Information for Parameter-free Density Based Clustering
    Sharma, Ankush
    Sharma, Amit
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 787 - 792