A versatile framework for attributed network clustering via K-nearest neighbor augmentation

Citations: 0
Authors
Li, Yiran [1]
Guo, Gongyao [1]
Shi, Jieming [1]
Yang, Renchi [2]
Shen, Shiqi [3]
Li, Qing [1]
Luo, Jun [4]
Affiliations
[1] Hong Kong Polytech Univ, Hung Hom, Hong Kong, Peoples R China
[2] Hong Kong Baptist Univ, Kowloon Tong, Hong Kong, Peoples R China
[3] WeChat Tencent, Beijing, Peoples R China
[4] Logist & Supply Chain MultiTech R&D Ctr, Pok Fu Lam, Hong Kong, Peoples R China
Keywords
Clustering; Attributed Graph; Random Walks; KNN; GPU Computing; PageRank
DOI
10.1007/s00778-024-00875-8
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Attributed networks containing entity-specific information in node attributes are ubiquitous in modeling social networks, e-commerce, bioinformatics, etc. Their inherent network topology ranges from simple graphs to hypergraphs with high-order interactions and multiplex graphs with separate layers. An important graph mining task is node clustering, aiming to partition the nodes of an attributed network into k disjoint clusters such that intra-cluster nodes are closely connected and share similar attributes, while inter-cluster nodes are far apart and dissimilar. It is highly challenging to capture multi-hop connections via nodes or attributes for effective clustering on multiple types of attributed networks. In this paper, we first present AHCKA as an efficient approach to attributed hypergraph clustering (AHC). AHCKA includes a carefully crafted K-nearest neighbor augmentation strategy for the optimized exploitation of attribute information on hypergraphs, a joint hypergraph random walk model to devise an effective AHC objective, and an efficient solver with speedup techniques for the objective optimization. The proposed techniques are extensible to various types of attributed networks, and thus, we develop ANCKA as a versatile attributed network clustering framework, capable of attributed graph clustering, attributed multiplex graph clustering, and AHC. Moreover, we devise ANCKA-GPU with algorithmic designs tailored for GPU acceleration to boost efficiency. We have conducted extensive experiments to compare our methods with 19 competitors on 8 attributed hypergraphs, 16 competitors on 6 attributed graphs, and 16 competitors on 3 attributed multiplex graphs, all demonstrating the superb clustering quality and efficiency of our methods.
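The general idea of K-nearest neighbor augmentation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' AHCKA/ANCKA implementation: the function name `knn_augment`, the choice of cosine similarity, the dense NumPy arrays, and the mixing weight `beta` are all assumptions made for illustration. The sketch connects each node to its k most attribute-similar nodes and adds those edges to an existing adjacency matrix.

```python
import numpy as np

def knn_augment(adj, attrs, k, beta=1.0):
    """Add attribute-based KNN edges to an adjacency matrix (illustrative only).

    adj   : (n, n) array, the original graph adjacency
    attrs : (n, d) array, one attribute vector per node
    k     : number of nearest neighbors per node (must be < n)
    beta  : hypothetical weight given to the KNN edges
    """
    n = attrs.shape[0]
    # Normalize rows so that the dot product equals cosine similarity.
    norms = np.linalg.norm(attrs, axis=1, keepdims=True)
    unit = attrs / np.maximum(norms, 1e-12)
    sim = unit @ unit.T
    # A node must not be selected as its own neighbor.
    np.fill_diagonal(sim, -np.inf)
    # Indices of the k most similar nodes for every row.
    nbrs = np.argsort(-sim, axis=1)[:, :k]
    knn = np.zeros((n, n))
    knn[np.repeat(np.arange(n), k), nbrs.ravel()] = 1.0
    # Symmetrize: keep an edge if either endpoint selected the other.
    knn = np.maximum(knn, knn.T)
    return adj + beta * knn
```

A dense similarity matrix costs O(n^2) memory, so this form only suits small inputs; a large-scale method would need an approximate or sparse neighbor search instead.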
Pages: 1913-1943
Page count: 31