A versatile framework for attributed network clustering via K-nearest neighbor augmentation

被引:0
|
作者
Li, Yiran [1 ]
Guo, Gongyao [1 ]
Shi, Jieming [1 ]
Yang, Renchi [2 ]
Shen, Shiqi [3 ]
Li, Qing [1 ]
Luo, Jun [4 ]
机构
[1] Hong Kong Polytech Univ, Hung Hom, Hong Kong, Peoples R China
[2] Hong Kong Baptist Univ, Kowloon Tong, Hong Kong, Peoples R China
[3] WeChat Tencent, Beijing, Peoples R China
[4] Logist & Supply Chain MultiTech R&D Ctr, Pok Fu Lam, Hong Kong, Peoples R China
关键词
Clustering; Attributed Graph; Random Walks; KNN; GPU Computing; PAGERANK;
D O I
10.1007/s00778-024-00875-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Attributed networks containing entity-specific information in node attributes are ubiquitous in modeling social networks, e-commerce, bioinformatics, etc. Their inherent network topology ranges from simple graphs to hypergraphs with high-order interactions and multiplex graphs with separate layers. An important graph mining task is node clustering, aiming to partition the nodes of an attributed network into k disjoint clusters such that intra-cluster nodes are closely connected and share similar attributes, while inter-cluster nodes are far apart and dissimilar. It is highly challenging to capture multi-hop connections via nodes or attributes for effective clustering on multiple types of attributed networks. In this paper, we first present AHCKA as an efficient approach to attributed hypergraph clustering (AHC). AHCKA includes a carefully-crafted K-nearest neighbor augmentation strategy for the optimized exploitation of attribute information on hypergraphs, a joint hypergraph random walk model to devise an effective AHC objective, and an efficient solver with speedup techniques for the objective optimization. The proposed techniques are extensible to various types of attributed networks, and thus, we develop ANCKA as a versatile attributed network clustering framework, capable of attributed graph clustering, attributed multiplex graph clustering, and AHC. Moreover, we devise ANCKA-GPU with algorithmic designs tailored for GPU acceleration to boost efficiency. We have conducted extensive experiments to compare our methods with 19 competitors on 8 attributed hypergraphs, 16 competitors on 6 attributed graphs, and 16 competitors on 3 attributed multiplex graphs, all demonstrating the superb clustering quality and efficiency of our methods.
引用
收藏
页码:1913 / 1943
页数:31
相关论文
共 50 条
  • [31] Implementing k-Nearest Neighbor Algorithm on Scanning Aperture for Accuracy Improvement
    Real-Moreno, Oscar
    Castro-Toscano, Moises J.
    Rodriguez-Quinonez, Julio C.
    Hernandez-Balbuena, Daniel
    Flores-Fuentes, Wendy
    Rivas-Lopez, Moises
    IECON 2018 - 44TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2018, : 3182 - 3186
  • [32] A novel K-nearest neighbor classifier for lung cancer disease diagnosis
    Sachdeva, Ravi Kumar
    Bathla, Priyanka
    Rani, Pooja
    Lamba, Rohit
    Ghantasala, G. S. Pradeep
    Nassar, Ibrahim F.
    Neural Computing and Applications, 2024, 36 (35) : 22403 - 22416
  • [33] An Approach for Fault Diagnosis Based on an Improved k-Nearest Neighbor Algorithm
    Yu Feng
    Liu Lian-chang
    Liu Dong-ming
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 6521 - 6525
  • [34] Research on the Improvement of K-Nearest Neighbor Classifier for Imbalanced Text Categorization
    Yang Yanmei
    Xu Linying
    2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 968 - 972
  • [35] PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures
    Patwary, Md. Mostofa Ali
    Satish, Nadathur Rajagopalan
    Sundaram, Narayanan
    Liu, Jialin
    Sadowski, Peter
    Racah, Evan
    Byna, Suren
    Tull, Craig
    Bhimji, Wahid
    Prabhat
    Dubey, Pradeep
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 494 - 503
  • [36] Effective Classification of EEG Signals using K-Nearest Neighbor Algorithm
    Awan, Umer I.
    Rajput, U. H.
    Syed, Ghazaal
    Iqbal, Rimsha
    Sabat, Ifra
    Mansoor, M.
    PROCEEDINGS OF 14TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY PROCEEDINGS - FIT 2016, 2016, : 120 - 124
  • [37] Botnet Identification Based on Flow Traffic by Using K-Nearest Neighbor
    Gunawan, Dani
    Hairani, Tika
    Hizriadi, Ainul
    2019 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2019), 2019, : 95 - 99
  • [38] SOFTWARE ARCHITECTURE DECOMPOSITION USING ADAPTIVE K-NEAREST NEIGHBOR ALGORITHM
    Alkhalid, Abdulaziz
    Lung, Chung-Horng
    Ajila, Samuel
    2013 26TH ANNUAL IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2013, : 676 - 679
  • [39] Fuzzy-belief K-nearest neighbor classifier for uncertain data
    Liu, Zhun-ga
    Pan, Quan
    Dezert, Jean
    Mercier, Gregoire
    Liu, Yong
    2014 17TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2014,
  • [40] Motorcycle Apprehension using Deep Learning and K-Nearest Neighbor Algorithm
    Garcia, Maria Rosario T.
    Bandala, Argel A.
    Dadios, Elmer P.
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,