A hybrid clustering approach for link prediction in heterogeneous information networks

被引:3
作者
Sajjadi, Zahra Sadat [1 ]
Esmaeili, Mahdi [2 ]
Ghobaei-Arani, Mostafa [1 ]
Minaei-Bidgoli, Behrouz [3 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Qom Branch, Qom, Iran
[2] Islamic Azad Univ, Dept Comp Engn, Kashan Branch, Kashan, Iran
[3] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
关键词
Social network; Graph clustering; Structural similarity; Attribute similarity; Hybrid similarity; K-Medoids; COMMUNITY DETECTION; ALGORITHM;
D O I
10.1007/s10115-023-01914-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, researchers from academic and industrial fields have become increasingly interested in social network data to extract meaningful information. This information is used in applications such as link prediction between people groups, community detection, protein module identification, etc. Therefore, the clustering technique has emerged as a solution to finding similarities between social network members. Recently, in most graph clustering solutions, the structural similarity of nodes is combined with their attribute similarity. The results of these solutions indicate that the graph's topological structure is more important. Since most social networks are sparse, these solutions often suffer from insufficient use of node features. This paper proposes a hybrid clustering approach as an application for link prediction in heterogeneous information networks (HINs). In our approach, an adjacency vector is determined for each node until, in this vector, the weight of the direct edge or the weight of the shortest communication path among every pair of nodes is considered. A similarity metric is presented that calculates similarity using the direct edge weight between two nodes and the correlation between their adjacency vectors. Finally, we evaluated the effectiveness of our proposed method using DBLP, Political blogs, and Citeseer datasets under entropy, density, purity, and execution time metrics. The simulation results demonstrate that while maintaining the cluster density significantly reduces the entropy and the execution time compared with the other methods.
引用
收藏
页码:4905 / 4937
页数:33
相关论文
共 35 条
  • [1] Aggarwal CC, 2011, SOCIAL NETWORK DATA ANALYTICS, P1
  • [2] SAG Cluster: An unsupervised graph clustering based on collaborative similarity for community detection in complex networks
    Agrawal, Smita
    Patel, Atul
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2021, 563
  • [3] Fast graph clustering with a new description model for community detection
    Bai, Liang
    Cheng, Xueqi
    Liang, Jiye
    Guo, Yike
    [J]. INFORMATION SCIENCES, 2017, 388 : 37 - 47
  • [4] BERAHMAND K, 2022, CLUSTER COMPUT, P1
  • [5] A new attributed graph clustering by using label propagation in complex networks
    Berahmand, Kamal
    Haghani, Sogol
    Rostami, Mehrdad
    Li, Yuefeng
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (05) : 1869 - 1883
  • [6] Recommender systems survey
    Bobadilla, J.
    Ortega, F.
    Hernando, A.
    Gutierrez, A.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 46 : 109 - 132
  • [7] Clustering Large Attributed Graphs: A Balance between Structural and Attribute Similarities
    Cheng, Hong
    Zhou, Yang
    Yu, Jeffrey Xu
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2011, 5 (02)
  • [8] Fortunato S., 2016, arXiv
  • [9] Improving link prediction in social networks using local and global features: a clustering-based approach
    Ghasemi, S.
    Zarei, A.
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2022, 11 (01) : 79 - 92
  • [10] A hybrid method of link prediction in directed graphs
    Ghorbanzadeh, Hossien
    Sheikhahmadi, Amir
    Jalili, Mahdi
    Sulaimany, Sadegh
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165