Link Prediction via Higher-Order Motif Features

被引:8
作者
Abuoda, Ghadeer [1 ]
Morales, Gianmarco De Francisci [2 ]
Aboulnaga, Ashraf [3 ]
机构
[1] HBKU, Coll Sci & Engn, Doha, Qatar
[2] ISI Fdn, Turin, Italy
[3] Qatar Comp Res Inst, Doha, Qatar
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I | 2020年 / 11906卷
关键词
Link prediction; Motifs; NETWORK MOTIFS;
D O I
10.1007/978-3-030-46150-8_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Link prediction requires predicting which new links are likely to appear in a graph. In this paper, we present an approach for link prediction that relies on higher-order analysis of the graph topology, well beyond the typical approach which relies on common neighbors. We treat the link prediction problem as a supervised classification problem, and we propose a set of features that depend on the patterns or motifs that a pair of nodes occurs in. By using motifs of sizes 3, 4, and 5, our approach captures a high level of detail about the graph topology. In addition, we propose two optimizations to construct the classification dataset from the graph. First, we propose adding negative examples to the graph as an alternative to the common approach of removing positive ones. Second, we show that it is important to control for the shortest-path distance when sampling pairs of nodes to form negative examples, since the difficulty of prediction varies with the distance. We experimentally demonstrate that using our proposed motif features in off-the-shelf classifiers results in up to 10% points increase in accuracy over prior topology-based and feature-learning methods.
引用
收藏
页码:412 / 429
页数:18
相关论文
共 45 条
  • [1] Abuoda G., 2019, ARXIV PREPRINT ARXIV
  • [2] Friends and neighbors on the Web
    Adamic, LA
    Adar, E
    [J]. SOCIAL NETWORKS, 2003, 25 (03) : 211 - 230
  • [3] Efficient Graphlet Counting for Large Networks
    Ahmed, Nesreen K.
    Neville, Jennifer
    Rossi, Ryan A.
    Duffield, Nick
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 1 - 10
  • [4] Friendship Prediction and Homophily in Social Media
    Aiello, Luca Maria
    Barrat, Alain
    Schifanella, Rossano
    Cattuto, Ciro
    Markines, Benjamin
    Menczer, Filippo
    [J]. ACM TRANSACTIONS ON THE WEB, 2012, 6 (02)
  • [5] Airoldi E.M., 2006, INT BIOM SOC ANN M, V15
  • [6] Al Hasan M., 2006, SDM06, V30, P798
  • [7] Al Hasan M, 2011, SOCIAL NETWORK DATA ANALYTICS, P243
  • [8] Scale-Free Networks: A Decade and Beyond
    Barabasi, Albert-Laszlo
    [J]. SCIENCE, 2009, 325 (5939) : 412 - 413
  • [9] Counting Graphlets: Space vs Time
    Bressan, Marco
    Chierichetti, Flavio
    Kumar, Ravi
    Leucci, Stefano
    Panconesi, Alessandro
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 557 - 566
  • [10] Cukierski W, 2011, 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), P1237, DOI 10.1109/IJCNN.2011.6033365