A Ranking Approach on Large-Scale Graph With Multidimensional Heterogeneous Information

被引:10
|
作者
Wei, Wei [1 ]
Gao, Bin [2 ]
Liu, Tie-Yan [2 ]
Wang, Taifeng [2 ]
Li, Guohui [1 ]
Li, Hang [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
[3] Noahs Ark Labs Huawei Technol, Hong Kong, Hong Kong, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
Graph-based ranking; MapReduce; parameterized graph model; CONVERGENCE CONDITIONS; PAGERANK;
D O I
10.1109/TCYB.2015.2418233
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph-based ranking has been extensively studied and frequently applied in many applications, such as webpage ranking. It aims at mining potentially valuable information from the raw graph-structured data. Recently, with the proliferation of rich heterogeneous information (e.g., node/edge features and prior knowledge) available in many real-world graphs, how to effectively and efficiently leverage all information to improve the ranking performance becomes a new challenging problem. Previous methods only utilize part of such information and attempt to rank graph nodes according to link-based methods, of which the ranking performances are severely affected by several well-known issues, e.g., over-fitting or high computational complexity, especially when the scale of graph is very large. In this paper, we address the large-scale graph-based ranking problem and focus on how to effectively exploit rich heterogeneous information of the graph to improve the ranking performance. Specifically, we propose an innovative and effective semi-supervised PageRank (SSP) approach to parameterize the derived information within a unified semi-supervised learning framework (SSLF-GR), then simultaneously optimize the parameters and the ranking scores of graph nodes. Experiments on the real-world large-scale graphs demonstrate that our method significantly outperforms the algorithms that consider such graph information only partially.
引用
收藏
页码:930 / 944
页数:15
相关论文
共 50 条
  • [1] Pre-training on Large-Scale Heterogeneous Graph
    Jiang, Xunqiang
    Jia, Tianrui
    Fang, Yuan
    Shi, Chuan
    Lin, Zhe
    Wang, Hui
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 756 - 766
  • [2] Graph Neural Networks for Friend Ranking in Large-scale Social Platforms
    Sankar, Aravind
    Liu, Yozen
    Yu, Jun
    Shah, Neil
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2535 - 2546
  • [3] Graph Mining-based Trust Evaluation Mechanism with Multidimensional Features for Large-scale Heterogeneous Threat Intelligence
    Gao, Yali
    Li, Xiaoyong
    Li, Jirui
    Gao, Yunquan
    Guo, Ning
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1272 - 1277
  • [4] Large-Scale Graph Mining and Learning for Information Retrieval
    Gao, Bin
    Wang, Taifeng
    Liu, Tie-Yan
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1194 - 1195
  • [5] Techniques for Solving Large-Scale Graph Problems on Heterogeneous Platforms
    Afanasyev, Ilya
    Daryin, Alexander
    Dongarra, Jack
    Nikitenko, Dmitry
    Teplov, Alexey
    Voevodin, Vladimir
    SUPERCOMPUTING, RUSCDAYS 2016, 2016, 687 : 318 - 332
  • [6] CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification
    Cao, Lele
    von Ehrenheim, Vilhelm
    Granroth-Wilding, Mark
    Stahl, Richard Anselmo
    Mccornack, Andrew
    Catovic, Armin
    Rocha, Dhiana Deva Cavalcanti
    IEEE TRANSACTIONS ON BIG DATA, 2025, 11 (01) : 247 - 258
  • [7] Heterogeneous Graph Propagation for Large-Scale Web Image Search
    Xie, Lingxi
    Tian, Qi
    Zhou, Wengang
    Zhang, Bo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (11) : 4287 - 4298
  • [8] CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification
    Cao, Lele
    von Ehrenheim, Vilhelm
    Granroth-Wilding, Mark
    Stahl, Richard Anselmo
    McCornack, Andrew
    Catovic, Armin
    Rocha, Dhiana Deva Cavalcanti
    PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024, 2024, : 4816 - 4827
  • [9] FRANK: A Fast Node Ranking Approach in Large-Scale Networks
    Zhang, Yu
    Gu, Lin
    Liao, Xiaofei
    Jin, Hai
    Zeng, Deze
    Zhou, Bing Bing
    IEEE NETWORK, 2017, 31 (01): : 36 - 43
  • [10] Large-scale graph signal denoising: A heuristic approach
    Fattahi, Mohammadreza
    Saeedi-Sourck, Hamid
    Abootalebi, Vahid
    DIGITAL SIGNAL PROCESSING, 2025, 158