Estimating Node Importance Values in Heterogeneous Information Networks

被引:8
作者
Huang, Chenji [1 ]
Fang, Yixiang [2 ]
Lin, Xuemin [1 ]
Cao, Xin [1 ]
Zhang, Wenjie [1 ]
Orlowska, Maria [3 ]
机构
[1] Univ New South Wales, Sydney, NSW, Australia
[2] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen, Peoples R China
[3] Polish Japanese Inst Informat Technol, Warsaw, Poland
来源
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022) | 2022年
基金
澳大利亚研究理事会;
关键词
Node importance value; heterogeneous information network; semi-supervised learning; graph neural networks; INDIVIDUALS; CENTRALITY; INDEX; WEB;
D O I
10.1109/ICDE53745.2022.00068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Node importance estimation is a fundamental task in graph data analysis. Extensive studies have focused on this task, and various downstream applications have benefited from it, such as recommendation, resource allocation optimization, and missing value completion. However, existing works either focus on the homogeneous network or only study importance-based ranking. We are the first to consider the node importance values as heterogeneous values in heterogeneous information networks (HINs). A typical HIN is built of several distinguished node types where each type has its own measure of importance value (e.g., in the DBLP network, the importance values of authors and papers can be reflected by their h-index and citation numbers, respectively). This characteristic makes the above problem more challenging than computing the node importance in conventional homogeneous networks. In this paper, we formally introduce the problem of node importance value estimation in HINs; that is, given the importance values of a subset of nodes in an HIN, we aim to estimate the importance values of the remaining nodes. To solve this problem, we propose an effective graph neural network (GNN) model, called HIN Importance Value Estimation Network (HIVEN). HIVEN traces the local information of each node, specifically by utilizing the heterogeneity of the HIN. Furthermore, the meta schema is deployed to alleviate the node type domination issue. Additionally, HIVEN exploits the node similarity within each type to remedy the shortcoming of GNN models in capturing global information. Extensive experiments on real-world HIN datasets demonstrate that HIVEN superiorly outperforms the baseline methods.
引用
收藏
页码:846 / 858
页数:13
相关论文
共 50 条
[1]  
[Anonymous], Proceedings of the 2008 ACM SIGMOD international conference on Management of data, SIGMOD '08
[2]  
[Anonymous], 1999, P WEB C
[3]  
[Anonymous], 2006, ISSI NEWSLETTER
[4]  
[Anonymous], 2015, INT C LEARNING REPRE
[5]   DBpedia: A nucleus for a web of open data [J].
Auer, Soeren ;
Bizer, Christian ;
Kobilarov, Georgi ;
Lehmann, Jens ;
Cyganiak, Richard ;
Ives, Zachary .
SEMANTIC WEB, PROCEEDINGS, 2007, 4825 :722-+
[6]   A Framework to Quantify Approximate Simulation on Graph Data [J].
Chen, Xiaoshuang ;
Lai, Longbin ;
Qin, Lu ;
Lin, Xuemin ;
Liu, Boge .
2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, :1308-1319
[7]  
Chen Z, 2018, PR MACH LEARN RES, V80
[8]   Visual Recommendations for Network Navigation [J].
Crnovrsanin, Tarik ;
Liao, Isaac ;
Wu, Yingcai ;
Ma, Kwan-Liu .
COMPUTER GRAPHICS FORUM, 2011, 30 (03) :1081-1090
[9]   Study on centrality measures in social networks: a survey [J].
Das K. ;
Samanta S. ;
Pal M. .
Social Network Analysis and Mining, 2018, 8 (01)
[10]   The price prediction for the energy market based on a new method [J].
Ebrahimian, Homayoun ;
Barmayoon, Saeed ;
Mohammadi, Mohsen ;
Ghadimi, Noradin .
ECONOMIC RESEARCH-EKONOMSKA ISTRAZIVANJA, 2018, 31 (01) :313-337