Constrained-meta-path-based ranking in heterogeneous information network

被引:19
作者
Shi, Chuan [1 ]
Li, Yitong [1 ]
Yu, Philip S. [2 ]
Wu, Bin [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Intelligent Telecommun Software &, Beijing, Peoples R China
[2] Univ Illinois, Chicago, IL USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Heterogeneous information network; Ranking; Random walk; Tensor analysis;
D O I
10.1007/s10115-016-0916-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there is a surge of interests on heterogeneous information network analysis, where the network includes different types of objects or links. As a newly emerging network model, heterogeneous information networks have many unique features, e.g., complex structure and rich semantics. Moreover, meta path, the sequence of relations connecting two object types, is widely used to integrate different types of objects and mine the semantics information in this kind of networks. The object ranking is an important and basic function in network analysis, which has been extensively studied in homogeneous networks including the same type of objects and links. However, it is not well exploited in heterogeneous networks until now, since the characteristics of heterogeneous networks introduce new challenges for object ranking. In this paper, we study the ranking problem in heterogeneous networks and propose the HRank method to evaluate the importance of multiple types of objects and meta paths. Since the traditional meta path coarsely embodies path semantics, we propose a constrained meta path to subtly capture the refined semantics through confining constraints on objects. Based on a path-constrained random walk process, HRank can simultaneously determine the importance of objects and constrained meta paths through applying the tensor analysis. Extensive experiments on three real datasets show that HRank can effectively evaluate the importance of objects and paths together. Moreover, the constrained meta path shows its potential on mining subtle semantics by obtaining more accurate ranking results.
引用
收藏
页码:719 / 747
页数:29
相关论文
共 27 条
  • [1] [Anonymous], 2011, SIGKDD
  • [2] [Anonymous], 1998, P 7 INT WORLD WID WE
  • [3] [Anonymous], 2011, P 17 ACM SIGKDD INT
  • [4] [Anonymous], 2010, SIGKDD
  • [5] [Anonymous], 2015, P CIKM
  • [6] [Anonymous], 2012, Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, DOI DOI 10.1145/2339530.2339738
  • [7] Chakrabarti S., 2007, P INT C WORLD WID WE, P571, DOI [10.1145/ 1242572.1242650, DOI 10.1145/1242572.1242650]
  • [8] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
  • [9] Deng HB, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P239
  • [10] Han JW, 2009, LECT NOTES ARTIF INT, V5808, P13