Semantic Sensitive Simultaneous Tensor Factorization

被引:3
作者
Nakatsuji, Makoto [1 ]
机构
[1] NTT Resonant Inc, Minato Ku, Granparktower,3-4-1 Shibaura, Tokyo 1080023, Japan
来源
SEMANTIC WEB - ISWC 2016, PT I | 2016年 / 9981卷
关键词
KNOWLEDGE-BASE; WEB;
D O I
10.1007/978-3-319-46523-4_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The semantics distributed over large-scale knowledge bases can be used to intermediate heterogeneous users' activity logs created in services; such information can be used to improve applications that can help users to decide the next activities/services. Since user activities can be represented in terms of relationships involving three or more things (e.g. a user tags movie items on a webpage), tensors are an attractive approach to represent them. The recently introduced Semantic Sensitive Tensor Factorization (SSTF) is promising as it achieves high accuracy in predicting users' activities by basing tensor factorization on the semantics behind objects (e.g. item categories). However, SSTF currently focuses on the factorization of a tensor for a single service and thus has two problems: (1) the balance problem occurs when handling heterogeneous datasets simultaneously, and (2) the sparsity problem triggered by insufficient observations within a single service. Our solution, Semantic Sensitive Simultaneous Tensor Factorization ((STF)-T-3), tackles the problems by: (1) Creating tensors for individual services and factorizing them simultaneously; it does not force the creation of a tensor from multiple services and factorize the single tensor. This avoids the low prediction accuracy caused by the balance problem. (2) Utilizing shared semantics behind distributed activity logs and assigning semantic bias to each tensor factorization. This avoids the sparsity problem by sharing semantics among services. Experiments using real-world datasets show that (STF)-T-3 achieves higher accuracy in rating prediction than the current best tensor method. It also extracts implicit relationships across services in the feature spaces by simultaneous factorization with shared semantics.
引用
收藏
页码:411 / 427
页数:17
相关论文
共 22 条
[1]  
[Anonymous], 2013, P 2013 ACM SIGMOD IN
[2]  
[Anonymous], 2012, Proceedings of the 21st international conference on World Wide Web
[3]  
[Anonymous], 2010, P 4 ACM C RECOMMEND
[4]  
[Anonymous], 2010, PROC SIAM INT C DATA, DOI DOI 10.1137/1.9781611972801.19
[5]  
Bizer C., 2009, INT J SEMANT WEB INF, V5, P122
[6]   DBpedia - A crystallization point for the Web of Data [J].
Bizer, Christian ;
Lehmann, Jens ;
Kobilarov, Georgi ;
Auer, Soeren ;
Becker, Christian ;
Cyganiak, Richard ;
Hellmann, Sebastian .
JOURNAL OF WEB SEMANTICS, 2009, 7 (03) :154-165
[7]  
Cemgil A. T., 2009, COMPUT INTEL NEUROSC, V2009, P4
[8]   Knowledge Vault: A Web-Scale Approach to Probabilistic Knowledge Fusion [J].
Dong, Xin Luna ;
Gabrilovich, Evgeniy ;
Heitz, Geremy ;
Horn, Wilko ;
Lao, Ni ;
Murphy, Kevin ;
Strohmann, Thomas ;
Sun, Shaohua ;
Zhang, Wei .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :601-610
[9]  
Franz T, 2009, LECT NOTES COMPUT SC, V5823, P213, DOI 10.1007/978-3-642-04930-9_14
[10]   YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia [J].
Hoffart, Johannes ;
Suchanek, Fabian M. ;
Berberich, Klaus ;
Weikum, Gerhard .
ARTIFICIAL INTELLIGENCE, 2013, 194 :28-61