Cross-Domain Recommendation to Cold-Start Users via Variational Information Bottleneck

被引:53
作者
Cao, Jiangxia [1 ,2 ]
Sheng, Jiawei [1 ,2 ]
Cong, Xin [1 ,2 ]
Liu, Tingwen [1 ,2 ]
Wang, Bin [3 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Xiaomi AI Lab, Beijing, Peoples R China
来源
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022) | 2022年
关键词
Cross-Domain Recommendation; User ColdStart Recommendation; Information Bottleneck;
D O I
10.1109/ICDE53745.2022.00211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recommender systems have been widely deployed in many real-world applications, but usually suffer from the longstanding user cold-start problem. As a promising way, CrossDomain Recommendation (CDR) has attracted a surge of interest, which aims to transfer the user preferences observed in the source domain to make recommendations in the target domain. Previous CDR approaches mostly achieve the goal by following the Embedding and Mapping (EMCDR) idea which attempts to learn a mapping function to transfer the pre-trained user representations (embeddings) from the source domain into the target domain. However, they pre-train the user/item representations independently for each domain, ignoring to consider both domain interactions simultaneously. Therefore, the biased pre-trained representations inevitably involve the domain-specific information which may lead to negative impact to transfer information across domains. In this work, we consider a key point of the CDR task: what information needs to be shared across domains? To achieve the above idea, this paper utilizes the information bottleneck (IB) principle, and proposes a novel approach termed as CDRIB to enforce the representations encoding the domainshared information. To derive the unbiased representations, we devise two IB regularizers to model the cross-domain/in-domain user-item interactions simultaneously and thereby CDRIB could consider both domain interactions jointly for de-biasing. With an additional contrastive information regularizer, CDRIB can also capture cross-domain user-user correlations. In this way, those regularizers encourage the representations to encode the domain-shared information, which has the capability to make recommendations in both domains directly. To the best of our knowledge, this paper is the first work to capture the domain-shared information for cold-start users via variational information bottleneck. Empirical experiments illustrate that CDRIB outperforms the state-of-the-art approaches on four realworld cross-domain datasets, demonstrating the effectiveness of adopting the information bottleneck for CDR.
引用
收藏
页码:2209 / 2223
页数:15
相关论文
共 68 条
[1]  
Ahangama S., 2019, LATENT USER LINKING
[2]  
Alemi A.A., 2017, INT C LEARN REPR
[3]  
Andrychowicz M, 2016, ADV NEUR IN, V29
[4]  
[Anonymous], 2018, ICML
[5]  
[Anonymous], 2018, IEEE T KNOWLEDGE DAT
[6]  
[Anonymous], 2018, ICML
[7]  
Bi Y., 2020, ACM INT C RES DEV IN
[8]  
Cao J., 2021, EUR C MACH LEARN PRI
[9]  
Cui S., 2020, EDGE ENHANCED GRAPH
[10]  
Fan W., 2021, IEEE ICDE