Long Short-Term Planning for Conversational Recommendation Systems

Cited by: 0

Authors:

Li, Xian [1]

Shi, Hongguang [1]

Wang, Yunfei [1]

Zhang, Yeqin [1]

Li, Xubin [1]

Nguyen, Cam-Tu [1]

Affiliations:

[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China

Source:

NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI | 2024 / Vol. 14452

Keywords:

Conversational Recommendation Systems; Planning;

DOI:

10.1007/978-981-99-8076-5_28

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In Conversational Recommendation Systems (CRS), the central question is how the conversational agent can naturally ask for user preferences and provide suitable recommendations. Existing works mainly follow a hierarchical architecture, in which a higher-level policy decides whether to invoke the conversation module (to ask questions) or the recommendation module (to make recommendations). This architecture prevents the two components from fully interacting with each other. In contrast, this paper proposes a novel architecture, the long short-term feedback architecture, to connect these two essential components of CRS. Specifically, the recommendation module predicts the long-term recommendation target based on the conversational context and the user history. Driven by this target, the conversation module predicts the next topic or attribute to verify whether the user's preference matches the target. The balancing feedback loop continues until the short-term planner's output matches the long-term planner's output, at which point the system should make the recommendation.
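The feedback loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the planners in the paper are learned models, whereas `toy_long_term`, `toy_short_term`, `feedback_loop`, the `ITEM_ATTRIBUTES` catalogue, and the `yes:`/`no:` reply encoding are all hypothetical stand-ins chosen only to show the control flow (predict a target, probe one attribute, stop when the short-term output equals the long-term target).

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical toy catalogue: item -> attributes. Not from the paper.
ITEM_ATTRIBUTES: Dict[str, List[str]] = {
    "Inception": ["sci-fi", "Nolan"],
    "Titanic": ["romance", "Cameron"],
}


def toy_long_term(context: List[str], history: List[str]) -> str:
    """Stand-in long-term planner: pick the item whose attributes best
    agree with the replies so far. `history` is ignored in this toy."""
    def score(item: str) -> int:
        attrs = ITEM_ATTRIBUTES[item]
        confirmed = sum(1 for a in attrs if f"yes:{a}" in context)
        rejected = sum(1 for a in attrs if f"no:{a}" in context)
        return confirmed - rejected
    return max(ITEM_ATTRIBUTES, key=score)


def toy_short_term(context: List[str], target: str) -> str:
    """Stand-in short-term planner: return the next unverified attribute
    of the target, or the target itself once nothing is left to verify."""
    for attr in ITEM_ATTRIBUTES[target]:
        if f"yes:{attr}" not in context and f"no:{attr}" not in context:
            return attr
    return target


def feedback_loop(
    long_term: Callable[[List[str], List[str]], str],
    short_term: Callable[[List[str], str], str],
    ask_user: Callable[[str], str],
    history: List[str],
    max_turns: int = 5,
) -> Tuple[str, List[str]]:
    """Alternate the two planners until their outputs match."""
    context: List[str] = []
    target = long_term(context, history)
    for _ in range(max_turns):
        target = long_term(context, history)   # long-term: where to end up
        step = short_term(context, target)     # short-term: what to do next
        if step == target:                     # outputs match: recommend
            break
        context.append(ask_user(step))         # otherwise ask and record reply
    return target, context


# Simulated user who likes romance films directed by Cameron (assumption).
liked = {"romance", "Cameron"}
item, replies = feedback_loop(
    toy_long_term,
    toy_short_term,
    lambda attr: f"yes:{attr}" if attr in liked else f"no:{attr}",
    history=[],
)
print(item, replies)
```

In this toy run the first target is revised after the user rejects one of its attributes, which is the kind of interaction between the two components that the abstract says a strictly hierarchical policy does not allow. The `max_turns` cap is an added safeguard for the sketch, not something stated in the source.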

Citation

Pages: 383-395

Page count: 13
