Long Short-Term Planning for Conversational Recommendation Systems

被引:0
作者
Li, Xian [1 ]
Shi, Hongguang [1 ]
Wang, Yunfei [1 ]
Zhang, Yeqin [1 ]
Li, Xubin [1 ]
Nguyen, Cam-Tu [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
来源
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI | 2024年 / 14452卷
关键词
Conversational Recommendation Systems; Planning;
D O I
10.1007/978-981-99-8076-5_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Conversational Recommendation Systems (CRS), the central question is how the conversational agent can naturally ask for user preferences and provide suitable recommendations. Existing works mainly follow the hierarchical architecture, where a higher policy decides whether to invoke the conversation module (to ask questions) or the recommendation module (to make recommendations). This architecture prevents these two components from fully interacting with each other. In contrast, this paper proposes a novel architecture, the long short-term feedback architecture, to connect these two essential components in CRS. Specifically, the recommendation predicts the long-term recommendation target based on the conversational context and the user history. Driven by the targeted recommendation, the conversational model predicts the next topic or attribute to verify if the user preference matches the target. The balance feedback loop continues until the short-term planner output matches the long-term planner output, that is when the system should make the recommendation.
引用
收藏
页码:383 / 395
页数:13
相关论文
共 20 条
  • [1] Bordes A., 2013, ADV NEURAL INFORM PR, V2013, P2787, DOI DOI 10.5555/2999792.2999923
  • [2] Chen Qibin., 2019, EMNLP
  • [3] Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning
    Deng, Yang
    Li, Yaliang
    Sun, Fei
    Ding, Bolin
    Lam, Wai
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1431 - 1441
  • [4] Han X, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P139
  • [5] Cumulated gain-based evaluation of IR techniques
    Järvelin, K
    Kekäläinen, J
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (04) : 422 - 446
  • [6] Kenton J.D. M.-W.C., 2016, ARXIV181004805, P4171
  • [7] Interactive Path Reasoning on Graph for Conversational Recommendation
    Lei, Wenqiang
    Zhang, Gangyi
    He, Xiangnan
    Miao, Yisong
    Wang, Xiang
    Chen, Liang
    Chua, Tat-Seng
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 2073 - 2083
  • [8] Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems
    Lei, Wenqiang
    He, Xiangnan
    Miao, Yisong
    Wu, Qingyun
    Hong, Richang
    Kan, Min-Yen
    Chua, Tat-Seng
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 304 - 312
  • [9] Li JC, 2020, PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), P322, DOI 10.1145/3336191.3371786
  • [10] Li R., 2018, NIPS