共 64 条
[1]
ON THE ROLE OF DYNAMIC-PROGRAMMING IN STATISTICAL COMMUNICATION-THEORY
[J].
IRE TRANSACTIONS ON INFORMATION THEORY,
1957, 3 (03)
:197-203
[2]
Bordes A., 2013, P 26 INT C NEUR INF, V2, P2787
[3]
A comprehensive survey of multiagent reinforcement learning
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS,
2008, 38 (02)
:156-172
[4]
Q&R: A Two-Stage Approach toward Interactive Recommendation
[J].
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING,
2018,
:139-147
[5]
Towards Conversational Recommender Systems
[J].
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING,
2016,
:815-824
[6]
Deng Y., 2022, arXiv
[7]
Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning
[J].
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL,
2021,
:1431-1441
[8]
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning
[J].
WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018),
2018,
:1939-1948
[9]
Foerster JN, 2016, ADV NEUR IN, V29
[10]
Gao C., 2021, arXiv, DOI DOI 10.1016/J.AIOPEN.2021.06.002