共 66 条
- [22] Hu L, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1858
- [23] Gulrajani I, 2017, ADV NEUR IN, V30
- [24] Kim H. R., 2003, IUI 03. 2003 International Conference on Intelligent User Interfaces, P101, DOI 10.1145/604045.604064
- [25] Kingma DP, 2014, ADV NEUR IN, V27
- [26] Konda VR, 2000, ADV NEUR IN, V12, P1008
- [27] Kostrikov I., 2019, P INT C LEARN REPR
- [28] Learning Behavior Styles with Inverse Reinforcement Learning [J]. ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
- [29] Lillicrap T.P., 2015, CONTINUOUS CONTROL D
- [30] End-to-End Deep Reinforcement Learning based Recommendation with Supervised Embedding [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 384 - 392