46 entries in total
[1]
Rusu AA, 2016, Arxiv, DOI arXiv:1511.06295
[2]
Basilico J., 2004, P 21 INT C MACH LEAR, P9
[3]
Causal Embeddings for Recommendation [J]. 12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018: 104-112
[4]
Chen HK, 2019, AAAI CONF ARTIF INTE, P3312
[5]
Counterfactual Samples Synthesizing for Robust Visual Question Answering [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 10797-10806
[6]
Counterfactual Critic Multi-Agent Training for Scene Graph Generation [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 4612-4622
[7]
Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018: 1187-1196
[8]
Generative Inverse Deep Reinforcement Learning for Online Recommendation [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021: 201-210
[9]
Chen XC, 2021, Arxiv, DOI arXiv:2109.03540
[10]
Locality-Sensitive State-Guided Experience Replay Optimization for Sparse Rewards in Online Recommendation [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022: 1316-1325