共 50 条
- [22] Benefits of Combining Dimensional Attention and Working Memory for Partially Observable Reinforcement Learning Problems [J]. ACMSE 2021: PROCEEDINGS OF THE 2021 ACM SOUTHEAST CONFERENCE, 2021, : 209 - 213
- [23] Abstraction in Model Based Partially Observable Reinforcement Learning using Extended Sequence Trees [J]. 2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 2, 2012, : 348 - 355
- [26] Learning what to memorize: Using intrinsic motivation to form useful memory in partially observable reinforcement learning [J]. Applied Intelligence, 2023, 53 : 19074 - 19092
- [27] A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2008, 24 (07): : 687 - 693
- [29] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (05): : 1004 - 1011