共 22 条
- [1] Kakade S M., On the sample complexity of reinforcement learning, (2003)
- [2] Sutton R S, Barto A G., Reinforcement learning: An introduction, IEEE Transactions on Neural Networks, 9, 5, (1998)
- [3] Taylor M E, Stone P., Transfer learning for reinforcement learning domains: A survey, Journal of Machine Learning Research, 10, 7, pp. 1633-1685, (2009)
- [4] Da Silva F L, Costa A H R., A survey on transfer learning for multiagent reinforcement learning systems, Journal of Artificial Intelligence Research, 64, pp. 645-703, (2019)
- [5] Yang T P, Hao J Y, Meng Z P, Et al., Towards efficient detection and optimal response against sophisticated opponents, Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 623-629, (2019)
- [6] Chen H, Liu Q, Huang J, Et al., Efficiently tracking multi-strategic opponents: A context-aware Bayesian policy reuse approach, Applied Soft Computing, 121, (2022)
- [7] Ammar H B, Eaton E, Taylor M E, Et al., An automated measure of MDP similarity for transfer in reinforcement learning
- [8] Song J H, Gao Y, Wang H, Et al., Measuring the distance between finite Markov decision processes, Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, pp. 468-476, (2016)
- [9] Brys T, Harutyunyan A, Taylor M E, Et al., Policy transfer using reward shaping, Proceedings of the 14th International Conference on Autonomous Agents and Multiagent Systems, pp. 181-188, (2015)
- [10] Bianchi R A C, Martins M F, Ribeiro C H C, Et al., Heuristically-accelerated multiagent reinforcement learning, IEEE Transactions on Cybernetics, 44, 2, pp. 252-265, (2014)