共 50 条
- [1] Policy Gradient Algorithm in Two-Player Zero-Sum Markov Games Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (01): : 81 - 91
- [2] Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1321 - 1329
- [3] When are Offline Two-Player Zero-Sum Markov Games Solvable? ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [4] Policy gradient algorithm and its convergence analysis for two-player zero-sum Markov games Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (03): : 480 - 491
- [5] Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [6] Corruption-Robust Offline Two-Player Zero-Sum Markov Games INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [7] Inverse Two-Player Zero-Sum Dynamic Games 2016 AUSTRALIAN CONTROL CONFERENCE (AUCC), 2016, : 192 - 196
- [10] Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,