50 items in total
- [31] Policy Gradient Algorithm in Two-Player Zero-Sum Markov Games. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36(01): 81-91
- [32] Zero-sum infinite-horizon discounted piecewise deterministic Markov games. Mathematical Methods of Operations Research, 2023, 97: 179-205