14 entries in total
- [2] Chen Y., 2020, KNOWLEDGE SCI ENG MA, P388
- [3] Chen Y., 2020, INT C KNOWLEDGE SCI, P388
- [4] A Generic Markov Decision Process Model and Reinforcement Learning Method for Scheduling Agile Earth Observation Satellites [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (03): 1463-1474