共 50 条
- [7] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning The International Journal of Advanced Manufacturing Technology, 2007, 34 : 968 - 980
- [8] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 34 (9-10): : 968 - 980
- [9] On-policy Q-learning for Adaptive Optimal Control 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014, : 301 - 306
- [10] Input-Decoupled Q-Learning for Optimal Control The Journal of the Astronautical Sciences, 2020, 67 : 630 - 656