共 37 条
[2]
Beard R. W., 1995, THESIS
[5]
An MDP Model-Based Reinforcement Learning Approach for Production Station Ramp-Up Optimization: Q-Learning Analysis
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS,
2014, 44 (09)
:1125-1138
[6]
Adaptive Learning and Control for MIMO System Based on Adaptive Dynamic Programming
[J].
IEEE TRANSACTIONS ON NEURAL NETWORKS,
2011, 22 (07)
:1133-1148