19 entries in total
[1]
POOR P, BASL J., Role of collaborative robots in industry 4.0 with target on education in industrial engineering[C], 2019 4th International Conference on Control, Robotics and Cybernetics (CRC), pp. 42-46, (2019)
[2]
WU J, HE H, PENG J, et al., Continuous reinforcement learning of energy management with deep Q network for a power split hybrid electric bus[J], Applied Energy, 222, pp. 799-811, (2018)
[3]
SCHULMAN J, LEVINE S, MORITZ P, et al., Trust region policy optimization[J], arXiv:1502.05477, (2015)
[4]
ZHANG Y, DENG Z, GAO Y., Angle of arrival passive location algorithm based on proximal policy optimization[J], Electronics, 8, 12, (2019)
[5]
HAARNOJA T, ZHOU A, ABBEEL P, et al., Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor[J], (2018)
[6]
MORALES E F, ZARAGOZA J H., An introduction to reinforcement learning[J], IEEE, 11, 4, pp. 219-354, (2011)
[7]
LUONG N C, HOANG D T, GONG S, et al., Applications of deep reinforcement learning in communications and networking: a survey[J], IEEE Communications Surveys & Tutorials, 21, 4, pp. 3133-3174, (2019)
[8]
LIU Y, WU Z, et al., Multiobjective preimpact trajectory planning of space manipulator for self-assembling a heavy payload[J], International Journal of Advanced Robotic Systems, 18, 1, pp. 1-26, (2021)
[9]
WANG J., Analysis and design of a k-winners-take-all model with a single state variable and the Heaviside step activation function[J], IEEE Transactions on Neural Networks, 21, 9, pp. 1496-1506, (2010)
[10]
KORMUSHEV P, CALINON S, CALDWELL D G., Imitation learning of positional and force skills demonstrated via kinesthetic teaching and haptic input[J], Advanced Robotics, 25, 5, pp. 581-603, (2011)