共 50 条
[31]
REINFORCEMENT LEARNING OF SPEECH RECOGNITION SYSTEM BASED ON POLICY GRADIENT AND HYPOTHESIS SELECTION
[J].
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2018,
:5759-5763
[33]
Model-free Reinforcement Learning of Semantic Communication by Stochastic Policy Gradient
[J].
2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024,
2024,
:367-373
[35]
Continuous Parameter Control in Genetic Algorithms using Policy Gradient Reinforcement Learning
[J].
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI),
2021,
:115-122
[36]
QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning
[J].
IEEE ACCESS,
2021, 9
:129728-129741
[37]
Reinforcement Learning for Mobile Robot Obstacle Avoidance with Deep Deterministic Policy Gradient
[J].
INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT III,
2022, 13457
:197-204
[38]
Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning
[J].
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211,
2023, 211
[40]
Learning Heuristics for the TSP by Policy Gradient
[J].
INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2018,
2018, 10848
:170-181