50 items in total
[42] Local Optimization Policy for Link Prediction via Reinforcement Learning [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12(02): 1224-1236
[43] QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning [J]. IEEE ACCESS, 2021, 9: 129728-129741
[46] Risk-Sensitive Piecewise-Linear Policy Iteration for Stochastic Shortest Path Markov Decision Processes [J]. ADVANCES IN SOFT COMPUTING, MICAI 2020, PT I, 2020, 12468: 383-395
[50] Verification of Markov Decision Processes with Risk-Sensitive Measures [J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018: 2371-2377