50 entries in total
- [1] Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [2] Constrained Dirichlet Distribution Policy: Guarantee Zero Constraint Violation Reinforcement Learning for Continuous Robotic Control IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (12): 11690-11697
- [3] Model-free Safe Reinforcement Learning Method Based on Constrained Markov Decision Processes Ruan Jian Xue Bao/Journal of Software, 2022, 33 (08): 3086-3102
- [5] Policy Learning with Constraints in Model-free Reinforcement Learning: A Survey PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021: 4508-4515
- [7] Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 3682-3689
- [8] Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023: 6737-6744
- [9] Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [10] Budget Constrained Bidding by Model-free Reinforcement Learning in Display Advertising CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018: 1443-1451