共 50 条
- [12] Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness Journal of Artificial Intelligence Research, 2024, 81 : 481 - 509
- [13] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16908 - 16916
- [14] Efficient and Stable Offline-to-online Reinforcement Learning via Continual Policy Revitalization PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 4317 - 4325
- [15] SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 11961 - 11969
- [17] Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [18] O2OAT: Efficient Offline-to-Online Reinforcement Learning with Adaptive Transition Strategy 2024 10TH INTERNATIONAL CONFERENCE ON BIG DATA AND INFORMATION ANALYTICS, BIGDIA 2024, 2024, : 569 - 576