共 14 条
- [1] Chen V., 2021, Ask your humans: Using human instructions to improve generalization in reinforcement learning. Proceedings of the International Conference on Learning Representations
- [2] Kwon M., 2023, Reward design with language models
- [3] A Closer Look at Reward Decomposition for High-level Robotic Explanations [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL, 2023, : 429 - 436
- [4] Ma YJ., 2024, EUREKA: Humanlevel reward design via coding large language models. Proceedings of the International Conference on Learning Representations
- [5] OpenAI, ModelsOpenAI API. OpenAI API Documentation
- [6] Prakash B., 2023, Proceedings of the 37th Conference on Neural Information Processing Systems, P1
- [7] Qing YP, 2023, Arxiv, DOI arXiv:2211.06665
- [8] Schulman J, 2017, Arxiv, DOI arXiv:1707.06347
- [9] Shota T., 2023, Analysis on task versatility of instruction based robot learning guided by large language models. Proceedings of the 37th Annual Conference of the Japanese Society for Artificial Intelligence, 2O1GS805