8 entries in total
- [1] Cao YJ, 2024, arXiv:2404.00282
- [2] Chakraborty S, 2023, arXiv:2303.07622
- [3] Du Y, 2023, Guiding pretraining in reinforcement learning with large language models
- [5] Kwon M, 2023, arXiv:2303.00001
- [6] Li H, 2024, arXiv:2312.09238
- [7] Lin J, 2024, arXiv:2308.01399
- [8] Pang JC, 2023, arXiv:2302.09368