共 50 条
- [42] Evaluation of Pretrained Large Language Models in Embodied Planning Tasks ARTIFICIAL GENERAL INTELLIGENCE, AGI 2023, 2023, 13921 : 222 - 232
- [43] Level Generation Through Large Language Models PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES, FDG 2023, 2023,
- [44] On the Capacity of Citation Generation by Large Language Models INFORMATION RETRIEVAL, CCIR 2024, 2025, 15418 : 109 - 123
- [47] Evaluating Large Language Models for Tax Law Reasoning INTELLIGENT SYSTEMS, BRACIS 2024, PT I, 2025, 15412 : 460 - 474
- [48] Evaluating alignment in large language models: a review of methodologies AI and Ethics, 2025, 5 (3): : 3233 - 3240
- [49] A Chinese Dataset for Evaluating the Safeguards in Large Language Models FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 3106 - 3119
- [50] EconNLI: Evaluating Large Language Models on Economics Reasoning FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 982 - 994