ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation

被引：0

作者：

Taveekitworachai, Pittawat ^{[1
]}

Abdullah, Febri ^{[1
]}

Dewantoro, Mury F. ^{[1
]}

Xia, Yi ^{[1
]}

Suntichaikul, Pratch ^{[1
]}

Thawonmas, Ruck ^{[2
]}

Togelius, Julian ^{[3
]}

Renz, Jochen ^{[4
]}

机构：

[1] Ritsumeikan Univ, Grad Sch, Osaka, Japan

[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Osaka, Japan

[3] NYU, Tandon Sch Engn, New York, NY USA

[4] Australian Natl Univ, Sch Comp, Canberra, ACT, Australia

来源：

2024 IEEE CONFERENCE ON GAMES, COG 2024 | 2024年

关键词：

Angry birds; procedural content generation; large language model; conversational agent; prompt engineering;

D O I：

10.1109/CoG60054.2024.10645641

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents the second ChatGPT4PCG competition at the 2024 IEEE Conference on Games. In this edition of the competition, we follow the first edition, but make several improvements and changes. We introduce a new evaluation metric along with allowing a more flexible format for participants' submissions and making several improvements to the evaluation pipeline. Continuing from the first edition, we aim to foster and explore the realm of prompt engineering (PE) for procedural content generation (PCG). While the first competition saw success, it was hindered by various limitations; we aim to mitigate these limitations in this edition. We introduce diversity as a new metric to discourage submissions aimed at producing repetitive structures. Furthermore, we allow submission of a Python program instead of a prompt text file for greater flexibility in implementing advanced PE approaches, which may require control flow, including conditions and iterations. We also make several improvements to the evaluation pipeline with a better classifier for similarity evaluation and better-performing function signatures. We thoroughly evaluate the effectiveness of the new metric and the improved classifier. Additionally, we perform an ablation study to select a function signature to instruct ChatGPT for level generation. Finally, we provide implementation examples of various PE techniques in Python and evaluate their preliminary performance. We hope this competition serves as a resource and platform for learning about PE and PCG in general1.

引用

页数：8

共 19 条

[1]

Abdullah F, 2020, IEEE CONF COMPU INTE, P698, DOI 10.1109/CoG47356.2020.9231547

[2]

Besta M., 2023, Graph of thoughts: Solving elaborate problems with large language models

[3]

Brown TB, 2020, ADV NEUR IN, V33

[4]

Choi J., 2002, Rarity Score: A New Metric to Evaluate the Uncommonness of Synthesized Images

[5]

Ding R., 2023, Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation

[6]

Ferreira L, 2014, IEEE CONF COMPU INTE

[7]

Gravina D, 2019, IEEE CONF COMPU INTE, DOI 10.1109/cig.2019.8848053

[8]

Kojima T, 2022, ADV NEUR IN

[9]

Lu Y, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P8086

[10]

Sclar M., 2024, 12 INT C LEARN REPR

← 1 2 →