Evaluating the Performance of Code Generation Models for Solving Parsons Problems With Small Prompt Variations

被引:40
作者
Reeves, Brent [1 ]
Sarsa, Sami [2 ]
Prather, James [1 ]
Denny, Paul [3 ]
Becker, Brett A. [4 ]
Hellas, Arto [2 ]
Kimmel, Bailey [1 ]
Powell, Garrett [1 ]
Leinonen, Juho [3 ]
机构
[1] Abilene Christian Univ, Abilene, TX 79699 USA
[2] Aalto Univ, Espoo, Finland
[3] Univ Auckland, Auckland, New Zealand
[4] Univ Coll Dublin, Dublin, Ireland
来源
PROCEEDINGS OF THE 2023 CONFERENCE ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, ITICSE 2023, VOL 1 | 2023年
关键词
academic integrity; AI; artificial intelligence; ChatGPT; code generation; code writing; Codex; computer programming; Copilot; CS1; deep learning; generative AI; introductory programming; GitHub; GPT-3; large language models; machine learning; ML; neural networks; natural language processing; novice programming; OpenAI;
D O I
10.1145/3587102.3588805
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The recent emergence of code generation tools powered by large language models has attracted wide attention. Models such as OpenAI Codex can take natural language problem descriptions as input and generate highly accurate source code solutions, with potentially significant implications for computing education. Given the many complexities that students face when learning to write code, they may quickly become reliant on such tools without properly understanding the underlying concepts. One popular approach for scaffolding the code writing process is to use Parsons problems, which present solution lines of code in a scrambled order. These remove the complexities of low-level syntax, and allow students to focus on algorithmic and design-level problem solving. It is unclear how well code generation models can be applied to solve Parsons problems, given the mechanics of these models and prior evidence that they underperform when problems include specific restrictions. In this paper, we explore the performance of the Codex model for solving Parsons problems over various prompt variations. Using a corpus of Parsons problems we sourced from the computing education literature, we find that Codex successfully reorders the problem blocks about half of the time, a much lower rate of success when compared to prior work on more free-form programming tasks. Regarding prompts, we find that small variations in prompting have a noticeable effect on model performance, although the effect is not as pronounced as between different problems.
引用
收藏
页码:299 / 305
页数:7
相关论文
共 43 条
[1]   Programming Is Hard - Or at Least It Used to Be: Educational Opportunities and Challenges of AI Code Generation [J].
Becker, Brett A. ;
Denny, Paul ;
Finnie-Ansley, James ;
Luxton-Reilly, Andrew ;
Prather, James ;
Santos, Eddie Antonio .
PROCEEDINGS OF THE 54TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, VOL 1, SIGCSE 2023, 2023, :500-506
[2]   The Code Mangler: Evaluating Coding Ability Without Writing any Code [J].
Cheng, Nick ;
Harrington, Brian .
PROCEEDINGS OF THE 2017 ACM SIGCSE TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION (SIGCSE'17), 2017, :123-128
[3]  
Denny P., 2022, arXiv
[4]  
Denny P., 2008, P 4 INT WORKSHOP COM, P113
[5]  
Denny P, 2022, Arxiv, DOI arXiv:2210.15157
[6]   A Review of Research on Parsons Problems [J].
Du, Yuemeng ;
Luxton-Reilly, Andrew ;
Denny, Paul .
PROCEEDINGS OF THE TWENTY-SECOND AUSTRALASIAN COMPUTING EDUCATION CONFERENCE, ACE'20, 2020, :195-202
[7]  
Ericson Barbara J., 2022, ITiCSE-WGR '22: Proceedings of the 2022 Working Group Reports on Innovation and Technology in Computer Science Education, P191, DOI 10.1145/3571785.3574127
[8]  
Ericson B. J., 2015, ACM C INT COMP ED RE, P169, DOI [DOI 10.1145/2787622.2787731, 10.1145/2787622.2787731]
[9]   Adaptive Parsons Problems as Active Learning Activities During Lecture [J].
Ericson, Barbara ;
Haynes-Magyar, Carl .
PROCEEDINGS OF THE 27TH ACM CONFERENCE ON INNOVATION AND TECHNOLOGY IN COMPUTER SCIENCE EDUCATION, ITICSE 2022, VOL 1, 2022, :290-296
[10]   Evaluating the Efficiency and Effectiveness of Adaptive Parsons Problems [J].
Ericson, Barbara J. ;
Foley, James D. ;
Rick, Jochen .
ICER'18: PROCEEDINGS OF THE 2018 ACM CONFERENCE ON INTERNATIONAL COMPUTING EDUCATION RESEARCH, 2018, :60-68