AI-Generated Test Scripts forWeb E2E Testing with ChatGPT and Copilot: A Preliminary Study

被引：2

作者：

Leotta, Maurizio ^{[1
]}

Yousaf, Hafiz Zeeshan ^{[1
]}

Ricca, Filippo ^{[1
]}

Garcia, Boni ^{[2
]}

机构：

[1] Univ Genoa, DIBRIS, Genoa, Italy

[2] Univ Carlos III Madrid, Madrid, Spain

来源：

PROCEEDINGS OF 2024 28TH INTERNATION CONFERENCE ON EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING, EASE 2024 | 2024年

关键词：

E2E Testing; Test Automation; Selenium WebDriver; GitHub Copilot; ChatGPT; LLM; Empirical Study;

D O I：

10.1145/3661167.3661192

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Automated testing is vital for ensuring the reliability of web applications. This paper presents a preliminary study on leveraging artificial intelligence (AI) models, specifically ChatGPT and Github Copilot, to generate test scripts forweb end-to-end testing. Through experimentation, we evaluated the feasibility and effectiveness of AI language models in generating test scripts based on natural language descriptions of user interactions with web applications. Our preliminary results show that AI-based generation generally provides an advantage over fully manual test scripts development. Starting from test cases clearly defined in Gherkin, a reduction in development time is always observable. In some cases, this reduction is statistically significant (e.g., Manual vs. a particular use of ChatGPT). These results are valid provided that the tester has some skills in manual test script development and is therefore able to modify the code produced by the AI-generation tools. This study contributes to the exploration of AI-driven solutions in web test scripts generation and lays the foundation for future research in this domain.

引用

页码：339 / 344

页数：6

共 21 条

[1]

[Anonymous], 2000, Experimentation in Software Engineering: An Introduction (International Series in Software Engineering

[2]

[Anonymous], 2005, Effect sizes for research: A broad practical approach

[3] Towards the Generation of End-to-End Web Test Scripts from Requirements Specifications [J].

Clerissi, Diego ;

Leotta, Maurizio ;

Reggio, Gianna ;

Ricca, Filippo .

2017 IEEE 25TH INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS (REW), 2017, :343-350

[4] MULTIPLE COMPARISONS AMONG MEANS [J].

DUNN, OJ .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&

[5] Enhancing Web Applications Observability through Instrumented Automated Browsers [J].

Garcia, Boni ;

Ricca, Filippo ;

del Alamo, Jose M. ;

Leotta, Maurizio .

JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 203

[6]

Garcia Boni, 2022, Hands- On Selenium WebDriver with United States

[7] An initial investigation of ChatGPT unit test generation capability [J].

Guilherme, Vitor H. ;

Vincenzi, Auri M. R. .

PROCEEDINGS OF THE 8TH BRAZILIAN SYMPOSIUM ON SYSTEMATIC AND AUTOMATED SOFT-WARE TESTING, SAST 2023, 2023, :15-24

[8]

Inc. GitHub, 2024, GitHub Copilot

[9] An empirical study to compare three web test automation approaches: NLP-based, programmable, and capture&replay [J].

Leotta, Maurizio ;

Ricca, Filippo ;

Marchetto, Alessandro ;

Olianas, Dario .

JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2024, 36 (05)

[10] Invariant-Based Automatic Testing of Modern Web Applications [J].

Mesbah, Ali ;

van Deursen, Arie ;

Roest, Danny .

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2012, 38 (01) :35-53

← 1 2 3 →