Generating content for large-scale reading comprehension assessments, particularly stories, demands substantial money, time, and effort. To serve as effective assessment tools, comprehension stories must adhere to specific criteria defined in early grade reading assessment standards, which dictate narrative structure, character types, readability levels, and other elements. Recently, Natural Language Processing (NLP) techniques, mainly Large Language Models (LLMs), have been used to automate the story generation process. One key challenge is ensuring diversity across the many generated stories. For example, early reading standards such as the Early Grade Reading Assessment (EGRA) require comprehension stories to be similar in difficulty level but narratively different across multiple implementations to ensure fairness. This paper proposes a solution that increases diversity across stories by employing GPT-4 to generate children's reading comprehension stories mediated by a database of classic tales. By leveraging existing narratives, this method drastically reduces the resources required for content creation while ensuring alignment with the EGRA criteria. We present a systematic framework for selecting, adapting, and evaluating the stories, aiming to streamline the content generation process. The generated stories are further evaluated for reliability using well-defined text metrics and by a human evaluator. Our findings underscore the potential of integrating GPT-4 with classic tales to optimize resources and enhance scalability in literacy assessment practices, offering practical implications for educators and policymakers in early grade learning.
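As a rough illustration of the kind of pipeline the abstract describes, the sketch below adapts a classic tale with GPT-4 and screens the result with a standard readability metric. The prompt wording, the target grade band, and the choice of metric (Flesch-Kincaid grade via the `textstat` package) are our assumptions for illustration, not the paper's actual framework or evaluation criteria.

```python
# Minimal sketch: adapt a classic tale into an early-grade story, then
# screen it with a readability metric. Prompt text, grade band, and
# metric choice are illustrative assumptions, not the paper's method.
from openai import OpenAI
import textstat

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def adapt_tale(tale_text: str, max_words: int = 60) -> str:
    """Ask GPT-4 to rewrite a classic tale as an EGRA-style story."""
    prompt = (
        "Rewrite the following classic tale as a short reading "
        f"comprehension story for early grade readers (about {max_words} "
        "words), keeping a simple narrative structure and familiar "
        f"character types:\n\n{tale_text}"
    )
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content


def within_grade_band(story: str, lo: float = 1.0, hi: float = 3.0) -> bool:
    """Check that a story's readability falls in a (hypothetical) grade band."""
    grade = textstat.flesch_kincaid_grade(story)
    return lo <= grade <= hi


tale = "Once upon a time, a tortoise challenged a hare to a race..."
story = adapt_tale(tale)
print(story)
print("Within target band:", within_grade_band(story))
```

Grounding generation in an existing tale, as above, is one way to keep difficulty comparable across outputs while the source narratives supply the diversity; automated metrics then act only as a first filter before human review.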