LLM-CDM: A Large Language Model Enhanced Cognitive Diagnosis for Intelligent Education

被引:0
作者
Chen, Xin [1 ]
Zhang, Jin [1 ]
Zhou, Tong [2 ]
Zhang, Feng [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Civil Engn & Architecture, Qingdao 266590, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Education; Large language models; Annotations; Accuracy; Semantics; Prompt engineering; Printers; Optimization; Manuals; Long short term memory; Cognitive diagnosis; large language models; exercise texts; higher education and intelligent education;
D O I
10.1109/ACCESS.2025.3549309
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cognitive diagnosis is a key component of intelligent education to assess students' comprehension of specific knowledge concepts. Current methodologies predominantly rely on students' historical performance records and manually annotated knowledge concepts for analysis. However, the extensive semantic information embedded in exercises, including latent knowledge concepts, has not been fully utilized. This paper presents a novel cognitive diagnosis model based on the LLAMA3-70B framework (referred to as LLM-CDM), which integrates prompt engineering with the rich semantic information inherent in exercise texts to uncover latent knowledge concepts and improve diagnostic accuracy. Specifically, this study first inputs exercise texts into a large language model and develops an innovative prompting method to facilitate deep mining of implicit knowledge concepts within these texts by the model. Following the integration of these newly extracted knowledge concepts into the existing Q matrix, this paper employs a neural network to diagnose students' understanding of knowledge concepts while applying the monotonicity assumption to ensure the interpretability of model factors. Experimental results from an examination data set for course completion assessments demonstrate that LLM-CDM exhibits superior performance in both accuracy and explainability.
引用
收藏
页码:47165 / 47180
页数:16
相关论文
共 52 条
[1]  
AI@Meta, 2024, Llama 3 model card
[2]  
Al Faraby Said, 2024, Comput. Educ. Artif. Intell, V7, DOI [10.1016/j.caeai.2024.100298, DOI 10.1016/J.CAEAI.2024.100298]
[3]   The emergent role of artificial intelligence, natural learning processing, and large language models in higher education and research [J].
Alqahtani, Tariq ;
Badreldin, Hisham A. ;
Alrashed, Mohammed ;
Alshaya, Abdulrahman I. ;
Alghamdi, Sahar S. ;
bin Saleh, Khalid ;
Alowais, Shuroug A. ;
Alshaya, Omar A. ;
Rahman, Ishrat ;
Al Yami, Majed S. ;
Albekairy, Abdulkareem M. .
RESEARCH IN SOCIAL & ADMINISTRATIVE PHARMACY, 2023, 19 (08) :1236-1242
[4]  
Bahrini A, 2023, Arxiv, DOI [arXiv:2304.09103, 10.48550/arXiv.2304.09103, DOI 10.48550/ARXIV.2304.09103, 10.48550/arxiv.2304.09103]
[5]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[6]   DIRT: Deep Learning Enhanced Item Response Theory for Cognitive Diagnosis [J].
Cheng, Song ;
Liu, Qi ;
Chen, Enhong ;
Huang, Zai ;
Huang, Zhenya ;
Chen, Yuying ;
Ma, Haiping ;
Hu, Guoping .
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, :2397-2400
[7]   Finance-specific large language models: Advancing sentiment analysis and return prediction with LLaMA 2 [J].
Chiu, I. -Chan ;
Hung, Mao-Wei .
PACIFIC-BASIN FINANCE JOURNAL, 2025, 90
[8]   miRTarBase 2025: updates to the collection of experimentally validated microRNA-target interactions [J].
Cui, Shidong ;
Yu, Sicong ;
Huang, Hsi-Yuan ;
Lin, Yang-Chi-Dung ;
Huang, Yixian ;
Zhang, Bojian ;
Xiao, Jihan ;
Zuo, Huali ;
Wang, Jiayi ;
Li, Zhuoran ;
Li, Guanghao ;
Ma, Jiajun ;
Chen, Baiming ;
Zhang, Haoxuan ;
Fu, Jiehui ;
Wang, Liang ;
Huang, Hsien-Da .
NUCLEIC ACIDS RESEARCH, 2024, 53 (D1) :D147-D156
[9]   DINA Model and Parameter Estimation: A Didactic [J].
de la Torre, Jimmy .
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2009, 34 (01) :115-130
[10]  
DiBello LV, 2007, HANDB STAT, V26, P979, DOI 10.1016/S0169-7161(06)26031-0