HiTSKT: A hierarchical transformer model for session-aware knowledge tracing

被引:8
作者
Ke, Fucai [1 ]
Wang, Weiqing [1 ]
Tan, Weicong [1 ]
Du, Lan [1 ]
Jin, Yuan [1 ]
Huang, Yujin [1 ]
Yin, Hongzhi [2 ]
机构
[1] Monash Univ, Fac Informat Technol, Melbourne, Vic 3800, Australia
[2] Univ Queensland, Fac Engn, Brisbane, Qld 4072, Australia
关键词
User behaviour modelling; Knowledge tracing; Educational data mining; Learner modelling; Hierarchical transformer; RETRIEVAL PRACTICE; MEMORY;
D O I
10.1016/j.knosys.2023.111300
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge tracing (KT) aims to leverage students' learning histories to estimate their mastery levels on a set of pre-defined skills, based on which the corresponding future performance can be accurately predicted. In practice, a student's learning history comprises answers to sets of massed questions, each known as a session, rather than merely being a sequence of independent answers. Theoretically, within and across these sessions, students' learning dynamics can be very different. Therefore, how to effectively model the dynamics of students' knowledge states within and across the sessions is crucial for handling the KT problem. Most existing KT models treat student's learning records as a single continuing sequence, without capturing the sessional shift of students' knowledge state. To address the above issue, we propose a novel hierarchical transformer model, named HiTSKT, comprises an interaction(-level) encoder to capture the knowledge a student acquires within a session, and a session(-level) encoder to summarize acquired knowledge across the past sessions. To predict an interaction in the current session, a knowledge retriever integrates the summarized past-session knowledge with the previous interactions' information into proper knowledge representations. These representations are then used to compute the student's current knowledge state. Additionally, to model the student's long-term forgetting behaviour across the sessions, a power-law-decay attention mechanism is designed and deployed in the session encoder, allowing it to emphasize more on the recent sessions. Extensive experiments on four public datasets demonstrate that HiTSKT achieves new state-of-the-art performance on all the datasets compared with seven state-of-the-art KT models.
引用
收藏
页数:13
相关论文
共 56 条
  • [1] Deep Graph Memory Networks for Forgetting-Robust Knowledge Tracing
    Abdelrahman, Ghodai
    Wang, Qing
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 7844 - 7855
  • [2] Knowledge Tracing with Sequential Key-Value Memory Networks
    Abdelrahman, Ghodai
    Wang, Qing
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 175 - 184
  • [3] Benefits from retrieval practice are greater for students with lower working memory capacity
    Agarwal, Pooja K.
    Finley, Jason R.
    Rose, Nathan S.
    Roediger, Henry L., III
    [J]. MEMORY, 2017, 25 (06) : 764 - 771
  • [4] [Anonymous], 2014, The Science of Successful Learning
  • [5] Atkinson R. C., 1968, Psychology of learning and motivation, V2, P89, DOI [10.1016/S0079-7421(08)60422-3, 10.1016/S0079-7421, DOI 10.1016/S0079-7421, DOI 10.1016/S0079-7421(08)60422-3]
  • [6] The form of the forgetting curve and the fate of memories
    Averell, Lee
    Heathcote, Andrew
    [J]. JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2011, 55 (01) : 25 - 35
  • [7] Working Memory: Theories, Models, and Controversies
    Baddeley, Alan
    [J]. ANNUAL REVIEW OF PSYCHOLOGY, VOL 63, 2012, 63 : 1 - 29
  • [8] Baker RSJD, 2008, LECT NOTES COMPUT SC, V5091, P406
  • [9] Bjork R. A., 1988, Practical aspects of memory II, P396
  • [10] Knowledge Tracing Model with Learning and Forgetting Behavior
    Chen, Mingzhi
    Guan, Quanlong
    He, Yizhou
    He, Zhenyu
    Fang, Liangda
    Luo, Weiqi
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3863 - 3867