Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment

被引:20
|
作者
Zhou, Todd [1 ]
Jiao, Hong [2 ]
机构
[1] Winston Churchill High Sch, Potomac, MD USA
[2] Univ Maryland, College Pk, MD 20742 USA
关键词
cheating detection; stacking; machine learning; ensemble learning algorithms; response time; resampling; oversampling; SMOTE; under-sampling; dual resampling; data augmentation; ITEM PREKNOWLEDGE; MODEL;
D O I
10.1177/00131644221117193
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
Cheating detection in large-scale assessment received considerable attention in the extant literature. However, none of the previous studies in this line of research investigated the stacking ensemble machine learning algorithm for cheating detection. Furthermore, no study addressed the issue of class imbalance using resampling. This study explored the application of the stacking ensemble machine learning algorithm to analyze the item response, response time, and augmented data of test-takers to detect cheating behaviors. The performance of the stacking method was compared with that of two other ensemble methods (bagging and boosting) as well as six base non-ensemble machine learning algorithms. Issues related to class imbalance and input features were addressed. The study results indicated that stacking, resampling, and feature sets including augmented summary data generally performed better than its counterparts in cheating detection. Compared with other competing machine learning algorithms investigated in this study, the meta-model from stacking using discriminant analysis based on the top two base models-Gradient Boosting and Random Forest-generally performed the best when item responses and the augmented summary statistics were used as the input features with an under-sampling ratio of 10:1 among all the study conditions.
引用
收藏
页码:831 / 854
页数:24
相关论文
共 50 条
  • [1] StackBRAF: A Large-Scale Stacking Ensemble Learning for BRAF Affinity Prediction
    Syahid, Nur Fadhilah
    Weerapreeyakul, Natthida
    Srisongkram, Tarapong
    ACS OMEGA, 2023, 8 (23): : 20881 - 20891
  • [2] An Ensemble Learning Platform for the Large-Scale Exploration of New Double Perovskites
    Wang, Zhilong
    Han, Yanqiang
    Lin, Xirong
    Cai, Junfei
    Wu, Sicheng
    Li, Jinjin
    ACS APPLIED MATERIALS & INTERFACES, 2022, 14 (01) : 717 - 725
  • [3] A Universal Machine Learning Algorithm for Large-Scale Screening of Materials
    Fanourgakis, George S.
    Gkagkas, Konstantinos
    Tylianakis, Emmanuel
    Froudakis, George E.
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2020, 142 (08) : 3814 - 3822
  • [4] An ensemble bat algorithm for large-scale optimization
    Cai, Xingjuan
    Zhang, Jiangjiang
    Liang, Hao
    Wang, Lei
    Wu, Qidi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (11) : 3099 - 3113
  • [5] An ensemble bat algorithm for large-scale optimization
    Xingjuan Cai
    Jiangjiang Zhang
    Hao Liang
    Lei Wang
    Qidi Wu
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 3099 - 3113
  • [6] Ensemble Learning for Large-Scale Workload Prediction
    Singh, Nidhi
    Rao, Shrisha
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (02) : 149 - 165
  • [7] DendroMap: Visual Exploration of Large-Scale Image Datasets for Machine Learning with Treemaps
    Bertucci D.
    Hamid M.M.
    Anand Y.
    Ruangrotsakun A.
    Tabatabai D.
    Perez M.
    Kahng M.
    IEEE Transactions on Visualization and Computer Graphics, 2023, 29 (01) : 320 - 330
  • [8] A Survey on Large-Scale Machine Learning
    Wang, Meng
    Fu, Weijie
    He, Xiangnan
    Hao, Shijie
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2574 - 2594
  • [9] Intelligent Assessment for Visual Quality of Streets: Exploration Based on Machine Learning and Large-Scale Street View Data
    Zhao, Jing
    Guo, Qi
    SUSTAINABILITY, 2022, 14 (13)
  • [10] Stacking Ensemble Machine Learning Algorithm with an Application to Heart Disease Prediction
    Fatima, Ruhi
    Kazi, Sabeena
    Tassaddiq, Asifa
    Farhat, Nilofer
    Naaz, Humera
    Jabeen, Sumera
    CONTEMPORARY MATHEMATICS, 2023, 4 (04): : 905 - 925