Feature Enhanced Capsule Networks for Robust Automatic Essay Scoring

被引:8
作者
Sharma, Arushi [1 ,3 ]
Kabra, Anubha [2 ,3 ]
Kapoor, Rajiv [3 ]
机构
[1] Optum Global Advantage, Delhi, India
[2] Adobe Syst, Noida, India
[3] Delhi Technol Univ, Delhi, India
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: APPLIED DATA SCIENCE TRACK, PT V | 2021年 / 12979卷
关键词
Automatic scoring; Capsule Neural Networks; Adversarial testing; BERT; Machine learning;
D O I
10.1007/978-3-030-86517-7_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Essay Scoring (AES) Engines have gained popularity amongst a multitude of institutions for scoring test-taker's responses and therefore witnessed rising demand in recent times. However, several studies have demonstrated that the adversarial attacks severely hamper existing state-of-the-art AES Engines' performance. As a result, we propose a robust architecture for AES systems that leverages Capsule Neural Networks, contextual BERT-based text representation, and key textually extracted features. This end-to-end pipeline captures semantics, coherence, and organizational structure along with fundamental rule-based features such as grammatical and spelling errors. The proposed method is validated by extensive experimentation and comparison with the state-of-the-art baseline models. Our results demonstrate that this approach performs significantly better on 6 out of 8 prompts on the Automated Student Assessment Prize (ASAP) dataset. In addition, it shows an overall best performance with a Quadratic Weighted Kappa (QWK) metric of 81%. Moreover, we empirically demonstrate that it is successful in identifying adversarial responses and scoring them lower.
引用
收藏
页码:365 / 380
页数:16
相关论文
共 32 条
[1]  
Alikaniotis D, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P715
[2]  
Attali Y., 2006, J Technol Learn Assess, V4, DOI 10.1002/j.2333-8504.2004.tb01972.x
[3]  
Attali Y., AUTOMATED ESSAY SCOR
[4]  
Chen H., 2013, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, P1741
[5]   Automated Essay Scoring by Capturing Relative Writing Quality [J].
Chen, Hongbo ;
Xu, Jungang ;
He, Ben .
COMPUTER JOURNAL, 2014, 57 (09) :1318-1330
[6]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[7]  
Cozma M, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P503
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]  
Ding Yuning., 2020, Proceedings of the 28th international conference on computational linguistics, P882
[10]  
Farag Y., 2018, ARXIV180406898