Feature Enhanced Capsule Networks for Robust Automatic Essay Scoring

Cited by: 8

Authors:

Sharma, Arushi [1,3]

Kabra, Anubha [2,3]

Kapoor, Rajiv [3]

Affiliations:

[1] Optum Global Advantage, Delhi, India

[2] Adobe Systems, Noida, India

[3] Delhi Technological University, Delhi, India

Source:

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: APPLIED DATA SCIENCE TRACK, PT V | 2021, Vol. 12979

Keywords:

Automatic scoring; Capsule Neural Networks; Adversarial testing; BERT; Machine learning;

DOI:

10.1007/978-3-030-86517-7_23

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Automatic Essay Scoring (AES) engines have gained popularity among a multitude of institutions for scoring test-takers' responses and have therefore seen rising demand in recent times. However, several studies have demonstrated that adversarial attacks severely hamper the performance of existing state-of-the-art AES engines. We therefore propose a robust architecture for AES systems that leverages Capsule Neural Networks, contextual BERT-based text representations, and key features extracted from the text. This end-to-end pipeline captures semantics, coherence, and organizational structure along with fundamental rule-based features such as grammatical and spelling errors. The proposed method is validated by extensive experimentation and comparison with state-of-the-art baseline models. Our results demonstrate that this approach performs significantly better on 6 out of 8 prompts of the Automated Student Assessment Prize (ASAP) dataset. In addition, it shows the best overall performance, with a Quadratic Weighted Kappa (QWK) of 81%. Moreover, we empirically demonstrate that it succeeds in identifying adversarial responses and scoring them lower.
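The QWK figure quoted in the abstract measures agreement between two sets of integer scores, penalizing each disagreement by the squared distance between the scores. The following is a minimal sketch of the standard QWK formula, not the authors' evaluation code; the function name `quadratic_weighted_kappa` and its parameters are illustrative assumptions.

```python
def quadratic_weighted_kappa(rater_a, rater_b, min_rating, max_rating):
    """Standard quadratic weighted kappa for two lists of integer scores.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("score lists must be non-empty and of equal length")
    if max_rating <= min_rating:
        raise ValueError("max_rating must be greater than min_rating")

    n = max_rating - min_rating + 1
    total = len(rater_a)

    # Observed confusion matrix: observed[i][j] counts essays scored
    # i by rater A and j by rater B (offset by min_rating).
    observed = [[0] * n for _ in range(n)]
    for a, b in zip(rater_a, rater_b):
        observed[a - min_rating][b - min_rating] += 1

    # Marginal score histograms for each rater.
    hist_a = [sum(row) for row in observed]
    hist_b = [sum(observed[i][j] for i in range(n)) for j in range(n)]

    numerator = 0.0
    denominator = 0.0
    for i in range(n):
        for j in range(n):
            # Quadratic disagreement weight, 0 on the diagonal, 1 at the extremes.
            weight = (i - j) ** 2 / (n - 1) ** 2
            # Count expected under independence of the two raters.
            expected = hist_a[i] * hist_b[j] / total
            numerator += weight * observed[i][j]
            denominator += weight * expected

    if denominator == 0.0:
        # Both raters gave one identical score to every essay.
        return 1.0
    return 1.0 - numerator / denominator


if __name__ == "__main__":
    human = [1, 2, 3, 4, 4, 2]
    model = [1, 2, 3, 4, 3, 2]
    print(quadratic_weighted_kappa(human, model, 1, 4))
```

A value of 1 indicates perfect agreement, 0 indicates agreement no better than chance, and negative values indicate systematic disagreement.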


Pages: 365-380

Number of pages: 16

