Feature Enhanced Capsule Networks for Robust Automatic Essay Scoring

被引:8
作者
Sharma, Arushi [1 ,3 ]
Kabra, Anubha [2 ,3 ]
Kapoor, Rajiv [3 ]
机构
[1] Optum Global Advantage, Delhi, India
[2] Adobe Syst, Noida, India
[3] Delhi Technol Univ, Delhi, India
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: APPLIED DATA SCIENCE TRACK, PT V | 2021年 / 12979卷
关键词
Automatic scoring; Capsule Neural Networks; Adversarial testing; BERT; Machine learning;
D O I
10.1007/978-3-030-86517-7_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Essay Scoring (AES) Engines have gained popularity amongst a multitude of institutions for scoring test-taker's responses and therefore witnessed rising demand in recent times. However, several studies have demonstrated that the adversarial attacks severely hamper existing state-of-the-art AES Engines' performance. As a result, we propose a robust architecture for AES systems that leverages Capsule Neural Networks, contextual BERT-based text representation, and key textually extracted features. This end-to-end pipeline captures semantics, coherence, and organizational structure along with fundamental rule-based features such as grammatical and spelling errors. The proposed method is validated by extensive experimentation and comparison with the state-of-the-art baseline models. Our results demonstrate that this approach performs significantly better on 6 out of 8 prompts on the Automated Student Assessment Prize (ASAP) dataset. In addition, it shows an overall best performance with a Quadratic Weighted Kappa (QWK) metric of 81%. Moreover, we empirically demonstrate that it is successful in identifying adversarial responses and scoring them lower.
引用
收藏
页码:365 / 380
页数:16
相关论文
共 32 条
[21]  
Sabour S, 2017, ADV NEUR IN, V30
[22]   BERT-Caps: A Transformer-Based Capsule Network for Tweet Act Classification [J].
Saha, Tulika ;
Ramesh Jayashree, Srivatsa ;
Saha, Sriparna ;
Bhattacharyya, Pushpak .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (05) :1168-1179
[23]  
Somasundaran S., 2014, P 25 INT C COMPUTATI, P950
[24]  
Taghipour K., 2016, A neural approach to automated essay scoring, DOI [DOI 10.18653/V1/D16-1193, 10.18653/v1/D16-1193]
[25]  
Tay Y, 2018, AAAI CONF ARTIF INTE, P5948
[26]  
Wang YC, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P791
[27]  
Yang Ruosong, 2020, P EMPIRICAL METHODS, P1560
[28]  
Yannakoudakis Helen, 2011, P 49 ANN M ASS COMPU, P180
[29]  
Zesch T., 2015, P 10 WORKSH INN US N, P224, DOI DOI 10.3115/V1/W15-0626
[30]   A Memory-Augmented Neural Model for Automated Grading [J].
Zhao, Siyuan ;
Zhang, Yaqiong ;
Xiong, Xiaolu ;
Botelho, Anthony ;
Heffernan, Neil .
PROCEEDINGS OF THE FOURTH (2017) ACM CONFERENCE ON LEARNING @ SCALE (L@S'17), 2017, :189-192