Neural Automated Essay Scoring for Improved Confidence Estimation and Score Prediction Through Integrated Classification and Regression

被引：0

作者：

Uto, Masaki ^{[1
]}

Takahashi, Yuto ^{[1
]}

机构：

[1] Univ Electrocommun, Tokyo, Japan

来源：

ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, PT I | 2024年 / 2150卷

关键词：

Automated essay scoring; reliability; confidence estimation; human-in-the-loop; educational measurement;

D O I：

10.1007/978-3-031-64315-6_42

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Essay writing questions are a type of constructed-response question commonly used in educational assessments. However, substantial scoring costs and reduced evaluation reliability due to rater biases can be problematic, especially in large-scale assessments. To overcome these challenges, automated essay-scoring models utilizing machine learning technologies have gained significant attention. In recent years, scoring models based on deep neural networks have achieved high accuracy. However, even highly accurate neural models still suffer from scoring errors, limiting their adoption in high-stakes assessments. To address this issue, recent studies have explored scoring models that provide confidence levels along with score predictions. This study proposes improvements to the latest confidence-predicting scoring model, enhancing its performance in both confidence estimation and score prediction.

引用

页码：444 / 451

页数：8

共 13 条

[1] Abosalem Y., 2016, International Journal of Secondary Education, V4, P1, DOI DOI 10.11648/J.IJSEDU.20160401.11
[2] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[3] Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture
Eigen, David
Fergus, Rob
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2650 - 2658
[4] Balancing Cost and Quality: An Exploration of Human-in-the-Loop Frameworks for Automated Short Answer Scoring
Funayama, Hiroaki
Sato, Tasuku
Matsubayashi, Yuichiroh
Mizumoto, Tomoya
Suzuki, Jun
Inui, Kentaro
[J]. ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, 2022, 13355 : 465 - 476
[5] Jiang Heinrich, 2018, Advances in neural information processing systems, V31
[6] Ke ZX, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P6300
[7] Learning Multiple Pixelwise Tasks Based on Loss Scale Balancing
Lee, Jae-Han
Lee, Chul
Kim, Chang-Su
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5087 - 5096
[8] Enhanced hybrid neural network for automated essay scoring
Li, Xia
Yang, Huali
Hu, Shengze
Geng, Jing
Lin, Keke
Li, Yuhai
[J]. EXPERT SYSTEMS, 2022, 39 (10)
[9] Nadeem F, 2019, INNOVATIVE USE OF NLP FOR BUILDING EDUCATIONAL APPLICATIONS, P484
[10] Uto Masaki, 2021, Behaviormetrika, V48, P459, DOI [10.1007/s41237-021-00142-y, DOI 10.1007/S41237-021-00142-Y]

← 1 2 →