Neural Automated Essay Scoring for Improved Confidence Estimation and Score Prediction Through Integrated Classification and Regression

被引:0
作者
Uto, Masaki [1 ]
Takahashi, Yuto [1 ]
机构
[1] Univ Electrocommun, Tokyo, Japan
来源
ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, PT I | 2024年 / 2150卷
关键词
Automated essay scoring; reliability; confidence estimation; human-in-the-loop; educational measurement;
D O I
10.1007/978-3-031-64315-6_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Essay writing questions are a type of constructed-response question commonly used in educational assessments. However, substantial scoring costs and reduced evaluation reliability due to rater biases can be problematic, especially in large-scale assessments. To overcome these challenges, automated essay-scoring models utilizing machine learning technologies have gained significant attention. In recent years, scoring models based on deep neural networks have achieved high accuracy. However, even highly accurate neural models still suffer from scoring errors, limiting their adoption in high-stakes assessments. To address this issue, recent studies have explored scoring models that provide confidence levels along with score predictions. This study proposes improvements to the latest confidence-predicting scoring model, enhancing its performance in both confidence estimation and score prediction.
引用
收藏
页码:444 / 451
页数:8
相关论文
共 13 条
  • [1] Abosalem Y., 2016, International Journal of Secondary Education, V4, P1, DOI DOI 10.11648/J.IJSEDU.20160401.11
  • [2] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [3] Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture
    Eigen, David
    Fergus, Rob
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2650 - 2658
  • [4] Balancing Cost and Quality: An Exploration of Human-in-the-Loop Frameworks for Automated Short Answer Scoring
    Funayama, Hiroaki
    Sato, Tasuku
    Matsubayashi, Yuichiroh
    Mizumoto, Tomoya
    Suzuki, Jun
    Inui, Kentaro
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, 2022, 13355 : 465 - 476
  • [5] Jiang Heinrich, 2018, Advances in neural information processing systems, V31
  • [6] Ke ZX, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P6300
  • [7] Learning Multiple Pixelwise Tasks Based on Loss Scale Balancing
    Lee, Jae-Han
    Lee, Chul
    Kim, Chang-Su
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5087 - 5096
  • [8] Enhanced hybrid neural network for automated essay scoring
    Li, Xia
    Yang, Huali
    Hu, Shengze
    Geng, Jing
    Lin, Keke
    Li, Yuhai
    [J]. EXPERT SYSTEMS, 2022, 39 (10)
  • [9] Nadeem F, 2019, INNOVATIVE USE OF NLP FOR BUILDING EDUCATIONAL APPLICATIONS, P484
  • [10] Uto Masaki, 2021, Behaviormetrika, V48, P459, DOI [10.1007/s41237-021-00142-y, DOI 10.1007/S41237-021-00142-Y]