Content-Based Similarity for Automatic Scoring of Handwritten Descriptive Answers

被引:0
作者
Nghia Thanh Truong [1 ]
Hung Tuan Nguyen [1 ]
Ly, Nam Tuan [1 ]
Horie, Toshihiko [2 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Tokyo, Japan
[2] Wacom Co Ltd, Saitama, Japan
来源
DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT II | 2024年 / 14805卷
关键词
handwriting recognition; automatic scoring; deep neural networks; ensemble recognition; TEXT;
D O I
10.1007/978-3-031-70536-6_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces content-based similarity for automatic scoring of handwritten descriptive answers, focusing on Japanese, English, and mathematical expressions. Our experiments were made on a collection of handwritten descriptive answers from elementary school students, encompassing 37,500 Japanese, 15,896 English, and 86,264 math answers. We used neural network-based online and offline handwriting recognizers for each answer and applied automatic scoring of recognized candidates with expected answers. In the initial experiment, we applied a perfect match with expected answers, revealing issues and challenges, especially with the rate of correct answers scored as wrong (false negatives) exceeding 30% in some subjects. Then, we propose a recognition confidence-based rejection scheme to reduce false positives. Moreover, we propose content-awareness similarity that calculates a similarity between the recognized candidates of an answer and the expected answers. According to the computed similarity, it scores the answers as correct, wrong, or rejected. Human scorers should score false negative answers that are likely claimed by students and rejected answers. The experiment suggests that human scorers need to score 14.39% after applying the automatic scoring method with the rate of incorrect answers scored correct of 3.03% for Japanese, the former as 10.75% and the latter as 1.79% for English, and the former as 27.34% and the latter as 0.45% for math. These promising results underscore the system's effectiveness.
引用
收藏
页码:268 / 281
页数:14
相关论文
共 26 条
[1]  
[Anonymous], 2007, P 1 INT WORKSH PEN B, DOI [10.5555/1338440, DOI 10.5555/1338440]
[2]   The Eras and Trends of Automatic Short Answer Grading [J].
Burrows, Steven ;
Gurevych, Iryna ;
Stein, Benno .
INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2015, 25 (01) :60-117
[3]   CNN based spatial classification features for clustering offline handwritten mathematical expressions [J].
Cuong Tuan Nguyen ;
Vu Tran Minh Khuong ;
Hung Tuan Nguyen ;
Nakagawa, Masaki .
PATTERN RECOGNITION LETTERS, 2020, 131 :113-120
[4]  
Nguyen CT, 2016, INT CONF FRONT HAND, P246, DOI [10.1109/ICFHR.2016.0055, 10.1109/ICFHR.2016.35]
[5]   The ASSISTments Ecosystem: Building a Platform that Brings Scientists and Teachers Together for Minimally Invasive Research on Human Learning and Teaching [J].
Heffernan, Neil T. ;
Heffernan, Cristina Lindquist .
INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2014, 24 (04) :470-497
[6]   Online Japanese Handwriting Recognizers using Recurrent Neural Networks [J].
Hung Tuan Nguyen ;
Cuong Tuan Nguyen ;
Nakagawa, Masaki .
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, :435-440
[7]  
Khuong V.T.M., A Synthetic Dataset for Clustering Handwritten Math Expression TUAT (Dset_Mix)
[8]  
Koile K., 2007, 1 INT WORKSH PEN BAS, P1, DOI [10.1109/PLT.2007.24, DOI 10.1109/PLT.2007.24]
[9]  
Koyama K., 2010, E-Learn 2010, P1073
[10]   IAM-OnDB - an on-line English sentence database acquired from handwritten text on a whiteboard [J].
Liwicki, M ;
Bunke, H .
EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, :956-961