Automatic Exam Correction Framework (AECF) for the MCQs, Essays, and Equations Matching

被引:21
作者
Balaha, Hossam Magdy [1 ]
Saafan, Mahmoud M. [1 ]
机构
[1] Mansoura Univ, Fac Engn, Comp & Syst Engn Dept, Mansoura 35516, Egypt
关键词
Semantics; Mathematical model; Bit error rate; Feature extraction; Training; Task analysis; Tokenization; Automatic exam correction; document embedding; expression trees; MCQ matching; word embedding; WORD2VEC; PREFIX;
D O I
10.1109/ACCESS.2021.3060940
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic grading requires the adaption of the latest technologies. It has become essential especially when most of the courses became online courses (MOOCs). The objectives of the current work are (1) Reviewing the literature on the text semantic similarity and automatic exam correction systems, (2) Proposing an automatic exam correction framework (HMB-AECF) for MCQs, essays, and equations that is abstracted into five layers, (3) Suggesting equations similarity checker algorithm named "HMB-MMS-EMA", (4) Presenting an expression matching dataset named "HMB-EMD-v1", (5) Comparing the different approaches to convert textual data into numerical data (Word2Vec, FastText, Glove, and Universal Sentence Encoder (USE)) using three well-known Python packages (Gensim, SpaCy, and NLTK), and (6) Comparing the proposed equations similarity checker algorithm (HMB-MMS-EMA) with a Python package (SymPy) on the proposed dataset (HMB-EMD-v1). Eight experiments were performed on the Quora Questions Pairs and the UNT Computer Science Short Answer datasets. The best-achieved highest accuracy in the first four experiments was 77.95% without fine-tuning the pre-trained models by the USE. The best-achieved lowest root mean square error (RMSE) in the second four experiments was 1.09 without fine-tuning the used pre-trained models by the USE. The proposed equations similarity checker algorithm (HMB-MMS-EMA) reported 100% accuracy over the SymPy Python package which reported 71.33% only on "HMB-EMD-v1".
引用
收藏
页码:32368 / 32389
页数:22
相关论文
共 61 条
[1]  
Al-Shammari E. T, 2013, U.S. Patent, Patent No. [8 473 279, 8473279]
[2]  
[Anonymous], 2006, AAAI
[3]  
[Anonymous], 2017, ESPACY IND STRENGTH
[4]  
Araki K., 2020, ARXIV201012077
[5]   A semantic similarity-based perspective of affect lexicons for sentiment analysis [J].
Araque, Oscar ;
Zhu, Ganggao ;
Iglesias, Carlos A. .
KNOWLEDGE-BASED SYSTEMS, 2019, 165 :346-359
[6]   Expression-tree-based algorithms for code compression on embedded RISC architectures [J].
Araujo, G ;
Centoducatte, P ;
Azevedo, R ;
Pannain, R .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2000, 8 (05) :530-533
[7]   Improving the accuracy using pre-trained word embeddings on deep neural networks for Turkish text classification [J].
Aydogan, Murat ;
Karci, Ali .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 541
[8]   Comparison of term frequency and document frequency based feature selection metrics in text categorization [J].
Azam, Nouman ;
Yao, JingTao .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) :4760-4768
[9]   An efficient recommendation generation using relevant Jaccard similarity [J].
Bag, Sujoy ;
Kumar, Sri Krishna ;
Tiwari, Manoj Kumar .
INFORMATION SCIENCES, 2019, 483 :53-64
[10]  
Balakrishnan V., 2014, P SCEI SEOUL C SEOUL