Automatic Essay Grading for Bahasa Indonesia with Support Vector Machine and Latent Semantic Analysis

被引:0
作者
Ratna, Anak Agung Putri [1 ]
Khairunissa, Hanifah [1 ]
Kaltsum, Aaliyah [1 ]
Ibrahim, Ihsan [1 ]
Purnamasari, Prima Dewi [1 ]
机构
[1] Univ Indonesia, Fac Engn, Dept Elect Engn, Depok, Indonesia
来源
2019 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (ICECOS 2019) | 2019年
关键词
essay grading; e-learning; Japanese language; latent semantic analysis; term frequency-inverse document frequency; support vector machine;
D O I
10.1109/icecos47637.2019.8984528
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This research is used to increase the accuracy of automatic short essay grading for Bahasa Indonesia. Short essay in Bahasa Indonesia are classified using Support Vector Machine (SVM) based on its topic to decrease unrelated answer then assessed using Latent Semantic Analysis. Term Frequency-Inverse Document Frequency (TF-IDF) is used to weigh word in short essay and the result will be an input on SVM. The output of SVM are assessed using Latent Semantic Analysis (LSA). Latent Semantic Analysis uses Term Frequency Matrix to represent text in matrix, Singular Value Decomposition to decompose these matrix, and Frobenius Norm to find the similarity of lectures' answer and students' answer In this research, parameter C value as 1 and kernel linear are used to obtain the highest accuracy of classification using Support Vector Machine, 97,297% with 50% portion of data as training and 50% portion of data as testing. The accuracy score obtained from LSA is 72,01%.
引用
收藏
页码:363 / 367
页数:5
相关论文
共 50 条
[21]   Automatic document classification based on latent semantic analysis [J].
I. Kuralenok ;
I. Nekrest'yanov .
Programming and Computer Software, 2000, 26 :199-206
[22]   Automatic text summarization using latent semantic analysis [J].
I. V. Mashechkin ;
M. I. Petrovskiy ;
D. S. Popov ;
D. V. Tsarev .
Programming and Computer Software, 2011, 37 :299-305
[23]   Answer Categorization Method Using K-Means for Indonesian Language Automatic Short Answer Grading System Based on Latent Semantic Analysis [J].
Ratna, Anak Agung Putri ;
Wulandari, Naiza Astri ;
Kaltsum, Aaliyah ;
Ibrahim, Ihsan ;
Purnamasari, Prima Dewi .
2019 16TH INTERNATIONAL CONFERENCE ON QUALITY IN RESEARCH (QIR) / INTERNATIONAL SYMPOSIUM ON ELECTRICAL AND COMPUTER ENGINEERING, 2019, :110-114
[24]   Automatic Text Summarization Using Latent Semantic Analysis [J].
Mashechkin, I. V. ;
Petrovskiy, M. I. ;
Popov, D. S. ;
Tsarev, D. V. .
PROGRAMMING AND COMPUTER SOFTWARE, 2011, 37 (06) :299-305
[25]   Comparison of Latent Semantic Analysis and Vector Space Model for Automatic Identification of Competent Reviewers to Evaluate Papers [J].
Kalmukov, Yordan .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (02) :77-85
[26]   Automatic document classification based on latent semantic analysis [J].
Kuralenok, I ;
Nekrest'yanov, I .
PROGRAMMING AND COMPUTER SOFTWARE, 2000, 26 (04) :199-206
[27]   Automatic signal detection based on support vector machine [J].
王海军 ;
刘贵忠 .
Acta Seismologica Sinica(English Edition), 2007, (01) :88-97
[28]   AUTOMATIC TEXT SUMMARIZATION USING SUPPORT VECTOR MACHINE [J].
Begum, Nadira ;
Fattah, Mohamed Abdel ;
Ren, Fuji .
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (07) :1987-1996
[29]   Automatic signal detection based on support vector machine [J].
Wang, Hai-jun ;
Liu, Gui-zhong .
EARTHQUAKE SCIENCE, 2007, 20 (01) :88-97
[30]   Research on automatic fingerprint classification based on support vector machine [J].
Guo, Lei ;
Wu, Youxi ;
Wu, Qing ;
Yan, Weili ;
Shen, Xueqin .
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, :4093-+