Reducing Annotation Effort in Automatic Essay Evaluation Using Locality Sensitive Hashing

被引:1
|
作者
Tashu, Tsegaye Misikir [1 ,2 ]
Szabo, David [1 ]
Horvath, Tomas [1 ,3 ]
机构
[1] Eotvos Lorand Univ, Telekom Innovat Labs, Dept Data Sci & Engn, Fac Informat, H-1117 Budapest, Hungary
[2] Eotvos Lorand Univ, Fac Informat, 3in Res Grp, Martonvasar, Hungary
[3] Pavol Jozef Safarik Univ, Inst Comp Sci, Fac Sci, Jesenna 5, Kosice 04001, Slovakia
来源
INTELLIGENT TUTORING SYSTEMS (ITS 2019) | 2019年 / 11528卷
关键词
Locality Sensitive Hashing; Automatic essay scoring; Similarity search;
D O I
10.1007/978-3-030-22244-4_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated essay evaluation systems use machine learning models to predict the score for an essay. For such, a training essay set is required which is usually created by human requiring time-consuming effort. Popular choice for scoring is a nearest neighbor model which requires on-line computation of nearest neighbors to a given essay. This is, however, a time-consuming task. In this work, we propose to use locality sensitive hashing that helps to select a small subset of a large set of essays such that it will likely contain the nearest neighbors for a given essay. We provided experiments on real-world data sets provided by Kaggle. According to the experimental results, it is possible to achieve good performance on scoring by using the proposed approach. The proposed approach is efficient with regard to time complexity. Also, it works well in case of a small number of training essays labeled by human and gives comparable results to the case when a large essay sets are used.
引用
收藏
页码:186 / 192
页数:7
相关论文
共 42 条
  • [21] Fast agglomerative hierarchical clustering algorithm using Locality-Sensitive Hashing
    Koga, Hisashi
    Ishibashi, Tetsuo
    Watanabe, Toshinori
    KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 12 (01) : 25 - 53
  • [22] Using Locality-Sensitive Hashing for SVM Classification of Large Data Sets
    Gonzalez-Lima, Maria D.
    Ludena, Carenne C.
    MATHEMATICS, 2022, 10 (11)
  • [23] Image super resolution using distributed locality sensitive hashing for manifold learning
    Tripathi, Anurag
    Gupta, Abhinav
    Chaudhury, Santanu
    Singh, Arun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 25673 - 25684
  • [24] Query by Humming by Using Locality Sensitive Hashing based on Combination of Pitch and Note
    Wang, Qiang
    Guo, Zhiyuan
    Liu, Gang
    Guo, Jun
    Lu, Yueming
    2012 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2012, : 302 - 307
  • [25] Fast agglomerative hierarchical clustering algorithm using Locality-Sensitive Hashing
    Hisashi Koga
    Tetsuo Ishibashi
    Toshinori Watanabe
    Knowledge and Information Systems, 2007, 12 : 25 - 53
  • [26] Estimating user response rate using locality sensitive hashing in search marketing
    Maryam Almasharawi
    Ahmet Bulut
    Electronic Commerce Research, 2022, 22 : 37 - 51
  • [27] Image super resolution using distributed locality sensitive hashing for manifold learning
    Anurag Tripathi
    Abhinav Gupta
    Santanu Chaudhury
    Arun Singh
    Multimedia Tools and Applications, 2019, 78 : 25673 - 25684
  • [28] Chrysanthemum Petal Similarity Evaluation Based on Multi-probe Locality Sensitive Hashing
    Yuan P.
    Zhai Z.
    Qian S.
    Xu H.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2019, 50 (07): : 208 - 215
  • [29] Estimating user response rate using locality sensitive hashing in search marketing
    Almasharawi, Maryam
    Bulut, Ahmet
    ELECTRONIC COMMERCE RESEARCH, 2022, 22 (01) : 37 - 51
  • [30] Hardware acceleration of k-mer clustering using locality-sensitive hashing
    Soto, Javier E.
    Krohmer, Thomas
    Hernandez, Cecilia
    Figueroa, Miguel
    2019 22ND EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2019, : 659 - 662