Reducing Annotation Effort in Automatic Essay Evaluation Using Locality Sensitive Hashing

Cited by: 1
Authors
Tashu, Tsegaye Misikir [1 ,2 ]
Szabo, David [1 ]
Horvath, Tomas [1 ,3 ]
Affiliations
[1] Eotvos Lorand Univ, Telekom Innovat Labs, Dept Data Sci & Engn, Fac Informat, H-1117 Budapest, Hungary
[2] Eotvos Lorand Univ, Fac Informat, 3in Res Grp, Martonvasar, Hungary
[3] Pavol Jozef Safarik Univ, Inst Comp Sci, Fac Sci, Jesenna 5, Kosice 04001, Slovakia
Source
INTELLIGENT TUTORING SYSTEMS (ITS 2019) | 2019 / Vol. 11528
Keywords
Locality Sensitive Hashing; Automatic essay scoring; Similarity search;
DOI
10.1007/978-3-030-22244-4_23
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automated essay evaluation systems use machine learning models to predict the score of an essay. This requires a training set of essays, which is usually created by human annotators at the cost of considerable time and effort. A popular choice for scoring is a nearest neighbor model, which requires on-line computation of the nearest neighbors of a given essay. This is, however, a time-consuming task. In this work, we propose to use locality sensitive hashing to select a small subset of a large set of essays that is likely to contain the nearest neighbors of a given essay. We conducted experiments on real-world data sets provided by Kaggle. According to the experimental results, the proposed approach can achieve good scoring performance. It is also efficient with regard to time complexity. Moreover, it works well when only a small number of training essays are labeled by humans, giving results comparable to those obtained when large essay sets are used.
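The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which hash family, features, or similarity measure the paper uses, so the MinHash banding scheme, the word-set representation, the Jaccard similarity, and all function names and parameters below are assumptions chosen for illustration. Essays are hashed into buckets once; a new essay is then compared only against the labeled essays that share a bucket with it, and its score is the mean score of the k most similar candidates.

```python
import hashlib
from collections import defaultdict


def tokens(text):
    """Represent an essay as a set of lowercase words (an assumed feature choice)."""
    return set(text.lower().split())


def minhash_signature(token_set, num_hashes=20):
    """MinHash signature: for each seeded hash, keep the minimum over all tokens.
    Assumes the token set is non-empty."""
    return [
        min(int(hashlib.md5(f"{seed}:{tok}".encode()).hexdigest(), 16)
            for tok in token_set)
        for seed in range(num_hashes)
    ]


def band_keys(signature, bands):
    """Split a signature into bands; each band acts as one bucket key."""
    rows = len(signature) // bands
    return [(b, tuple(signature[b * rows:(b + 1) * rows])) for b in range(bands)]


def build_index(essays, num_hashes=20, bands=10):
    """essays maps an id to (text, human_score). Returns bucket key -> set of ids."""
    index = defaultdict(set)
    for essay_id, (text, _score) in essays.items():
        for key in band_keys(minhash_signature(tokens(text), num_hashes), bands):
            index[key].add(essay_id)
    return index


def candidates(index, text, num_hashes=20, bands=10):
    """Ids of labeled essays that share at least one bucket with the query essay."""
    found = set()
    for key in band_keys(minhash_signature(tokens(text), num_hashes), bands):
        found |= index.get(key, set())
    return found


def jaccard(a, b):
    return len(a & b) / len(a | b)


def predict_score(essays, index, text, k=3):
    """Mean score of the k most similar candidates, or None if no bucket matches."""
    cand = candidates(index, text)
    if not cand:
        return None
    query = tokens(text)
    ranked = sorted(cand,
                    key=lambda i: jaccard(query, tokens(essays[i][0])),
                    reverse=True)[:k]
    return sum(essays[i][1] for i in ranked) / len(ranked)


# Hypothetical toy data: id -> (essay text, human-assigned score).
essays = {
    "a": ("computers help students learn faster and share ideas", 4.0),
    "b": ("computers help students learn faster and waste time", 3.0),
    "c": ("libraries should remove offensive books from shelves", 2.0),
}
index = build_index(essays)
```

With this setup, `predict_score(essays, index, new_text)` only ranks the essays returned by `candidates`, so the cost of a query depends on bucket sizes rather than on the size of the whole labeled set. The number of bands trades recall against candidate-set size.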
Pages: 186 - 192
Number of pages: 7
Related papers
42 in total
  • [31] LSHvec: A Vector Representation of DNA Sequences Using Locality Sensitive Hashing and FastText Word Embeddings
    Shi, Lizhen
    Chen, Bo
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [32] Trusted Player Transfer Evaluation for Sport Markets Based on Blockchain and Locality-Sensitive Hashing
    Liu, Chao
    Li, Zengxi
    Liu, Shunshun
    Xie, Jushi
    Yan, Chao
    Huang, Wanli
    IEEE ACCESS, 2021, 9 : 87332 - 87339
  • [33] Locality-sensitive Hashing scheme for Bangla News Article Clustering using Bloom Filter
    Nath, Subrata
    Singha, Pranab
    Islam, Md. Saiful
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION ENGINEERING (ECCE), 2017, : 17 - 21
  • [34] An incremental community detection method for social tagging systems using locality-sensitive hashing
    Wu, Zhenyu
    Zou, Ming
    NEURAL NETWORKS, 2014, 58 : 14 - 28
  • [35] Fast Distributed kNN Graph Construction Using Auto-tuned Locality-sensitive Hashing
    Eiras-Franco, Carlos
    Martinez-Rego, David
    Kanthan, Leslie
    Pineiro, Cesar
    Bahamonde, Antonio
    Guijarro-Berdinas, Bertha
    Alonso-Betanzos, Amparo
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (06)
  • [36] Multi-view content-based mammogram retrieval using dynamic similarity and locality sensitive hashing
    Jouirou, Amira
    Baazaoui, Abir
    Barhoumi, Walid
    PATTERN RECOGNITION, 2021, 112
  • [38] Improving binary diffing speed and accuracy using community detection and locality-sensitive hashing: an empirical study
    Karamitas, Chariton
    Kehagias, Athanasios
    JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2023, 19 (02) : 319 - 337
  • [39] Scalable resource description framework clustering: A distributed approach for analyzing knowledge graphs using minHash locality sensitive hashing
    Agarwal, Pratik
    Sinha, Bam Bahadur
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (15)
  • [40] A Layered Approach to Automatic Essay Evaluation Using Word-Embedding
    Tashu, Tsegaye Misikir
    Horvath, Tomas
    COMPUTER SUPPORTED EDUCATION, 2019, 1022 : 77 - 94