Semantic Similarity Measures for the Generation of Science Tests in Basque
Cited by: 24
Authors:
Aldabe, Itziar [1]
Maritxalar, Montse [1]
Affiliation:
[1] Univ Basque Country UPV EHU, Dept Comp Languages & Syst, San Sebastian 20018, Gipuzkoa, Spain
Natural language processing;
text analysis;
computers and education;
e-learning tools;
MULTIPLE-CHOICE;
DOI: 10.1109/TLT.2014.2355831
Chinese Library Classification:
TP39 [Computer applications];
Discipline classification code:
081203 ;
0835 ;
Abstract:
The work presented in this paper aims to help teachers create multiple-choice science tests. We focus on a scientific vocabulary-learning scenario set in a Basque-language educational environment. In this scenario, we explore automatically generating Multiple-Choice Questions (MCQs) by means of Natural Language Processing (NLP) techniques and corpora. More specifically, human experts select scientific articles and identify the target terms (i.e., words). These terms are part of the vocabulary studied in the school curriculum for 13- to 14-year-olds and form the starting point from which our system generates MCQs. We automatically generate distractors that are similar in meaning to the target term. To this end, the system applies semantic similarity measures that draw on a variety of corpus-based and graph-based approaches. The paper presents a qualitative and a quantitative analysis of the generated tests to measure the quality of the proposed methods. The qualitative analysis is based on expert opinion, whereas the quantitative analysis is based on the MCQ test responses of secondary school students. A total of 951 students from 18 schools took part in the experiments. The results show that our system could help experts generate MCQs.
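As a rough illustration of the distractor-selection idea described in the abstract, the sketch below ranks candidate terms by cosine similarity to a target term. It is not the authors' method: the abstract mentions a variety of corpus-based and graph-based measures without specifying them, so cosine similarity over term vectors is only one possible corpus-based stand-in. The function names (`cosine`, `rank_distractors`), the `TOY_VECTORS` table, and its English placeholder terms and values are all hypothetical and invented for this example.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_u * norm_v)


def rank_distractors(target, vectors, k=3):
    """Return the k terms most similar to `target`, excluding the target itself."""
    target_vec = vectors[target]
    scored = [
        (cosine(target_vec, vec), term)
        for term, vec in vectors.items()
        if term != target
    ]
    # Highest similarity first; break ties alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [term for _, term in scored[:k]]


# Invented toy vectors (assumption): in practice these would be derived
# from a corpus, for example from co-occurrence counts.
TOY_VECTORS = {
    "cell": [0.9, 0.1, 0.0],
    "tissue": [0.8, 0.2, 0.1],
    "organ": [0.7, 0.3, 0.2],
    "river": [0.0, 0.1, 0.9],
}

if __name__ == "__main__":
    print(rank_distractors("cell", TOY_VECTORS, k=2))
```

In a setting like the one the abstract describes, candidates ranked this way would still need expert review, since a term that is too close in meaning to the target may itself be a correct answer.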