LEMAZA: An Arabic why-question answering system*

被引:22
作者
Azmi, Aqil M. [1 ]
Alshenaifi, Nouf A. [1 ]
机构
[1] King Saud Univ, Dept Comp Sci, Riyadh 11543, Saudi Arabia
关键词
LANGUAGE;
D O I
10.1017/S1351324917000304
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering systems retrieve information from documents in response to queries. Most of the questions are who- and what-type questions that deal with named entities. A less common and more challenging question to deal with is the why -question. In this paper, we introduce Lemaza (Arabic for why), a system for automatically answering why -questions for Arabic texts. The system is composed of four main components that make use of the Rhetorical Structure Theory. To evaluate Lemaza, we prepared a set of why -question-answer pairs whose answer can be found in a corpus that we compiled out of Open Source Arabic Corpora. Lemaza performed best when the stop-words were not removed. The performance measure was 72.7%, 79.2% and 78.7% for recall, precision and c@1, respectively.
引用
收藏
页码:877 / 903
页数:27
相关论文
共 63 条
[1]  
Abouenour L., 2013, INT J INFORM COMMUNI, V3, P37
[2]  
Abouenour L, 2008, P 2008 INT AR C INF
[3]  
Akour Mohammed, 2011, American Journal of Applied Sciences, V8, P652, DOI 10.3844/ajassp.2011.652.661
[4]   A novel root based Arabic stemmer [J].
Al-Kabi, Mohammed N. ;
Kazakzeh, Saif A. ;
Abu Ata, Belal M. ;
Al-Rababah, Saif A. ;
Alsmadi, Izzat M. .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2015, 27 (02) :94-103
[5]  
AlSanie W., 2005, THESIS
[6]  
[Anonymous], 2016, P SEMEVAL 2016
[7]  
[Anonymous], 2010, WORKSHOP LANGUAGE RE
[8]  
[Anonymous], 1999, Technical Report
[9]  
[Anonymous], 2016, P 10 INT WORKSH SEM
[10]  
[Anonymous], THESIS