IMPROVE THE AUTOMATIC SUMMARIZATION OF ARABIC TEXT DEPENDING ON RHETORICAL STRUCTURE THEORY

Cited by: 10
Authors
Ibrahim, Ahmed [1]
Elghazaly, Tarek [1]
Affiliations
[1] Cairo Univ, Inst Stat Studies & Res, Dept Comp & Informat Sci, Cairo, Egypt
Source
2013 12TH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (MICAI 2013) | 2013
Keywords
Arabic text summarization; Rhetorical Structure Theory; RST; Vector Space Model; VSM;
DOI
10.1109/MICAI.2013.35
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper applies a semantic technique, Rhetorical Structure Theory (RST), to summarization in order to identify the most significant paragraphs based on functional and semantic criteria. However, the quality of RST summarization suffers on large documents. The paper therefore proposes a new hybrid summarization model (HSM) for Arabic text that combines two sub-models. The first sub-model produces a primary summary by using RST to identify the most significant parts of the text (the nucleus). The second sub-model then ranks the significant parts of the primary rhetorical summary by cosine similarity. To evaluate the proposed model, a prototype was developed and applied to a range of articles classified into three groups of different sizes. The final output summary was evaluated against its manually produced counterpart. The experiment shows that the proposed HSM achieves an average precision of 71.6%, compared with 56.3% for the primary rhetorical summary.
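The second stage described in the abstract, ranking already-selected text units by cosine similarity in a vector space model, can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names (`tf_vector`, `cosine_similarity`, `rank_units`), the whitespace tokenization, the raw term-frequency weighting, and the choice to compare each unit against a single reference text are all assumptions, since the abstract does not specify them. The RST stage is not modelled here; `nucleus_units` stands in for its output.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector (assumed: whitespace tokenization)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_units(nucleus_units, reference_text, top_k=None):
    """Rank units by similarity to a reference text (hypothetical helper).

    nucleus_units: text units assumed to come from the RST stage.
    reference_text: the text each unit is compared against (an assumption).
    Returns (unit, score) pairs, highest score first; ties keep input order.
    """
    ref = tf_vector(reference_text)
    scored = [
        (cosine_similarity(tf_vector(unit), ref), index, unit)
        for index, unit in enumerate(nucleus_units)
    ]
    scored.sort(key=lambda item: (-item[0], item[1]))
    ranked = [(unit, score) for score, _, unit in scored]
    return ranked if top_k is None else ranked[:top_k]
```

Calling `rank_units(units, document_text, top_k=3)` would keep the three units most similar to the reference text. A real system for Arabic would likely need proper tokenization, stemming, and a weighting scheme such as TF-IDF, none of which are shown here.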
Pages: 223-227
Page count: 5