Query-based multi-documents summarization using linguistic knowledge and content word expansion

被引:20
作者
Abdi, Asad [1 ]
Idris, Norisma [1 ]
Alguliyev, Rasim M. [2 ]
Aliguliyev, Ramiz M. [2 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Artificial Intelligence, Kuala Lumpur 50603, Malaysia
[2] Azerbaijan Natl Acad Sci, Inst Informat Technol, 9 B Vahabzade St, AZ-1141 Baku, Azerbaijan
关键词
Query-based multi-document summarization; Graph-based sentence ranking; Query expansion; Extractive summarization; GRAPH;
D O I
10.1007/s00500-015-1881-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a query-based summarization method, which uses a combination of semantic relations between words and their syntactic composition, to extract meaningful sentences from document sets is introduced. The problem with current statistical methods is that they fail to capture the meaning when comparing a sentence and a user query; hence there is often a conflict between the extracted sentences and users' requirements. However, this particular method can improve the quality of document summaries because it is able to avoid extracting a sentence whose similarity with the query is high but whose meaning is different. The method is executed by computing the semantic and syntactic similarity of the sentence-to-sentence and sentence-to-query. To reduce redundancy in summary, this method uses the greedy algorithm to impose diversity penalty on the sentences. In addition, the proposed method expands the words in both the query and the sentences to tackle the problem of information limit. It bridges the lexical gaps for semantically similar contexts that are expressed using different wording. The experimental results display that the proposed method is able to improve performance compared with the participating systems in DUC 2006. The experimental results also showed that the proposed method demonstrates better performance as compared to other existing techniques on DUC 2005 and DUC 2006 datasets.
引用
收藏
页码:1785 / 1801
页数:17
相关论文
共 54 条
[1]   Automatic summarization assessment through a combination of semantic and syntactic information for intelligent educational systems [J].
Abdi, Asad ;
Idris, Norisma ;
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. .
INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (04) :340-358
[2]   Sentence selection for generic document summarization using an adaptive differential evolution algorithm [J].
Alguliev, Rasim M. ;
Aliguliyev, Ramiz M. ;
Mehdiyev, Chingiz A. .
SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (04) :213-222
[3]   A new sentence similarity measure and sentence based extractive technique for automatic text summarization [J].
Aliguliyev, Ramiz M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :7764-7772
[4]  
[Anonymous], P 22 ANN M COGN SCI
[5]  
[Anonymous], FLAIRS C
[6]  
[Anonymous], P DUC
[7]  
[Anonymous], 2009, P 2009 SIAM INT C DA
[8]  
[Anonymous], P DUC2006
[9]  
[Anonymous], NEAREST NEIGHBOR ALG
[10]  
[Anonymous], 2006, P DUC