An Intelligent Web Search Using Multi-Document Summarization

被引:3
作者
Takale, Sheetal A. [1 ]
Kulkarni, Prakash J. [2 ]
Shah, Sahil K. [1 ]
机构
[1] Vidya Pratishthans Coll Engn, Baramati, India
[2] Walchand Coll Engn, Sangli, India
关键词
Document Clustering; Multi-Document Summarization; Semantic Role; Semantic Similarity; Web Search Result Summarization;
D O I
10.4018/IJIRR.2016040103
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information available on the internet is huge, diverse and dynamic. Current Search Engine is doing the task of intelligent help to the users of the internet. For a query, it provides a listing of best matching or relevant web pages. However, information for the query is often spread across multiple pages which are returned by the search engine. This degrades the quality of search results. So, the search engines are drowning in information, but starving for knowledge. Here, we present a query focused extractive summarization of search engine results. We propose a two level summarization process: identification of relevant theme clusters, and selection of top ranking sentences to form summarized result for user query. A new approach to semantic similarity computation using semantic roles and semantic meaning is proposed. Document clustering is effectively achieved by application of MDL principle and sentence clustering and ranking is done by using SNMF. Experiments conducted demonstrate the effectiveness of system in semantic text understanding, document clustering and summarization.
引用
收藏
页码:41 / 65
页数:25
相关论文
共 34 条
[1]   Multi-document Text Summarization: SimWithFirst Based Features and Sentence Co-selection Based Evaluation [J].
Ali, Md. Mohsin ;
Ghosh, Monotosh Kumar ;
Abdullah-Al-Mamun .
INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATIONS, PROCEEDINGS, 2009, :93-96
[2]  
Amitay E., 2000, P 9 ACM INT C INF KN
[3]  
Baeza-Yates R., 2011, MODERN INFORM RETRIE
[4]   Sentence fusion for multidocument news summarization [J].
Barzilay, R ;
McKeown, KR .
COMPUTATIONAL LINGUISTICS, 2005, 31 (03) :297-327
[5]  
Berger A.L., 2000, P 23 ANN INT ACM SIG, P144, DOI DOI 10.1145/345508.345565
[6]   A preference learning approach to sentence ordering for multi-document summarization [J].
Bollegala, Danushka ;
Okazaki, Naoaki ;
Ishizuka, Mitsuru .
INFORMATION SCIENCES, 2012, 217 :78-95
[7]   A bottom-up approach to sentence ordering for multi-document summarization [J].
Bollegala, Danushka ;
Okazaki, Naoaki ;
Ishizuka, Mitsuru .
INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (01) :89-109
[8]   Mutually Reinforced Manifold-Ranking Based Relevance Propagation Model for Query-Focused Multi-Document Summarization [J].
Cai, Xiaoyan ;
Li, Wenjie .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05) :1597-1607
[9]  
Chim H., 2008, IEEE T KNOWLEDGE DAT, V20
[10]   LexRank: Graph-based lexical centrality as salience in text summarization [J].
Erkan, G ;
Radev, DR .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 :457-479