Enhancing web search by using query-based clusters and multi-document summaries

Cited by: 4
Authors
Qumsiyeh, Rani [1]
Ng, Yiu-Kai [1 ]
Affiliations
[1] Brigham Young Univ, Dept Comp Sci, Provo, UT 84604 USA
Keywords
Summarization; Clustering; Web search; Query processing
DOI
10.1007/s10115-015-0852-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Current web search engines, such as Google, Bing, and Yahoo!, rank the set of documents SD retrieved in response to a user query and display each document D in SD with a title and a snippet, which serves as an abstract of D. Snippets, however, are not as useful as intended, i.e., in helping users quickly identify results of interest, if any exist. They are inadequate in providing distinct information and in capturing the main contents of the corresponding documents. Moreover, when the information need specified in a search query is ambiguous, it is very difficult, if not impossible, for a search engine to identify precisely the set of documents that satisfy the user's intended request without requiring additional input. Furthermore, a document title is not always a good indicator of the content of the corresponding document. All of these design problems can be addressed by our proposed query-based cluster and summarizer. It generates a concise, comprehensive summary for each cluster of documents retrieved in response to a user query, which saves the user the time and effort of browsing through the documents one by one in search of specific information of interest. Experimental results show that the proposed system is effective and efficient in generating a high-quality summary for each cluster of documents on a specific topic.
Pages: 355-380
Page count: 26