Detecting and Clustering Similar Results of Search Engine by Exploiting Web Page's Contents

Cited by: 0

Authors:

Gao, Kai [1]

Wu, Hui-cong [1]

Affiliations:

[1] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China

Source:

2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31 | 2008

Keywords:

search engine; hash function; Chinese key concept; clustering

DOI:

Not available

Chinese Library Classification (CLC) number:

TN [Electronic technology, communication technology]

Discipline classification code:

0809

Abstract:

This paper presents an approach for detecting and clustering similar search engine results by analyzing the pages' URLs and contents. It uses a novel hash function together with a Chinese key concept extractor module. A similarity measure based on the degree of key concept overlap is proposed for clustering similar retrieval results, which effectively minimizes overlap among the returned results. Experimental results show the feasibility of the approach. A search engine has been developed on the basis of this work.
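
The abstract does not give the formula for the key concept overlap degree or the clustering procedure, so the following is only a minimal sketch under stated assumptions: overlap is taken to be the Jaccard coefficient of two key concept sets, clustering is a greedy single pass against each cluster's first member, and the names `overlap_degree`, `cluster_results`, and the 0.5 threshold are hypothetical. It does not reproduce the paper's hash function or its Chinese key concept extractor.

```python
from typing import List, Set


def overlap_degree(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two key concept sets (an assumed measure,
    not necessarily the one defined in the paper)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_results(concepts: List[Set[str]], threshold: float = 0.5) -> List[List[int]]:
    """Greedy single-pass clustering (hypothetical procedure).

    Each result joins the first cluster whose representative (its first
    member) overlaps with it at or above the threshold; otherwise it
    starts a new cluster. Returns clusters as lists of result indices.
    """
    clusters: List[List[int]] = []
    for i, current in enumerate(concepts):
        for cluster in clusters:
            representative = concepts[cluster[0]]
            if overlap_degree(representative, current) >= threshold:
                cluster.append(i)
                break
        else:
            # No existing cluster was similar enough.
            clusters.append([i])
    return clusters


# Illustrative key concept sets for three retrieved pages (made-up data).
results = [
    {"search", "engine", "clustering"},
    {"search", "engine", "clustering", "hash"},
    {"wireless", "network"},
]
clusters = cluster_results(results)
```

With these illustrative inputs, the first two pages share three of four distinct concepts and fall into one cluster, while the third page forms its own. A real system would need a tuned threshold and key concepts extracted from the page contents.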

Citation

Pages: 10960-10963

Page count: 4