Detecting and Clustering Similar Results of Search Engine by Exploiting Web Page's Contents

被引:0
作者
Gao, Kai [1 ]
Wu, Hui-cong [1 ]
机构
[1] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
来源
2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31 | 2008年
关键词
search engine; hash function; Chinese key concept; clustering;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This paper presents an approach to detect and cluster similar results of search engine based on analyzing pages' URLs and their contents. A novel hash function, together with a Chinese key concept extractor module, has been used. The similar measurement on key concept overlap degree is proposed to cluster similar retrieval results. This can minimize the overlap effectively. The experimental results show the feasibility of the approach. On the basis of the above works, a search engine has been developed.
引用
收藏
页码:10960 / 10963
页数:4
相关论文
共 7 条
[1]  
[Anonymous], DATA MINING
[2]  
Baeza-Yates R., 2004, MODERN INFORM RETRIE, P367
[3]  
BORDER AZ, 2000, P 11 ANN S COMB PATT
[4]  
Cho J., 2000, P ACM SIGMOD INT C M
[5]  
CHUNG CY, 2002, P 11 INT C INF KNOWL
[6]  
Li Xiao-Ming, 2004, Journal of Software, V15, P179
[7]  
NTOULAS A, 2004, P 13 ACM INT C WORLD