Search Result Clustering Through Density Analysis Based K-Medoids Method

Cited by: 1
Authors
Hung, Hungming [1]
Watada, Junzo [1]
Affiliation
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka 8080135, Japan
Source
2014 IIAI 3RD INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2014) | 2014
Keywords
clustering; search result organization; K-Medoids
DOI
10.1109/IIAI-AAI.2014.41
Chinese Library Classification
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
After search results are obtained from a web search engine, grouping them into clusters lets users browse them quickly. Currently, well-known search engines such as Google, Bing and Baidu return a long list of web pages, which can exceed a hundred million entries, ranked by relevance to the search keywords. Users are forced to examine the results to find the information they need. This consumes a lot of time when the results are so numerous and of such varied kinds. Traditional clustering techniques are inadequate for producing readable descriptions. In this research, we first build a local semantic thesaurus (L.S.T) to transform natural language into two-dimensional numerical points. Second, we analyze and gather different attributes of the search results so as to cluster them using a density-analysis-based K-Medoids method. Without requiring categories to be defined in advance, the K-Medoids method generates clusters with less susceptibility to noise. Experimental results verify the feasibility and effectiveness of our method.
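The clustering step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the density analysis is performed, so the version below rests on assumptions: density is taken to be the number of neighbours within a hypothetical `radius`, and it is used only to seed the initial medoids before a standard K-Medoids assign-and-update loop. The function name `k_medoids` and all of its parameters are illustrative and do not come from the paper.

```python
import math


def k_medoids(points, k, radius=1.0, max_iter=100):
    """Cluster 2-D points with K-Medoids, seeding medoids from dense regions.

    Assumption: "density" is the count of other points within `radius`;
    the paper's actual density analysis may differ.
    """
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Density of each point: number of other points within `radius`.
    density = [
        sum(1 for j in range(n) if j != i and dist[i][j] <= radius)
        for i in range(n)
    ]

    # Seed with the densest point, then add points that are both dense
    # and far from the medoids already chosen.
    medoids = [max(range(n), key=lambda i: density[i])]
    while len(medoids) < k:
        candidates = [i for i in range(n) if i not in medoids]
        medoids.append(max(
            candidates,
            key=lambda i: (density[i] + 1) * min(dist[i][m] for m in medoids),
        ))

    def assign(current):
        # Label each point with the index of its nearest medoid.
        return [min(range(k), key=lambda c: dist[i][current[c]])
                for i in range(n)]

    for _ in range(max_iter):
        labels = assign(medoids)
        new_medoids = []
        for c in range(k):
            members = [i for i in range(n) if labels[i] == c]
            if not members:
                # Keep the old medoid if a cluster ends up empty.
                new_medoids.append(medoids[c])
                continue
            # New medoid: the member with the smallest total distance
            # to the other members of its cluster.
            new_medoids.append(
                min(members, key=lambda i: sum(dist[i][j] for j in members)))
        if new_medoids == medoids:
            break
        medoids = new_medoids

    return medoids, assign(medoids)
```

Because medoids are always actual data points, a single distant outlier pulls a cluster centre less than it would pull a mean, which is in line with the abstract's remark about lower susceptibility to noise.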
Pages: 155-160
Number of pages: 6
Related papers
8 in total
  • [1] [Anonymous], P 99 INF RES MAN ASS
  • [2] [Anonymous], 2013, GOOGLE SEARCH ENGINE
  • [3] CUTTING DR, 1993, P 16 ANN INT ACM SIG, P125
  • [4] Hearst M.A., 1996, SIGIR 96, P76
  • [5] Leouski A., 1996, IR76 U MASS DEP COMP
  • [6] Maslowska I, 2003, LECT NOTES COMPUT SC, V2633, P555
  • [7] Grouper: a dynamic clustering interface to Web search results
    Zamir, O
    Etzioni, O
    [J]. COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 1999, 31 (11-16): : 1361 - 1374
  • [8] Zhang Dong, 2002, THESIS SE U NANJING