The New Clustering Strategy and Algorithm Based on Latent Semantic Indexing

被引:0
作者
Yan, Bing [1 ]
Du, YaJun [1 ]
Li, ZhanShen [1 ]
机构
[1] Xihua Univ, Sch Math & Comp Sci, Chengdu 610039, Sichuan, Peoples R China
来源
ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS | 2008年
关键词
D O I
10.1109/ICNC.2008.699
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, the technology of search engine is a hot in IR research. Clustering according to the themes of the search results will be well to help user to find the information. In this paper the new clustering algorithm, which named MyCluster and based on the phrase and Latent Semantic Indexing, is proposed. The result of MyCluster is composed of class labels and class contents. The class contents is a entry for users getting the information. Each class label corresponding to some class contents. The readability of cluster labels will effect the efficiency of finding a useful information. We adopt a method of Singular Value Decomposition to induce class labels and find class contents, so that the clusters have the characteristic that objects belonging to the same cluster are "similar", while objects from different clusters are "dissimilar". Lastly, we incorporate and sort the clusters. By experiments, our MyCluster has some advantages of the readability of class labels and the relevance of class contents.
引用
收藏
页码:486 / 490
页数:5
相关论文
共 8 条
[1]  
CUTTING DR, 1992, P 15 ANN INT ACM SIG, P318, DOI DOI 10.1145/133160.133214
[2]  
DU YJ, 2005, T COMPUTER INFORM TH, V1, P40
[3]  
DU YJ, 2007, ISKE 2007 P
[4]  
HEARST MA, 1996, P 19 ANN INT ACM SIG, P76
[5]  
Sun Ji-Gui, 2008, Journal of Software, V19, P48, DOI 10.3724/SP.J.1001.2008.00048
[6]  
WANG J, P 2007 INT C INT SYS
[7]   Grouper: a dynamic clustering interface to Web search results [J].
Zamir, O ;
Etzioni, O .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 1999, 31 (11-16) :1361-1374
[8]  
Zhang D., 2004, P 6 AS PAC WEB C APW