A web document clustering algorithm based on concept of neighbor

被引:0
|
作者
Song, JC [1 ]
Shen, JY [1 ]
机构
[1] Xian Jiaotong Univ, Dept Comp Sci & Technol, Xian 710049, Peoples R China
关键词
web mining; document mining; document clustering; nearest neigbor technique;
D O I
10.1109/ICMLC.2003.1264440
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the WWW devloped rapidly, it becomes the most important resource gradually that transfers and shares the global information as well as being full of the latent capacity. Recent years, the researches of the Web mining have been concerned broadly and gotten a great deal of achievements simultaneously. The nearest neighbor technique, which is hierarchical clustering method based on distance, has been applied to many cases widely for the efficiency and validity. In this paper, based on the Vector Space Model (VSM) of the Web documents, We improved the nearest neighbor method, put forward a new Web document clustering algorithm, and researched the validity and scalability of the algorithm, the time and space complexity of the algorithm.
引用
收藏
页码:46 / 50
页数:5
相关论文
共 50 条
  • [1] A fuzzy-based algorithm for Web document clustering
    Friedman, M
    Kandel, A
    Schneider, M
    Last, M
    Shapira, B
    Elovici, Y
    Zaafrany, O
    NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 524 - 527
  • [2] Clustering algorithm based on swarm intelligence for Web document
    Wu, Bin
    Fu, Wei-Peng
    Zheng, Yi
    Liu, Shao-Hui
    Shi, Zhong-Zhi
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2002, 39 (11):
  • [3] An improved clustering algorithm for web document
    Wang, Jing
    Liu, Zhijing
    Journal of Information and Computational Science, 2009, 6 (02): : 959 - 966
  • [4] Concept based document clustering using K prototype Algorithm
    Pasarate, Sneha
    Shedge, Rajashree
    2018 INTERNATIONAL CONFERENCE ON CONTROL, POWER, COMMUNICATION AND COMPUTING TECHNOLOGIES (ICCPCCT), 2018, : 579 - 583
  • [5] A co-clustering algorithm based on structured Web document
    Deng, Dong-Mei
    Long, Ji-Zhen
    Yin, Xiang-Zhou
    Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2010, 41 (05): : 1871 - 1876
  • [6] An effective web document clustering algorithm based on bisection and merge
    Ingyu Lee
    Byung-Won On
    Artificial Intelligence Review, 2011, 36 : 69 - 85
  • [7] An effective web document clustering algorithm based on bisection and merge
    Lee, Ingyu
    On, Byung-Won
    ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (01) : 69 - 85
  • [8] Fuzzy concept graph and application in web document clustering
    An, C
    Ning, C
    Jia, WJ
    Luo, SD
    2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C101 - C106
  • [9] Formal Concept Analysis Support for Web Document Clustering Based on Social Tagging
    Ouyang, Chunping
    Yang, Xiaohua
    Li, Xiaoyun
    Liu, Zhiming
    2012 2ND INTERNATIONAL CONFERENCE ON UNCERTAINTY REASONING AND KNOWLEDGE ENGINEERING (URKE), 2012, : 304 - 307
  • [10] Neighbor-based clustering algorithm
    Wong, Ching-Chang
    Lin, Bo-Chen
    International Journal of Electrical Engineering, 2004, 11 (02): : 183 - 191