Combining fuzzy clustering with Naive Bayes augmented learning in text classification

被引:2
作者
Liu, Lizhen [1 ]
Sun, Xiaojing [1 ]
Song, Hantao [2 ]
机构
[1] Capital Normal Univ, Informat Engn Coll, Beijing, Peoples R China
[2] Beijing Inst Technol, Dept Comp, Beijing, Peoples R China
来源
2006 1ST INTERNATIONAL SYMPOSIUM ON PERVASIVE COMPUTING AND APPLICATIONS, PROCEEDINGS | 2006年
关键词
text classification; Fuzzy clustering; Naive Bayes;
D O I
10.1109/SPCA.2006.297562
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
For obtaining labeled training samples in text data mining, transcendental knowledge of samples and non-supervisory of clustering were combined. Fuzzy Partition Clustering Method (FPCM) was presented and used to obtain a few labeled texts and some external clusters automatically by measuring the similarity degree of clustering correlation texts. So classification bases were found for supervised learning. Naive Bayes augment learning manner was further combined to design and learn classifiers, and the way of estimating the loss of classing error was used to balance the selection of those example candidates. The combination of those two methods has advanced the precision of text classification and makes classification learning of non-labeled training example with more potential applications.
引用
收藏
页码:168 / +
页数:2
相关论文
共 9 条
  • [1] GONG XJ, 2002, J COMPUTER RES DEV, V39, P574
  • [2] GORDON S, 2001, MINING WEB, P348
  • [3] HUTTER M, 2001, P 14 INT C NEUR INF
  • [4] KEOGH EJ, 2002, LEARNING AUGMENTED B
  • [5] The research of Web Mining
    Liu, LZ
    Chen, JJ
    Song, HT
    [J]. PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, 2002, : 2333 - 2337
  • [6] LIU LZ, 2005, P 6 INT S TEST MEAS, V6, P8517
  • [7] MENA J, 2000, DATA MINING YOUR WEB, P368
  • [8] SHI W, 2000, COMPUTER SCI, V27, P237
  • [9] ZHANG Xuegong, 2000, PATTERN RECOGNITION