The Research of kNN Text Categorization Algorithm Based On Eager Learning

被引:4
作者
Dong, Tao [1 ]
Cheng, Weinan [1 ]
Shang, Wenqian [1 ]
机构
[1] Commun Univ China, Sch Comp, Beijing, Peoples R China
来源
2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE) | 2012年
关键词
Text categorization; kNN; Eager learning;
D O I
10.1109/ICICEE.2012.297
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Text categorization is a fundamental methodology of text mining and it is also a hot topic of the research of data mining and web mining in recent years. It plays an important role in business, government decision-making management, scientific research, and so on. This paper presents an improved algorithm of text categorization which combines eager learning with kNN classification. Experimental results show that the improved algorithm not only improve the efficiency of categorization, but also significantly increase the accuracy of categorization and produce a qualitative leap on the practical value of the sensitive information system.
引用
收藏
页码:1120 / 1123
页数:4
相关论文
共 7 条
  • [1] [Anonymous], 1971, The SMART Retrieval System-Experiments in Automatic Document Processing
  • [2] Jiang Tao, 2009, Computer Engineering and Applications, V45, P153, DOI 10.3778/j.issn.1002-8331.2009.07.046
  • [3] Kamber M., 2007, DATA MINGING CONCEPT
  • [4] Liu B., 2009, WEB DATA MINING
  • [5] Liu Zhen-yan, 2002, Mini-Micro Systems, V23, P1489
  • [6] VECTOR-SPACE MODEL FOR AUTOMATIC INDEXING
    SALTON, G
    WONG, A
    YANG, CS
    [J]. COMMUNICATIONS OF THE ACM, 1975, 18 (11) : 613 - 620
  • [7] COMPUTER EVALUATION OF INDEXING AND TEXT PROCESSING
    SALTON, G
    LESK, ME
    [J]. JOURNAL OF THE ACM, 1968, 15 (01) : 8 - &