A Chinese News Archiving Approach Based on Incremental Naive Bayesian Learning

被引:0
作者
Wang, Baofeng [1 ]
Long, Shun [2 ]
机构
[1] Jinan Univ, Dept Japanese Studies, Guangzhou 510632, Guangdong, Peoples R China
[2] Jinan Univ, Dept Comp Sci, Guangzhou 510632, Guangdong, Peoples R China
来源
2014 2ND INTERNATIONAL CONFERENCE ON ECONOMIC, BUSINESS MANAGEMENT AND EDUCATION INNOVATION (EBMEI 2014), VOL 39 | 2014年 / 39卷
关键词
News archiving; Feature extraction; Naive Bayes classifier; Incremental learning;
D O I
暂无
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Although various approaches have been proposed to solve the demanding news archiving problem in the past few decades, few provides results satisfying enough to be deployed in practice. We propose in this paper an approach based on both incremental learning and naive Bayesian learning. An improved feature extraction approach is developed, before a Naive Bayes classifier be modified in consideration of efficiency in practice. Preliminary experiments carried out on Nanfang Daily news archive show that these above modifications significantly improve the efficiency and solve the overflow problem in news archiving.
引用
收藏
页码:134 / 139
页数:6
相关论文
共 12 条
  • [1] Bloehdorn S., 2004, P WORKSH MIN SEM WEB
  • [2] Hayes P., 1991, P 1 INT C ART INT AP
  • [3] Hothom T., 2003, PATTERN RECOGNITION, V36
  • [4] Joachims T., 1998, P 10 EUR C MACH LEAR
  • [5] Li Q-H, 2003, J CHINESE INFORM PRO, V17
  • [6] Liu H-F., 2008, APPL RES COMPUTERS, V25
  • [7] ON RELEVANCE, PROBABILISTIC INDEXING AND INFORMATION RETRIEVAL
    MARON, ME
    KUHNS, JL
    [J]. JOURNAL OF THE ACM, 1960, 7 (03) : 216 - 244
  • [8] NIGAM K, 2000, MACH LEARN, P39
  • [9] Song Y., 2009, J SOFTWARE, V20
  • [10] Wang W-L., 2007, COMPUTER APPL, V27