Text Categorization Based on Topic Model

被引:0
|
作者
School of Computer Science and Technology, China University of Mining and Technology, Jiangsu Province, Xuzhou [1 ]
221116, China
不详 [2 ]
100081, China
机构
[1] School of Computer Science and Technology, China University of Mining and Technology, Jiangsu Province, Xuzhou
[2] School of Computer Science and Technology, Beijing Institute of Technology, Haidian District, Beijing
来源
Int. J. Comput. Intell. Syst. | 2009年 / 4卷 / 398-409期
关键词
Category Language Model; Latent Dirichlet allocation; Topic model; Variational Inference;
D O I
10.2991/ijcis.2009.2.4.8
中图分类号
学科分类号
摘要
In the text literature, many topic models were proposed to represent documents and words as topics or latent topics in order to process text effectively and accurately. In this paper, we propose LDACLM or Latent Dirichlet Allocation Category Language Model for text categorization and estimate parameters of models by variational inference. As a variant of Latent Dirichlet Allocation Model, LDACLM regards documents of category as Language Model and uses variational parameters to estimate maximum a posteriori of terms. In general, experiments show LDACLM model is effective and outperform Naïve Bayes with Laplace smoothing and Rocchio algorithm but little inferior to SVM for text categorization. © 2009, the authors.
引用
收藏
页码:398 / 409
页数:11
相关论文
共 50 条
  • [41] Feature selection for text data via topic modeling
    Jang, Woosol
    Kim, Ye Eun
    Son, Won
    KOREAN JOURNAL OF APPLIED STATISTICS, 2022, 35 (06) : 739 - 754
  • [42] A Study of Text Vectorization Method Combining Topic Model and Transfer Learning
    Yang, Xi
    Yang, Kaiwen
    Cui, Tianxu
    Chen, Min
    He, Liyan
    PROCESSES, 2022, 10 (02)
  • [43] Word co-occurrence augmented topic model in short text
    Chen, Guan-Bin
    Kao, Hung-Yu
    INTELLIGENT DATA ANALYSIS, 2017, 21 : S55 - S70
  • [44] Temporal-based Feature Selection and Transfer Learning for Text Categorization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 17 - 26
  • [45] Spatial topic pyramid model: topic model with regional spatial information
    Pan, Zhiyong
    Liu, Yang
    Liu, Guojun
    Guo, Maozu
    Li, Mingyu
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [46] Combining topic-based model and text categorisation approach for utterance understanding in human-machine dialogue
    Lichouri, Mohamed
    Djeradi, Rachida
    Djeradi, Amar
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2018, 17 (01) : 109 - 117
  • [47] A Topic Model Based on Poisson Decomposition
    Jiang, Haixin
    Zhou, Rui
    Zhang, Limeng
    Wang, Hua
    Zhang, Yanchun
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1489 - 1498
  • [48] TIARA: Interactive, Topic-Based Visual Text Summarization and Analysis
    Liu, Shixia
    Zhou, Michelle X.
    Pan, Shimei
    Song, Yangqiu
    Qian, Weihong
    Cai, Weijia
    Lian, Xiaoxiao
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2012, 3 (02)
  • [49] Feature Extraction of Deep Topic Model for Multi-label Text Classification
    Chen W.
    Liu X.
    Lu M.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (09): : 785 - 792
  • [50] User clustering in a dynamic social network topic model for short text streams
    Qiu, Zhangcheng
    Shen, Hong
    INFORMATION SCIENCES, 2017, 414 : 102 - 116