Sparse Topical Coding with Sparse Groups

被引:2
|
作者
Peng, Min [1 ]
Xie, Qianqian [1 ]
Huang, Jiajia [1 ]
Zhu, Jiahui [1 ]
Ouyang, Shuang [1 ]
Huang, Jimin [1 ]
Tian, Gang [1 ]
机构
[1] Wuhan Univ, Sch Comp, Wuhan, Peoples R China
来源
WEB-AGE INFORMATION MANAGEMENT, PT I | 2016年 / 9658卷
关键词
Document representation; Topic model; Sparse coding; Sparse group lasso; REGRESSION; SELECTION;
D O I
10.1007/978-3-319-39937-9_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning a latent semantic representing from a large number of short text corpora makes a profound practical significance in research and engineering. However, it is difficult to use standard topic models in microblogging environments since microblogs have short length, large amount, snarled noise and irregular modality characters, which prevent topic models from using full information of microblogs. In this paper, we propose a novel non-probabilistic topic model called sparse topical coding with sparse groups (STCSG), which is capable of discovering sparse latent semantic representations of large short text corpora. STCSG relaxes the normalization constraint of the inferred representations with sparse group lasso, a sparsity-inducing regularizer, which is convenient to directly control the sparsity of document, topic and word codes. Furthermore, the relaxed non-probabilistic STCSG can be effectively learned with alternating direction method of multipliers (ADMM). Our experimental results on Twitter dataset demonstrate that STCSG performs well in finding meaningful latent representations of short documents. Therefore, it can substantially improve the accuracy and efficiency of document classification.
引用
收藏
页码:415 / 426
页数:12
相关论文
共 50 条
  • [1] Bayesian Sparse Topical Coding
    Peng, Min
    Xie, Qianqian
    Wang, Hua
    Zhang, Yanchun
    Tian, Gang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (06) : 1080 - 1093
  • [2] Block Bayesian Sparse Topical Coding
    Peng, Min
    Shi, Hongliang
    Xie, Qianqian
    Zhang, Yihan
    Wang, Hua
    Li, Zhaoyunfei
    Yong, Jianming
    PROCEEDINGS OF THE 2018 IEEE 22ND INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN ((CSCWD)), 2018, : 271 - 276
  • [3] Clustering Improvement via Integrating with Sparse Topical Coding
    Ahmadi, Parvin
    Kaviani, Razie
    Gholampour, Iman
    Tabandeh, Mahmoud
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 466 - 471
  • [4] Discovery of the Topical Object in Commercial Video: A Sparse Coding Method
    Liu, Yunhui
    Liu, Huaping
    Sun, Fuchun
    PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 245 - 254
  • [5] Universal Regularizers for Robust Sparse Coding and Modeling
    Ramirez, Ignacio
    Sapiro, Guillermo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (09) : 3850 - 3864
  • [6] Order Preserving Sparse Coding
    Ni, Bingbing
    Moulin, Pierre
    Yan, Shuicheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (08) : 1615 - 1628
  • [7] Hessian sparse coding
    Zheng, Miao
    Bu, Jiajun
    Chen, Chun
    NEUROCOMPUTING, 2014, 123 : 247 - 254
  • [8] Sparse Coding with Outliers
    Dai, Xiangguang
    Zhang, Keke
    Zhang, Wei
    Xiong, Jiang
    Feng, Yuming
    2019 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2019, : 246 - 249
  • [9] BAYESIAN NEURAL NETWORKS FOR SPARSE CODING
    Kuzin, Danil
    Isupova, Olga
    Mihaylova, Lyudmila
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2992 - 2996
  • [10] Supervised Bayesian Sparse Coding for Classification
    Xu, Jinhua
    Ding, Li
    Sun, Shiliang
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 517 - 524