Sparse Topical Coding with Sparse Groups

被引:2
作者
Peng, Min [1 ]
Xie, Qianqian [1 ]
Huang, Jiajia [1 ]
Zhu, Jiahui [1 ]
Ouyang, Shuang [1 ]
Huang, Jimin [1 ]
Tian, Gang [1 ]
机构
[1] Wuhan Univ, Sch Comp, Wuhan, Peoples R China
来源
WEB-AGE INFORMATION MANAGEMENT, PT I | 2016年 / 9658卷
关键词
Document representation; Topic model; Sparse coding; Sparse group lasso; REGRESSION; SELECTION;
D O I
10.1007/978-3-319-39937-9_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning a latent semantic representing from a large number of short text corpora makes a profound practical significance in research and engineering. However, it is difficult to use standard topic models in microblogging environments since microblogs have short length, large amount, snarled noise and irregular modality characters, which prevent topic models from using full information of microblogs. In this paper, we propose a novel non-probabilistic topic model called sparse topical coding with sparse groups (STCSG), which is capable of discovering sparse latent semantic representations of large short text corpora. STCSG relaxes the normalization constraint of the inferred representations with sparse group lasso, a sparsity-inducing regularizer, which is convenient to directly control the sparsity of document, topic and word codes. Furthermore, the relaxed non-probabilistic STCSG can be effectively learned with alternating direction method of multipliers (ADMM). Our experimental results on Twitter dataset demonstrate that STCSG performs well in finding meaningful latent representations of short documents. Therefore, it can substantially improve the accuracy and efficiency of document classification.
引用
收藏
页码:415 / 426
页数:12
相关论文
共 50 条
  • [31] Highly overcomplete sparse coding
    Olshausen, Bruno A.
    HUMAN VISION AND ELECTRONIC IMAGING XVIII, 2013, 8651
  • [32] EFFICIENT CONVOLUTIONAL SPARSE CODING
    Wohlberg, Brendt
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [33] Sparse coding of natural contours
    Hoyer, PO
    Hyvärinen, A
    NEUROCOMPUTING, 2002, 44 : 459 - 466
  • [34] SPARSE CODING FOR SPEECH RECOGNITION
    Sivaram, G. S. V. S.
    Nemala, Sridhar Krishna
    Elhilali, Mounya
    Trac D. Tran
    Hermansky, Hynek
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4346 - 4349
  • [35] Multiple graph regularized sparse coding and multiple hypergraph regularized sparse coding for image representation
    Jin, Taisong
    Yu, Zhengtao
    Li, Lingling
    Li, Cuihua
    NEUROCOMPUTING, 2015, 154 : 245 - 256
  • [36] Fast Sparse Coding for Range Data Denoising with Sparse Ridges Constraint
    Gao, Zhi
    Lao, Mingjie
    Sang, Yongsheng
    Wen, Fei
    Ramesh, Bharath
    Zhai, Ruifang
    SENSORS, 2018, 18 (05)
  • [37] Smooth sparse coding via marginal regression for learning sparse representations
    Balasubramanian, Krishnakumar
    Yu, Kai
    Lebanon, Guy
    ARTIFICIAL INTELLIGENCE, 2016, 238 : 83 - 95
  • [38] Optimization of learned dictionary for sparse coding in speech processing
    He, Yongjun
    Sun, Guanglu
    Han, Jiqing
    NEUROCOMPUTING, 2016, 173 : 471 - 482
  • [39] MOBILE VISUAL SEARCH VIA HIEVARCHICAL SPARSE CODING
    Yang, Xiyu
    Liu, Lianli
    Qian, Xueming
    Mei, Tao
    Shen, Jialie
    Tian, Qi
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,
  • [40] Spectrum enhancement with sparse coding for robust speech recognition
    He, Yongjun
    Sun, Guanglu
    Han, Jiqing
    DIGITAL SIGNAL PROCESSING, 2015, 43 : 59 - 70