Sparse Representation Based Query Classification Using LDA Topic Modeling

被引:2
作者
Bhattacharya, Indrani [1 ]
Sil, Jaya [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Sibpur, Howrah, India
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2 | 2017年 / 469卷
关键词
Topic modeling; LDA; Sparse classifier; Statistical methods;
D O I
10.1007/978-981-10-1678-3_59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, tremendous growth of documents provides scope and challenges to the interdisciplinary research community in text processing for retrieving information. Text analytics reveals high-quality information by identifying patterns and its trends using statistical methods. In this paper, we propose a novel approach to classify user query in a reduced search space by considering the query as a collection of words distributed over different topics. Latent Dirichlet allocation (LDA) has been used for topic modeling and a collection of topics containing words are obtained following Dirichlet distribution. We construct a sparse matrix called topic-vocabulary matrix (TVM) using probability distribution of words appearing in the topics. Finally, sparse representation based classifier (SRC) has been applied for classifying query using TVM consisting of training patterns. Here, we have analyzed the effect of number of patterns in classifying the queries and achieved 90.4 % accuracy.
引用
收藏
页码:621 / 629
页数:9
相关论文
共 50 条
  • [41] LDA Topic Modeling on Twitter Data Concerning Immigrants and Refugees
    Ergul, Halil Ibrahim
    Terzioglu, Aysecan
    Tercan, Murat
    Yanikoglu, Berrin
    Arin, Inanc
    2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [42] LDA-PSTR: A Topic Modeling Method for Short Text
    Zhou, Kai
    Yang, Qun
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2018, 2018, 11323 : 339 - 352
  • [43] GOW-LDA: Applying Term Co-occurrence Graph Representation in LDA Topic Models Improvement
    Phu Pham
    Phuc Do
    Ta, Chien D. C.
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 420 - 431
  • [44] AIRCRAFT TARGET RECOGNITION USING COPULA JOINT STATISTICAL MODEL AND SPARSE REPRESENTATION BASED CLASSIFICATION
    Karine, Ayoub
    Toumi, Abdelmalek
    Khenchaf, Ali
    El Hassouni, Mohammed
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 3635 - 3638
  • [45] LDA-based topic modeling for COVID-19-related sports research trends
    Lee, Jea Woog
    Kim, YoungBin
    Han, Doug Hyun
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [46] Labeling Blog Posts with Wikipedia Entries through LDA-Based Topic Modeling of Wikipedia
    Makita, Kensaku
    Suzuki, Hiroko
    Koike, Daichi
    Utsuro, Takehito
    Kawada, Yasuhide
    Fukuhara, Tomohiro
    JOURNAL OF INTERNET TECHNOLOGY, 2013, 14 (02): : 297 - 306
  • [47] Microblog topic evolution computing based on LDA algorithm
    Feng Jian
    Wang Yajiao
    Ding Yuanyuan
    OPEN PHYSICS, 2018, 16 (01): : 509 - 516
  • [48] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Hamed Jelodar
    Yongli Wang
    Chi Yuan
    Xia Feng
    Xiahui Jiang
    Yanchao Li
    Liang Zhao
    Multimedia Tools and Applications, 2019, 78 : 15169 - 15211
  • [49] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Jelodar, Hamed
    Wang, Yongli
    Yuan, Chi
    Feng, Xia
    Jiang, Xiahui
    Li, Yanchao
    Zhao, Liang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (11) : 15169 - 15211
  • [50] Latent Dirichlet allocation (LDA) for topic modeling of the CFPB consumer complaints
    Bastani, Kaveh
    Namavari, Hamed
    Shaffer, Jeffrey
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 127 : 256 - 271