Sparse Representation Based Query Classification Using LDA Topic Modeling

被引:2
作者
Bhattacharya, Indrani [1 ]
Sil, Jaya [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Sibpur, Howrah, India
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2 | 2017年 / 469卷
关键词
Topic modeling; LDA; Sparse classifier; Statistical methods;
D O I
10.1007/978-981-10-1678-3_59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, tremendous growth of documents provides scope and challenges to the interdisciplinary research community in text processing for retrieving information. Text analytics reveals high-quality information by identifying patterns and its trends using statistical methods. In this paper, we propose a novel approach to classify user query in a reduced search space by considering the query as a collection of words distributed over different topics. Latent Dirichlet allocation (LDA) has been used for topic modeling and a collection of topics containing words are obtained following Dirichlet distribution. We construct a sparse matrix called topic-vocabulary matrix (TVM) using probability distribution of words appearing in the topics. Finally, sparse representation based classifier (SRC) has been applied for classifying query using TVM consisting of training patterns. Here, we have analyzed the effect of number of patterns in classifying the queries and achieved 90.4 % accuracy.
引用
收藏
页码:621 / 629
页数:9
相关论文
共 50 条
  • [31] A novel content-based recommendation approach based on LDA topic modeling for literature recommendation
    Bagul, Dhiraj Vaibhav
    Barve, Sunita
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 954 - 961
  • [32] Twin labeled LDA: a supervised topic model for document classification
    Wei Wang
    Bing Guo
    Yan Shen
    Han Yang
    Yaosen Chen
    Xinhua Suo
    Applied Intelligence, 2020, 50 : 4602 - 4615
  • [33] Twin labeled LDA: a supervised topic model for document classification
    Wang, Wei
    Guo, Bing
    Shen, Yan
    Yang, Han
    Chen, Yaosen
    Suo, Xinhua
    APPLIED INTELLIGENCE, 2020, 50 (12) : 4602 - 4615
  • [34] A Prior Setting that Improves LDA in both Document Representation and Topic Extraction
    Ding, Juncheng
    Jin, Wei
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [35] NeurIPS Conference Papers Classification Based on Topic Modeling
    Terko, Ajsa
    Zunic, Emir
    Donko, Dzenana
    2019 XXVII INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND AUTOMATION TECHNOLOGIES (ICAT 2019), 2019,
  • [36] Using PageRank for Characterizing Topic Quality in LDA
    Das Gollapalli, Sujatha
    Li, Xiao-li
    PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 115 - 122
  • [37] NET-LDA: a novel topic modeling method based on semantic document similarity
    Ekinci, Ekin
    Omurca, Sevinc Ilhan
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2020, 28 (04) : 2244 - 2260
  • [38] Topic Mining for Call Centers Based on LDA
    Guo, Wenming
    Deng, Tianlang
    2014 10TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2014, : 839 - 844
  • [39] The Analysis of QQ Group Topic based on the LDA
    Qiu, Jin
    Di, Xiaoqiang
    Yin, Dali
    Bi, Lin
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 299 - 303
  • [40] Big Data Software Engineering: Analysis of Knowledge Domains and Skill Sets Using LDA-Based Topic Modeling
    Gurcan, Fatih
    Cagiltay, Nergiz Ercil
    IEEE ACCESS, 2019, 7 : 82541 - 82552