Sparse Representation Based Query Classification Using LDA Topic Modeling

被引:2
作者
Bhattacharya, Indrani [1 ]
Sil, Jaya [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Sibpur, Howrah, India
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2 | 2017年 / 469卷
关键词
Topic modeling; LDA; Sparse classifier; Statistical methods;
D O I
10.1007/978-981-10-1678-3_59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, tremendous growth of documents provides scope and challenges to the interdisciplinary research community in text processing for retrieving information. Text analytics reveals high-quality information by identifying patterns and its trends using statistical methods. In this paper, we propose a novel approach to classify user query in a reduced search space by considering the query as a collection of words distributed over different topics. Latent Dirichlet allocation (LDA) has been used for topic modeling and a collection of topics containing words are obtained following Dirichlet distribution. We construct a sparse matrix called topic-vocabulary matrix (TVM) using probability distribution of words appearing in the topics. Finally, sparse representation based classifier (SRC) has been applied for classifying query using TVM consisting of training patterns. Here, we have analyzed the effect of number of patterns in classifying the queries and achieved 90.4 % accuracy.
引用
收藏
页码:621 / 629
页数:9
相关论文
共 50 条
  • [21] Classification of Programming Problems based on Topic Modeling
    Intisar, Chowdhury Md
    Watanobe, Yutaka
    Poudel, Manoj
    Bhalla, Subhash
    PROCEEDINGS OF 2019 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND EDUCATION TECHNOLOGY (ICIET 2019), 2019, : 275 - 283
  • [22] Data Analysis of Psychological Approaches to Soccer Research: Using LDA Topic Modeling
    Lee, Jea Woog
    Han, Doug Hyun
    BEHAVIORAL SCIENCES, 2023, 13 (10)
  • [23] Analyzing tourism reviews using an LDA topic-based sentiment analysis approach
    Ali, Twil
    Omar, Bencharef
    Soulaimane, Kaloun
    METHODSX, 2022, 9
  • [24] Analyzing the DarkNetMarkets subreddit for evolutions of tools and trends using LDA topic modeling
    Porter, Kyle
    DIGITAL INVESTIGATION, 2018, 26 : S87 - S97
  • [25] Using Topic Modeling in Classification of Brazilian Lawsuits
    Aguiar, Andre
    Silveira, Raquel
    Furtado, Vasco
    Pinheiro, Vladia
    Monteiro Neto, Joao A.
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 233 - 242
  • [26] Stable Topic Modeling for Web Science: Granulated LDA
    Koltcov, Sergei
    Nikolenko, Sergey I.
    Koltsova, Olessia
    Bodrunova, Svetlana S.
    PROCEEDINGS OF THE 2016 ACM WEB SCIENCE CONFERENCE (WEBSCI'16), 2016, : 342 - 343
  • [27] Combining IR and LDA Topic Modeling for Filtering Microblogs
    Hajjem, Malek
    Latiri, Chiraz
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 : 761 - 770
  • [28] Topic research in fuzzy domain: Based on LDA topic modelling
    Yu, Dejian
    Fang, Anran
    Xu, Zeshui
    INFORMATION SCIENCES, 2023, 648
  • [29] LDA-based online topic detection using tensor factorization
    Guo, Xin
    Xiang, Yang
    Chen, Qian
    Huang, Zhenhua
    Hao, Yongtao
    JOURNAL OF INFORMATION SCIENCE, 2013, 39 (04) : 459 - 469
  • [30] MAPPING LEARNER'S QUERY TO LEARNING OBJECTS USING TOPIC MODELING AND MACHINE LEARNING TECHNIQUES
    Sengupta, Souvik
    Pal, Saurabh
    Pramanik, Pijush kanti dutta
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2023, 24 (04): : 909 - 917