Integration of Neural Embeddings and Probabilistic Models in Topic Modeling

被引:0
作者
Koochemeshkian, Pantea [1 ]
Bouguila, Nizar [1 ]
机构
[1] Concordia Inst Informat Syst Engn CIISE, Informat Syst Engn, Montreal, PQ, Canada
关键词
DIRICHLET; EXTRACTION;
D O I
10.1080/08839514.2024.2403904
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic modeling, a way to find topics in large volumes of text, has grown with the help of deep learning. This paper presents two novel approaches to topic modeling by integrating embeddings derived from Bert-Topic with the multi-grain clustering topic model (MGCTM). Recognizing the inherent hierarchical and multi-scale nature of topics in corpora, our methods utilize MGCTM to capture topic structures at multiple levels of granularity. We enhance the expressiveness of MGCTM by introducing the Generalized Dirichlet and Beta-Liouville distributions as priors, which provide greater flexibility in modeling topic proportions and capturing richer topic relationships. Comprehensive experiments on various datasets showcase the effectiveness of our proposed models in achieving superior topic coherence and granularity compared to state-of-the-art methods. Our findings underscore the potential of leveraging hybrid architectures, marrying neural embeddings with advanced probabilistic modeling, to push the boundaries of topic modeling.
引用
收藏
页数:33
相关论文
共 50 条
  • [31] A computational analysis of aspect-based sentiment analysis research through bibliometric mapping and topic modeling
    Chen, Xieling
    Xie, Haoran
    Tao, Xiaohui
    Wang, Fu Lee
    Zhang, Dian
    Dai, Hong-Ning
    JOURNAL OF BIG DATA, 2025, 12 (01)
  • [32] Role of neural network models for developing speech systems
    Rao, K. Sreenivasa
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 783 - 836
  • [33] A Framework Based on K-Means Clustering and Topic Modeling for Analyzing Unstructured Manufacturing Capability Data
    Sabbagh, Ramin
    Ameri, Farhad
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2020, 20 (01)
  • [34] A learning framework for information block search based on probabilistic graphical models and Fisher Kernel
    Wong, Tak-Lam
    Xie, Haoran
    Lam, Wai
    Wang, Fu Lee
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (09) : 1473 - 1487
  • [35] Modeling and Optimization of EMI Filter by Using Artificial Neural Network
    Chen, Henglin
    Ye, Shize
    IEEE TRANSACTIONS ON ELECTROMAGNETIC COMPATIBILITY, 2019, 61 (06) : 1979 - 1987
  • [36] Neural approach for temperature-dependent modeling of GaN HEMTs
    Marinkovic, Zlatica
    Crupi, Giovanni
    Caddemi, Alina
    Avolio, Gustavo
    Raffo, Antonio
    Markovic, Vera
    Vannini, Giorgio
    Schreurs, Dominique M. M. -P.
    INTERNATIONAL JOURNAL OF NUMERICAL MODELLING-ELECTRONIC NETWORKS DEVICES AND FIELDS, 2015, 28 (04) : 359 - 370
  • [37] Comparing neural models for nested and overlapping biomedical event detection
    Espinosa, Kurt
    Georgiadis, Panagiotis
    Christopoulou, Fenia
    Ju, Meizhi
    Miwa, Makoto
    Ananiadou, Sophia
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [38] Parameters Estimation of PV Models Using Artificial Neural Network
    Abdellatif, Hussein
    Hossain, Md Ismail
    Abido, Mohammad A.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (11) : 14947 - 14956
  • [39] Modeling of lime production process using artificial neural network
    Daeichian, Abolghasem
    Shahramfar, Rana
    Heidari, Elham
    CHEMICAL PRODUCT AND PROCESS MODELING, 2022, 17 (06): : 655 - 667
  • [40] Neural entity linking: A survey of models based on deep learning
    Sevgili, Oezge
    Shelmanov, Artem
    Arkhipov, Mikhail
    Panchenko, Alexander
    Biemann, Chris
    SEMANTIC WEB, 2022, 13 (03) : 527 - 570