A Study on Performance Enhancement by Integrating Neural Topic Attention with Transformer-Based Language Model

被引:1
|
作者
Um, Taehum [1 ]
Kim, Namhyoung [1 ]
机构
[1] Gachon Univ, Dept Appl Stat, 1342 Seongnam Daero, Seongnam 13120, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 17期
基金
新加坡国家研究基金会;
关键词
natural language processing; neural topic model; ELECTRA; ALBERT; multi-classification;
D O I
10.3390/app14177898
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
As an extension of the transformer architecture, the BERT model has introduced a new paradigm for natural language processing, achieving impressive results in various downstream tasks. However, high-performance BERT-based models-such as ELECTRA, ALBERT, and RoBERTa-suffer from limitations such as poor continuous learning capability and insufficient understanding of domain-specific documents. To address these issues, we propose the use of an attention mechanism to combine BERT-based models with neural topic models. Unlike traditional stochastic topic modeling, neural topic modeling employs artificial neural networks to learn topic representations. Furthermore, neural topic models can be integrated with other neural models and trained to identify latent variables in documents, thereby enabling BERT-based models to sufficiently comprehend the contexts of specific fields. We conducted experiments on three datasets-Movie Review Dataset (MRD), 20Newsgroups, and YELP-to evaluate our model's performance. Compared to the vanilla model, the proposed model achieved an accuracy improvement of 1-2% for the ALBERT model in multiclassification tasks across all three datasets, while the ELECTRA model showed an accuracy improvement of less than 1%.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] Vision transformer-based visual language understanding of the construction process
    Yang, Bin
    Zhang, Binghan
    Han, Yilong
    Liu, Boda
    Hu, Jiniming
    Jin, Yiming
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 99 : 242 - 256
  • [12] CardioBERTpt: Transformer-based Models for Cardiology Language Representation in Portuguese
    Rubel Schneider, Elisa Terumi
    Gumiel, Yohan Bonescki
    Andrioli de Souza, Joao Vitor
    Mukai, Lilian Mie
    Silva e Oliveira, Lucas Emanuel
    Rebelo, Marina de Sa
    Gutierrez, Marco Antonio
    Krieger, Jose Eduardo
    Teodoro, Douglas
    Moro, Claudia
    Paraiso, Emerson Cabrera
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 378 - 381
  • [13] AraCovTexFinder: Leveraging the transformer-based language model for Arabic COVID-19 text identification
    Hossain, Md. Rajib
    Hoque, Mohammed Moshiul
    Siddique, Nazmul
    Dewan, Ali Akber
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [14] Character-Level Transformer-Based Neural Machine Translation
    Banar, Nikolay
    Daelemans, Walter
    Kestemont, Mike
    2020 4TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2020, 2020, : 149 - 156
  • [15] Localizing in-domain adaptation of transformer-based biomedical language models
    Buonocore, Tommaso Mario
    Crema, Claudio
    Redolfi, Alberto
    Bellazzi, Riccardo
    Parimbelli, Enea
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 144
  • [16] Efficient Open Domain Question Answering With Delayed Attention in Transformer-Based Models
    Siblini, Wissam
    Challal, Mohamed
    Pasqual, Charlotte
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2022, 18 (02)
  • [17] Enhancing stock price prediction using GANs and transformer-based attention mechanisms
    Li, Siyi
    Xu, Sijie
    EMPIRICAL ECONOMICS, 2025, 68 (01) : 373 - 403
  • [18] Transformer-based deep neural network language models for Alzheimer’s disease risk assessment from targeted speech
    Alireza Roshanzamir
    Hamid Aghajan
    Mahdieh Soleymani Baghshah
    BMC Medical Informatics and Decision Making, 21
  • [19] Transformer-based deep neural network language models for Alzheimer's disease risk assessment from targeted speech
    Roshanzamir, Alireza
    Aghajan, Hamid
    Soleymani Baghshah, Mahdieh
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [20] Public Sentiment toward Solar Energy-Opinion Mining of Twitter Using a Transformer-Based Language Model
    Kim, Serena Y.
    Ganesan, Koushik
    Dickens, Princess
    Panda, Soumya
    SUSTAINABILITY, 2021, 13 (05) : 1 - 19