A Study on Performance Enhancement by Integrating Neural Topic Attention with Transformer-Based Language Model

被引:1
|
作者
Um, Taehum [1 ]
Kim, Namhyoung [1 ]
机构
[1] Gachon Univ, Dept Appl Stat, 1342 Seongnam Daero, Seongnam 13120, South Korea
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 17期
基金
新加坡国家研究基金会;
关键词
natural language processing; neural topic model; ELECTRA; ALBERT; multi-classification;
D O I
10.3390/app14177898
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
As an extension of the transformer architecture, the BERT model has introduced a new paradigm for natural language processing, achieving impressive results in various downstream tasks. However, high-performance BERT-based models-such as ELECTRA, ALBERT, and RoBERTa-suffer from limitations such as poor continuous learning capability and insufficient understanding of domain-specific documents. To address these issues, we propose the use of an attention mechanism to combine BERT-based models with neural topic models. Unlike traditional stochastic topic modeling, neural topic modeling employs artificial neural networks to learn topic representations. Furthermore, neural topic models can be integrated with other neural models and trained to identify latent variables in documents, thereby enabling BERT-based models to sufficiently comprehend the contexts of specific fields. We conducted experiments on three datasets-Movie Review Dataset (MRD), 20Newsgroups, and YELP-to evaluate our model's performance. Compared to the vanilla model, the proposed model achieved an accuracy improvement of 1-2% for the ALBERT model in multiclassification tasks across all three datasets, while the ELECTRA model showed an accuracy improvement of less than 1%.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Contextual Emotional Transformer-Based Model for Comment Analysis in Mental Health Case Prediction
    Ibitoye, Ayodeji O. J.
    Oladimeji, Oladosu O.
    Onifade, Olufade F. W.
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2024,
  • [42] An accurate transformer-based model for transition-based dependency parsing of free word order languages
    Zuhra, Fatima Tuz
    Saleem, Khalid
    Naz, Surayya
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (06)
  • [43] A Systematic Review of Transformer-Based Pre-Trained Language Models through Self-Supervised Learning
    Kotei, Evans
    Thirunavukarasu, Ramkumar
    INFORMATION, 2023, 14 (03)
  • [44] ToEx: Accelerating Generation Stage of Transformer-Based Language Models via Token-Adaptive Early Exit
    Kang, Myeonggu
    Park, Junyoung
    Shin, Hyein
    Shin, Jaekang
    Kim, Lee-Sup
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (09) : 2248 - 2261
  • [45] Automated tabulation of clinical trial results: A joint entity and relation extraction approach with transformer-based language representations
    Whitton, Jetsun
    Hunter, Anthony
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 144
  • [46] Pre-Trained Transformer-Based Models for Text Classification Using Low-Resourced Ewe Language
    Agbesi, Victor Kwaku
    Chen, Wenyu
    Yussif, Sophyani Banaamwini
    Hossin, Md Altab
    Ukwuoma, Chiagoziem C.
    Kuadey, Noble A.
    Agbesi, Colin Collinson
    Samee, Nagwan Abdel
    Jamjoom, Mona M.
    Al-antari, Mugahed A.
    SYSTEMS, 2024, 12 (01):
  • [47] Identify Diabetic Retinopathy-related Clinical Concepts Using Transformer-based Natural Language Processing Methods
    Yu, Zehao
    Yang, Xi
    Sweeting, Gianna L.
    Ma, Yinghan
    Stolte, Skylar E.
    Fang, Ruogu
    Wu, Yonghui
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 499 - 500
  • [48] Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods
    Yu, Zehao
    Yang, Xi
    Sweeting, Gianna L.
    Ma, Yinghan
    Stolte, Skylar E.
    Fang, Ruogu
    Wu, Yonghui
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (SUPPL 3)
  • [49] Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods
    Zehao Yu
    Xi Yang
    Gianna L. Sweeting
    Yinghan Ma
    Skylar E. Stolte
    Ruogu Fang
    Yonghui Wu
    BMC Medical Informatics and Decision Making, 22
  • [50] Transformer-Based Tool for Automated Fact-Checking of Online Health Information: Development Study
    Bayani, Azadeh
    Ayotte, Alexandre
    Nikiema, Jean Noel
    JMIR INFODEMIOLOGY, 2025, 5