PL-Transformer: a POS-aware and layer ensemble transformer for text classification

被引:3
|
作者
Shi, Yu [1 ]
Zhang, Xi [1 ]
Yu, Ning [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Key Lab Trustworthy Distributed Comp & Serv BUPT, Minist Educ, Beijing, Peoples R China
关键词
Text classification; Transformer; Part-of-speech; Layer ensemble;
D O I
10.1007/s00521-022-07872-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The transformer-based models have become the de-facto standard for natural language processing (NLP) tasks. However, most of these models are only designed to capture the implicit semantics among tokens without considering the extra off-the-shelf knowledge (e.g., parts-of-speech) to facilitate the NLP tasks. Additionally, despite using multiple attention-based encoders, they only utilize the embeddings from the last layer, ignoring that from other layers. To address these issues, in this paper, we propose a novel POS-aware and layer ensemble transformer neural network (named as PL-Transformer). PL-Transformer utilizes the parts-of-speech information explicitly and leverages the outputs from different encoder layers with correlation coefficient attention (C-Encoder) jointly. Moreover, we use correlation coefficient attention to bound dot product in C-Encoder, which improves the overall model performance. Extensive experiments on four datasets demonstrate that PL-Transformer can improve the text classification performance. For example, the accuracy on the MPQA dataset is improved by 3.95% over the vanilla transformer.
引用
收藏
页码:1971 / 1982
页数:12
相关论文
共 44 条
  • [31] A Comparative Survey of Instance Selection Methods applied to Non-Neural and Transformer-Based Text Classification
    Cunha, Washington
    Viegas, Felipe
    Franca, Celso
    Rosa, Thierson
    Rocha, Leonardo
    Goncalves, Marcos Andre
    ACM COMPUTING SURVEYS, 2023, 55 (13S)
  • [32] Video–text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network
    Gang Lv
    Yining Sun
    Fudong Nian
    Multimedia Systems, 2024, 30
  • [33] QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text recognition using a Query-aware Transformer
    Liu, Chongyu
    Jiang, Qing
    Peng, Dezhi
    Kong, Yuxin
    Zhang, Jiaixin
    Xiong, Longfei
    Duan, Jiwei
    Sun, Cheng
    Jin, Lianwen
    NEUROCOMPUTING, 2025, 620
  • [34] Pre-Trained Transformer-Based Models for Text Classification Using Low-Resourced Ewe Language
    Agbesi, Victor Kwaku
    Chen, Wenyu
    Yussif, Sophyani Banaamwini
    Hossin, Md Altab
    Ukwuoma, Chiagoziem C.
    Kuadey, Noble A.
    Agbesi, Colin Collinson
    Samee, Nagwan Abdel
    Jamjoom, Mona M.
    Al-antari, Mugahed A.
    SYSTEMS, 2024, 12 (01):
  • [35] CACFTNet: A Hybrid Cov-Attention and Cross-Layer Fusion Transformer Network for Hyperspectral Image Classification
    Cheng, Shuli
    Chan, Runze
    Du, Anyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 18 - 18
  • [36] Video-text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network
    Lv, Gang
    Sun, Yining
    Nian, Fudong
    MULTIMEDIA SYSTEMS, 2024, 30 (01)
  • [37] BVA-Transformer: Image-text multimodal classification and dialogue model architecture based on Blip and visual attention mechanism
    Zhang, Kaiyu
    Wu, Fei
    Zhang, Guowei
    Liu, Jiawei
    Li, Min
    DISPLAYS, 2024, 83
  • [38] Reduce the medical burden: An automatic medical tri-age system using text classification BERT based on Transformer structure
    Wang, Xinyuan
    Tao, Make
    Wang, Runpu
    Zhang, Likui
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 679 - 685
  • [39] Automated text classification of opinion vs. news French press articles. A comparison of transformer and feature-based approaches
    Escou, Louis
    Descampe, Antonin
    Fairon, Cedrick
    LANGUAGE & COMMUNICATION, 2024, 99 : 129 - 140
  • [40] COVID19 to Pneumonia: Multi Region Lung Severity Classification Using CNN Transformer Position-Aware Feature Encoding Network
    Lee, Jong Bub
    Kim, Jung Soo
    Lee, Hyun Gyu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT I, 2024, 15001 : 472 - 481