PL-Transformer: a POS-aware and layer ensemble transformer for text classification

被引：3

作者：

Shi, Yu ^{[1
]}

Zhang, Xi ^{[1
]}

Yu, Ning ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Key Lab Trustworthy Distributed Comp & Serv BUPT, Minist Educ, Beijing, Peoples R China

来源：

NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 02期

关键词：

Text classification; Transformer; Part-of-speech; Layer ensemble;

D O I：

10.1007/s00521-022-07872-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The transformer-based models have become the de-facto standard for natural language processing (NLP) tasks. However, most of these models are only designed to capture the implicit semantics among tokens without considering the extra off-the-shelf knowledge (e.g., parts-of-speech) to facilitate the NLP tasks. Additionally, despite using multiple attention-based encoders, they only utilize the embeddings from the last layer, ignoring that from other layers. To address these issues, in this paper, we propose a novel POS-aware and layer ensemble transformer neural network (named as PL-Transformer). PL-Transformer utilizes the parts-of-speech information explicitly and leverages the outputs from different encoder layers with correlation coefficient attention (C-Encoder) jointly. Moreover, we use correlation coefficient attention to bound dot product in C-Encoder, which improves the overall model performance. Extensive experiments on four datasets demonstrate that PL-Transformer can improve the text classification performance. For example, the accuracy on the MPQA dataset is improved by 3.95% over the vanilla transformer.

引用

页码：1971 / 1982

页数：12

共 44 条

[31] A Comparative Survey of Instance Selection Methods applied to Non-Neural and Transformer-Based Text Classification
Cunha, Washington
Viegas, Felipe
Franca, Celso
Rosa, Thierson
Rocha, Leonardo
Goncalves, Marcos Andre
ACM COMPUTING SURVEYS, 2023, 55 (13S)
[32] Video–text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network
Gang Lv
Yining Sun
Fudong Nian
Multimedia Systems, 2024, 30
[33] QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text recognition using a Query-aware Transformer
Liu, Chongyu
Jiang, Qing
Peng, Dezhi
Kong, Yuxin
Zhang, Jiaixin
Xiong, Longfei
Duan, Jiwei
Sun, Cheng
Jin, Lianwen
NEUROCOMPUTING, 2025, 620
[34] Pre-Trained Transformer-Based Models for Text Classification Using Low-Resourced Ewe Language
Agbesi, Victor Kwaku
Chen, Wenyu
Yussif, Sophyani Banaamwini
Hossin, Md Altab
Ukwuoma, Chiagoziem C.
Kuadey, Noble A.
Agbesi, Colin Collinson
Samee, Nagwan Abdel
Jamjoom, Mona M.
Al-antari, Mugahed A.
SYSTEMS, 2024, 12 (01):
[35] CACFTNet: A Hybrid Cov-Attention and Cross-Layer Fusion Transformer Network for Hyperspectral Image Classification
Cheng, Shuli
Chan, Runze
Du, Anyu
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 18 - 18
[36] Video-text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network
Lv, Gang
Sun, Yining
Nian, Fudong
MULTIMEDIA SYSTEMS, 2024, 30 (01)
[37] BVA-Transformer: Image-text multimodal classification and dialogue model architecture based on Blip and visual attention mechanism
Zhang, Kaiyu
Wu, Fei
Zhang, Guowei
Liu, Jiawei
Li, Min
DISPLAYS, 2024, 83
[38] Reduce the medical burden: An automatic medical tri-age system using text classification BERT based on Transformer structure
Wang, Xinyuan
Tao, Make
Wang, Runpu
Zhang, Likui
2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 679 - 685
[39] Automated text classification of opinion vs. news French press articles. A comparison of transformer and feature-based approaches
Escou, Louis
Descampe, Antonin
Fairon, Cedrick
LANGUAGE & COMMUNICATION, 2024, 99 : 129 - 140
[40] COVID19 to Pneumonia: Multi Region Lung Severity Classification Using CNN Transformer Position-Aware Feature Encoding Network
Lee, Jong Bub
Kim, Jung Soo
Lee, Hyun Gyu
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT I, 2024, 15001 : 472 - 481

← 1 2 3 4 5 →