End-to-End Transformer-Based Models in Textual-Based NLP

被引:32
|
作者
Rahali, Abir [1 ]
Akhloufi, Moulay A. [1 ]
机构
[1] Univ Moncton, Dept Comp Sci, Percept Robot & Intelligent Machines Res Grp PRIME, Moncton, NB E1A 3E9, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Transformers; deep learning; natural language processing; transfer learning; PRE-TRAINED BERT; PREDICTION; SYSTEMS;
D O I
10.3390/ai4010004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer architectures are highly expressive because they use self-attention mechanisms to encode long-range dependencies in the input sequences. In this paper, we present a literature review on Transformer-based (TB) models, providing a detailed overview of each model in comparison to the Transformer's standard architecture. This survey focuses on TB models used in the field of Natural Language Processing (NLP) for textual-based tasks. We begin with an overview of the fundamental concepts at the heart of the success of these models. Then, we classify them based on their architecture and training mode. We compare the advantages and disadvantages of popular techniques in terms of architectural design and experimental value. Finally, we discuss open research, directions, and potential future work to help solve current TB application challenges in NLP.
引用
收藏
页码:54 / 110
页数:57
相关论文
共 50 条
  • [41] End-to-end deep learning for directly estimating grape yield from ground-based imagery
    Olenskyj, Alexander G.
    Sams, Brent S.
    Fei, Zhenghao
    Singh, Vishal
    Raja, Pranav, V
    Bornhorst, Gail M.
    Earles, J. Mason
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 198
  • [42] Enhancing Arabic Cyberbullying Detection with End-to-End Transformer Model
    Mahdi, Mohamed A.
    Fati, Suliman Mohamed
    Hazber, Mohamed A. G.
    Ahamad, Shahanawaj
    Saad, Sawsan A.
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 141 (02): : 1651 - 1671
  • [43] Transformer-based language models for mental health issues: A survey
    Greco, Candida M.
    Simeri, Andrea
    Tagarelli, Andrea
    Zumpano, Ester
    PATTERN RECOGNITION LETTERS, 2023, 167 : 204 - 211
  • [44] TransVG plus plus : End-to-End Visual Grounding With Language Conditioned Vision Transformer
    Deng, Jiajun
    Yang, Zhengyuan
    Liu, Daqing
    Chen, Tianlang
    Zhou, Wengang
    Zhang, Yanyong
    Li, Houqiang
    Ouyang, Wanli
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13636 - 13652
  • [45] Central Kurdish Text-to-Speech Synthesis with Novel End-to-End Transformer Training
    Ahmad, Hawraz A.
    Rashid, Tarik A.
    ALGORITHMS, 2024, 17 (07)
  • [46] End-to-End Network Intrusion Detection Based on Contrastive Learning
    Li, Longlong
    Lu, Yuliang
    Yang, Guozheng
    Yan, Xuehu
    SENSORS, 2024, 24 (07)
  • [47] End-to-End Learning-Based Image Compression: A Review
    Chen Jimin
    Lin Zehao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [48] An End-to-End Compression Framework Based on Convolutional Neural Networks
    Jiang, Feng
    Tao, Wen
    Liu, Shaohui
    Ren, Jie
    Guo, Xun
    Zhao, Debin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 3007 - 3018
  • [49] End-to-End Powerline Detection Based on Images from UAVs
    Hu, Jingwei
    He, Jing
    Guo, Chengjun
    REMOTE SENSING, 2023, 15 (06)
  • [50] Tunisian Dialectal End-to-end Speech Recognition based on DeepSpeech
    Messaoudi, Abir
    Haddad, Hatem
    Fourati, Chayma
    Hmida, Moez BenHaj
    Mabrouk, Aymen Ben Elhaj
    Graiet, Mohamed
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 183 - 190