A Lightweight Transformer with Convolutional Attention

被引:0
|
作者
Zeng, Kungan [1 ]
Paik, Incheon [1 ]
机构
[1] Univ Aizu, Sch Comp Sci & Engn, Fukushima, Japan
来源
2020 11TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST) | 2020年
关键词
neural machine translation; Transformer; CNN; Muti-head attention;
D O I
10.1109/ICAST51195.2020.9319489
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Neural machine translation (NMT) goes through rapid development because of the application of various deep learning techs. Especially, how to construct a more effective structure of NMT attracts more and more attention. Transformer is a state-of-the-art architecture in NMT. It replies on the self-attention mechanism exactly instead of recurrent neural networks (RNN). The Multi-head attention is a crucial part that implements the self-attention mechanism, and it also dramatically affects the scale of the model. In this paper, we present a new Multi-head attention by combining convolution operation. In comparison with the base Transformer, our approach can reduce the number of parameters effectively. And we perform a reasoned experiment. The result shows that the performance of the new model is similar to the base model.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] CCT: Lightweight compact convolutional transformer for lung disease CT image classification
    Sun, Weiwei
    Pang, Yu
    Zhang, Guo
    FRONTIERS IN PHYSIOLOGY, 2022, 13
  • [22] CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
    Zhang, Zhiwen
    Li, Teng
    Tang, Xuebin
    Hu, Xiang
    Peng, Yuanxi
    SENSORS, 2022, 22 (10)
  • [23] Patch attention convolutional vision transformer for facial expression recognition with occlusion
    Liu, Chang
    Hirota, Kaoru
    Dai, Yaping
    INFORMATION SCIENCES, 2023, 619 : 781 - 794
  • [24] Efficient convolutional dual-attention transformer for automatic modulation recognition
    Yi, Zengrui
    Meng, Hua
    Gao, Lu
    He, Zhonghang
    Yang, Meng
    APPLIED INTELLIGENCE, 2025, 55 (03)
  • [25] ASAFormer: Visual tracking with convolutional vision transformer and asymmetric selective attention
    Gong, Xiaomei
    Zhang, Yi
    Hu, Shu
    KNOWLEDGE-BASED SYSTEMS, 2024, 291
  • [26] Attention dual transformer with adaptive temporal convolutional for diabetic retinopathy detection
    Mishmala Sushith
    Ajanthaa Lakkshmanan
    M. Saravanan
    S. Castro
    Scientific Reports, 15 (1)
  • [27] ASAFormer: Visual tracking with convolutional vision transformer and asymmetric selective attention
    Gong, Xiaomei
    Zhang, Yi
    Hu, Shu
    Knowledge-Based Systems, 2024, 291
  • [28] Spectral Superresolution Using Transformer with Convolutional Spectral Self-Attention
    Liao, Xiaomei
    He, Lirong
    Mao, Jiayou
    Xu, Meng
    REMOTE SENSING, 2024, 16 (10)
  • [29] Image Inpainting Using Lightweight Transformer Neural Network Based on Channel Attention
    Liao, Jan-Ray
    Hsieh, Shao-Yueh
    PROCEEDINGS OF 2023 THE 12TH INTERNATIONAL CONFERENCE ON NETWORKS, COMMUNICATION AND COMPUTING, ICNCC 2023, 2023, : 247 - 253
  • [30] Lightweight Convolutional Network with Integrated Attention Mechanism for Missing Bolt Detection in Railways
    Alif, Mujadded Al Rabbani
    Hussain, Muhammad
    METROLOGY, 2024, 4 (02): : 254 - 278