A bio-inspired positional embedding network for transformer-based models

被引:2
|
作者
Tang, Xue-song [1 ,3 ]
Hao, Kuangrong [1 ,3 ,4 ]
Wei, Hui [2 ,5 ]
机构
[1] 2999 Renmin North Rd, Shanghai 201620, Peoples R China
[2] 2005 Songhu Rd, Shanghai 200434, Peoples R China
[3] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
[4] Minist Educ, Engn Res Ctr Digitized Text Apparel Technol, Shanghai, Peoples R China
[5] Fudan Univ, Sch Comp Sci, Lab Algorithms Cognit Models, Shanghai, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Transformers; Dorsal pathway modeling; Image classification; Position embedding; Zero padding;
D O I
10.1016/j.neunet.2023.07.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Owing to the progress of transformer-based networks, there have been significant improvements in the performance of vision models in recent years. However, there is further potential for improvement in positional embeddings that play a crucial role in distinguishing information across different positions. Based on the biological mechanisms of human visual pathways, we propose a positional embedding network that adaptively captures position information by modeling the dorsal pathway, which is responsible for spatial perception in human vision. Our proposed double-stream architecture leverages large zero-padding convolutions to learn local positional features and utilizes transformers to learn global features, effectively capturing the interaction between dorsal and ventral pathways. To evaluate the effectiveness of our method, we implemented experiments on various datasets, employing differentiated designs. Our statistical analysis demonstrates that the simple implementation significantly enhances image classification performance, and the observed trends demonstrate its biological plausibility.& COPY; 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:204 / 214
页数:11
相关论文
共 50 条
  • [21] Reward modeling for mitigating toxicity in transformer-based language models
    Faal, Farshid
    Schmitt, Ketra
    Yu, Jia Yuan
    APPLIED INTELLIGENCE, 2023, 53 (07) : 8421 - 8435
  • [22] HEART: Historically Information Embedding and Subspace Re-Weighting Transformer-Based Tracking
    Liu, Tianpeng
    Li, Jing
    Beheshti, Amin
    Wu, Jia
    Chang, Jun
    Song, Beihang
    Lian, Lezhi
    IEEE TRANSACTIONS ON BIG DATA, 2025, 11 (02) : 566 - 577
  • [23] Structural positional encoding for knowledge integration in transformer-based medical process monitoring and trace classification
    Irwin, Christopher
    Dossena, Marco
    Leonardi, Giorgio
    Montani, Stefania
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2024,
  • [24] Transformer-based models to deal with heterogeneous environments in Human Activity Recognition
    Ek S.
    Portet F.
    Lalanda P.
    Personal and Ubiquitous Computing, 2023, 27 (06) : 2267 - 2280
  • [25] Transformer-Based Models for Probabilistic Time Series Forecasting with Explanatory Variables
    Caetano, Ricardo
    Oliveira, Jose Manuel
    Ramos, Patricia
    MATHEMATICS, 2025, 13 (05)
  • [26] Calibration of Transformer-Based Models for Identifying Stress and Depression in Social Media
    Ilias, Loukas
    Mouzakitis, Spiros
    Askounis, Dimitris
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02) : 1979 - 1990
  • [27] Benchmarking Inference of Transformer-Based Transcription Models With Clustering on Embedded GPUs
    Schubert, Marika E.
    Langerman, David
    George, Alan D.
    IEEE ACCESS, 2024, 12 : 123276 - 123293
  • [28] Evaluation of transformer-based models for punctuation and capitalization restoration in Catalan and Galician
    Pan, Ronghao
    Garcia-Diaz, Jose Antonio
    Vivancos-Vicente, Pedro Jose
    Valencia-Garcia, Rafael
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2023, (70): : 27 - 38
  • [29] End-to-End Transformer-Based Models in Textual-Based NLP
    Rahali, Abir
    Akhloufi, Moulay A.
    AI, 2023, 4 (01) : 54 - 110
  • [30] Unlocking Bias Detection: Leveraging Transformer-Based Models for Content Analysis
    Raza, Shaina
    Bamgbose, Oluwanifemi
    Chatrath, Veronica
    Ghuge, Shardule
    Sidyakin, Yan
    Muaad, Abdullah Yahya Mohammed
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, : 1 - 13