Structure-Preserved Self-Attention for Fusion Image Information in Multiple Color Spaces

被引:0
作者
He, Zhu [1 ,2 ]
Lin, Mingwei [1 ,2 ]
Luo, Xin [3 ]
Xu, Zeshui [4 ]
机构
[1] Fujian Normal Univ, Coll Comp & Cyber Secur, Fuzhou 350117, Peoples R China
[2] Fujian Normal Univ, Fujian Prov Engn Res Ctr Publ Serv Big Data Min &, Fuzhou 350117, Peoples R China
[3] Southwest Univ, Coll Comp & Informat Sci, Chongqing 400715, Peoples R China
[4] Sichuan Univ, Business Sch, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Image color analysis; Computational modeling; Image recognition; Feature extraction; Accuracy; Adaptation models; Convolutional neural networks; Computer architecture; Image segmentation; Image classification; Channel shuffle; color space; group convolution (GConv); image classification; self-attention mechanism; NEURAL-NETWORKS; MODEL; HSV;
D O I
10.1109/TNNLS.2024.3490800
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The selection and utilization of different color spaces significantly impact the recognition performance of deep learning models in downstream tasks. Existing studies typically leverage image information from various color spaces through model integration or channel concatenation. However, these methods result in excessive model size and suboptimal utilization of image information. In this study, we propose the structure-preserved self-attention network (SPSANet) model for efficient fusion of image information from different color spaces. This model incorporates a novel structure-preserved self-attention (SPSA) module that employs a single-head pixel-wise attention mechanism, as opposed to the conventional multihead self-attention (MHSA) approach. Specifically, feature maps from all color space grouping paths are utilized for similarity matching, enabling the model to focus on critical pixel locations across different color spaces. This design mitigates the dependence of the SPSANet model on the choice of color space while enhancing the advantages of integrating multiple color spaces. The SPSANet model also employs channel shuffle operations to facilitate limited interaction between information flows from different color space paths. Experimental results demonstrate that the SPSANet model, utilizing eight common color spaces-RGB, Luv, XYZ, Lab, HSV, YCrCb, YUV, and HLS-achieves superior recognition performance with reduced parameters and computational cost.
引用
收藏
页数:15
相关论文
共 72 条
[61]  
Yang J. W., 2021, arXiv
[62]   Automatic greenhouse pest recognition based on multiple color space features [J].
Yang, Zhankui ;
Li, Wenyong ;
Li, Ming ;
Yang, Xinting .
INTERNATIONAL JOURNAL OF AGRICULTURAL AND BIOLOGICAL ENGINEERING, 2021, 14 (02) :188-195
[63]   Convolutional Neural Network for Traffic Sign Recognition based on Color Space [J].
Yildiz, Gulcan ;
Dizdaroglu, Bekir .
2ND INTERNATIONAL INFORMATICS AND SOFTWARE ENGINEERING CONFERENCE (IISEC), 2021,
[64]  
Yuzkat M, 2021, Avrupa Bilim ve Teknoloji Dergisi, V21, P70
[65]  
Zagoruyko Sergey, 2016, BRIT MACH VIS C BMVC
[66]   A Novel Prediction-Based Temporal Graph Routing Algorithm for Software-Defined Vehicular Networks [J].
Zhao, Liang ;
Li, Zhuhui ;
Al-Dubai, Ahmed Y. ;
Min, Geyong ;
Li, Jiajia ;
Hawbani, Ammar ;
Zomaya, Albert Y. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) :13275-13290
[67]   Vehicular Computation Offloading for Industrial Mobile Edge Computing [J].
Zhao, Liang ;
Yang, Kaiqi ;
Tan, Zhiyuan ;
Song, Houbing ;
Al-Dubai, Ahmed ;
Zomaya, Albert Y. ;
Li, Xianwei .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (11) :7871-7881
[68]   A Novel Cost Optimization Strategy for SDN-Enabled UAV-Assisted Vehicular Computation Offloading [J].
Zhao, Liang ;
Yang, Kaiqi ;
Tan, Zhiyuan ;
Li, Xianwei ;
Sharma, Suraj ;
Liu, Zhi .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) :3664-3674
[69]   Object Detection With Deep Learning: A Review [J].
Zhao, Zhong-Qiu ;
Zheng, Peng ;
Xu, Shou-Tao ;
Wu, Xindong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (11) :3212-3232
[70]   Places: A 10 Million Image Database for Scene Recognition [J].
Zhou, Bolei ;
Lapedriza, Agata ;
Khosla, Aditya ;
Oliva, Aude ;
Torralba, Antonio .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (06) :1452-1464