Construction of multi-channel fusion salient object detection network based on gating mechanism and pooling network

被引:1
|
作者
Ning, Li [1 ,2 ]
Jincai, Huang [1 ]
Yanghe, Feng [1 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha, Hunan, Peoples R China
[2] China Inst Marine Technol & Econ, Marine Human Factors Engn Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Salienct object recognition; Gating mechanism; Pooling network; Multi-channel fusion; Convolutional Neural network;
D O I
10.1007/s11042-021-11031-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We combine SNet network based on gating mechanism with poolnet network to solve the problem of salient object detection. The network construction of this paper is based on FPN, which is a classic U-net backbone network. Inspired by poolnet, we also introduce the global feature guidance module. By aggregating the high-level semantic information into the transposition convolution stage of different scales, the higher-level semantic features can be used more effectively. Although the introduction of global information can effectively improve the effect of saliency monitoring, how to aggregate the global and local features of different scales still needs to be further explored. Inspired by SNet network, we also integrate snet into our network. In the specific feature fusion process, the feature values of different channels are weighted, and each channel is given different weights. The more important semantic information is extracted from multiple channels, and the key semantic information in the feature map is retained. Compared with the current typical methods, we find that the introduction of snet module can reduce the generation of error areas of saliency map, and further improve the integrity of saliency map. For different regions of the same object, due to the difference of color contrast and texture, the saliency map generated by the previous method is inconsistent in the same object region. Our method can effectively solve this problem. For the same object, we can generate consistent results of saliency probability. Through quantitative evaluation with the existing 15 methods (including SOTA method). Our network can process 300 * 267 images faster than 11FPS, which is at a medium level compared with the most advanced networks. These networks include DGRL, PiCANet, PoolNet and so on. The Precision and Recall curve results show that our network performs well on DUT-O, DUT-S and ECSSD data sets, and the minimum precision values are all above 0.47. The false positive prediction of salient objects in the graph is low, and the overall performance of the model is good.
引用
收藏
页码:12111 / 12126
页数:16
相关论文
共 50 条
  • [21] Lightweight video salient object detection via channel-shuffle enhanced multi-modal fusion network
    Huang, Kan
    Xu, Zhijing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 1025 - 1039
  • [22] Multi-channel GCN network based on position-aware gating fusion for aspect sentiment triplet extraction
    Yang, Kun
    Wang, Xiao
    Gao, Bin
    Liu, Shutian
    Liu, Zhengjun
    NEUROCOMPUTING, 2025, 619
  • [23] LFNet: Light Field Fusion Network for Salient Object Detection
    Zhang, Miao
    Ji, Wei
    Piao, Yongri
    Li, Jingjing
    Zhang, Yu
    Xu, Shuang
    Lu, Huchuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 6276 - 6287
  • [24] A multi-channel convolutional neural network based on attention mechanism fusion for facial expression recognition
    Zhu, Muqing
    Wen, Mi
    APPLIED MATHEMATICS AND NONLINEAR SCIENCES, 2023, 9 (01)
  • [25] Multi-channel distribution mechanism based on BP neural network
    Zhai, Xue Ming
    Wang, Jia
    Li, Jin Ze
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MECHATRONICS AND INDUSTRIAL INFORMATICS, 2015, 31 : 358 - 361
  • [26] Cross channel aggregation similarity network for salient object detection
    Chen, Liyuan
    Liu, Huawen
    Mo, Jiashuaizi
    Zhang, Dawei
    Yang, Jie
    Lin, Feilong
    Zheng, Zhonglong
    Jia, Riheng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (08) : 2153 - 2169
  • [27] Cross channel aggregation similarity network for salient object detection
    Liyuan Chen
    Huawen Liu
    Jiashuaizi Mo
    Dawei Zhang
    Jie Yang
    Feilong Lin
    Zhonglong Zheng
    Riheng Jia
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 2153 - 2169
  • [28] Object Detection Network Based on Feature Fusion and Attention Mechanism
    Zhang, Ying
    Chen, Yimin
    Huang, Chen
    Gao, Mingke
    FUTURE INTERNET, 2019, 11 (01):
  • [29] Target Classification based on Sensor Fusion in Multi-Channel Seismic Network
    Zubair, Mussab
    Hartmann, Klaus
    2011 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2011, : 438 - 443
  • [30] Offline Handwriting Verification Based on Siamese Network and Multi-channel Fusion
    Lin, Chao-Qun
    Wang, Da-Han
    Xiao, Shun-Xin
    Chi, Xue-Ke
    Wang, Chi-Ming
    Zhang, Xu-Yao
    Zhu, Shun-Zhi
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (08): : 1660 - 1670