CTIF-Net: A CNN-Transformer Iterative Fusion Network for Salient Object Detection

被引:15
|
作者
Yuan, Junbin [1 ]
Zhu, Aiqing [1 ]
Xu, Qingzhen [1 ]
Wattanachote, Kanoksak [2 ]
Gong, Yongyi [2 ]
机构
[1] South China Normal Univ, Sch Comp Sci, Guangzhou 510631, Peoples R China
[2] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Intelligent Hlth & Visual Comp Lab, Guangzhou 510006, Peoples R China
关键词
CNN; transformer; iterative fusion; salient object detection; ATTENTION; MODEL;
D O I
10.1109/TCSVT.2023.3321190
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Capturing sufficient global context and rich spatial structure information is critical for dense prediction tasks. Convolutional Neural Network (CNN) is particularly adept at modeling fine-grained local features, while Transformer excels at modeling global context information. It is evident that CNN and Transformer exhibit complementary characteristics. Exploring the design of a network, that efficiently fuses these two models to leverage their strengths fully and achieve more accurate detection, represents a promising and worthwhile research topic. In this paper, we introduce a novel CNN-Transformer Iterative Fusion Network (CTIF-Net) for salient object detection. It efficiently combines CNN and Transformer to achieve superior performance by using a parallel dual encoder structure and a feature iterative fusion module. Firstly, CTIF-Net extracts features from the image using the CNN and the Transformer, respectively. Secondly, two feature convertors and a feature iterative fusion module are employed to combine and iteratively refine the two sets of features. The experimental results on multiple SOD datasets show that CTIF-Net outperforms 17 state-of-the-art methods, achieving higher performance in various mainstream evaluation metrics such as F-measure, S-measure, and MAE value. Code can be found at https://github.com/danielfaster/CTIF-Net/.
引用
收藏
页码:3795 / 3805
页数:11
相关论文
共 50 条
  • [21] EEG classification algorithm of motor imagery based on CNN-Transformer fusion network
    Liu, Haofeng
    Liu, Yuefeng
    Wang, Yue
    Liu, Bo
    Bao, Xiang
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1302 - 1309
  • [22] MFH-Net: A Hybrid CNN-Transformer Network Based Multi-Scale Fusion for Medical Image Segmentation
    Wang, Ying
    Zhang, Meng
    Liang, Jian'an
    Liang, Meiyan
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (06)
  • [23] Hybrid CNN-Transformer Network for Electricity Theft Detection in Smart Grids
    Bai, Yu
    Sun, Haitong
    Zhang, Lili
    Wu, Haoqi
    SENSORS, 2023, 23 (20)
  • [24] SaltFormer: A hybrid CNN-Transformer network for automatic salt dome detection
    Li, Yang
    Peng, Suping
    He, Dengke
    COMPUTERS & GEOSCIENCES, 2025, 195
  • [25] CSU-Net: A CNN-Transformer Parallel Network for Multimodal Brain Tumour Segmentation
    Chen, Yu
    Yin, Ming
    Li, Yu
    Cai, Qian
    ELECTRONICS, 2022, 11 (14)
  • [26] HCTA-Net: A Hybrid CNN-Transformer Attention Network for Surgical Instrument Segmentation
    Yang, Lei
    Wang, Hongyong
    Bian, Guibin
    Liu, Yanhong
    IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2023, 5 (04): : 929 - 944
  • [27] Transformer-based difference fusion network for RGB-D salient object detection
    Cui, Zhi-Qiang
    Wang, Feng
    Feng, Zheng-Yong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [28] CTFNet: CNN-Transformer Fusion Network for Remote-Sensing Image Semantic Segmentation
    Wu, Honglin
    Huang, Peng
    Zhang, Min
    Tang, Wenlong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [29] Feature extraction and fusion network for salient object detection
    Dai, Chao
    Pan, Chen
    He, Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33955 - 33969
  • [30] Hierarchical Feature Fusion Network for Salient Object Detection
    Li, Xuelong
    Song, Dawei
    Dong, Yongsheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 9165 - 9175