CTIF-Net: A CNN-Transformer Iterative Fusion Network for Salient Object Detection

被引:15
|
作者
Yuan, Junbin [1 ]
Zhu, Aiqing [1 ]
Xu, Qingzhen [1 ]
Wattanachote, Kanoksak [2 ]
Gong, Yongyi [2 ]
机构
[1] South China Normal Univ, Sch Comp Sci, Guangzhou 510631, Peoples R China
[2] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Intelligent Hlth & Visual Comp Lab, Guangzhou 510006, Peoples R China
关键词
CNN; transformer; iterative fusion; salient object detection; ATTENTION; MODEL;
D O I
10.1109/TCSVT.2023.3321190
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Capturing sufficient global context and rich spatial structure information is critical for dense prediction tasks. Convolutional Neural Network (CNN) is particularly adept at modeling fine-grained local features, while Transformer excels at modeling global context information. It is evident that CNN and Transformer exhibit complementary characteristics. Exploring the design of a network, that efficiently fuses these two models to leverage their strengths fully and achieve more accurate detection, represents a promising and worthwhile research topic. In this paper, we introduce a novel CNN-Transformer Iterative Fusion Network (CTIF-Net) for salient object detection. It efficiently combines CNN and Transformer to achieve superior performance by using a parallel dual encoder structure and a feature iterative fusion module. Firstly, CTIF-Net extracts features from the image using the CNN and the Transformer, respectively. Secondly, two feature convertors and a feature iterative fusion module are employed to combine and iteratively refine the two sets of features. The experimental results on multiple SOD datasets show that CTIF-Net outperforms 17 state-of-the-art methods, achieving higher performance in various mainstream evaluation metrics such as F-measure, S-measure, and MAE value. Code can be found at https://github.com/danielfaster/CTIF-Net/.
引用
收藏
页码:3795 / 3805
页数:11
相关论文
共 50 条
  • [1] CTFU-Net:CNN-Transformer Fusion U-shaped Network for Moving Object Detection
    Xia, Tingting
    Yang, Yizhong
    2024 3RD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MEDIA COMPUTING, ICIPMC 2024, 2024, : 44 - 50
  • [2] CT-Net: an interpretable CNN-Transformer fusion network for fNIRS classification
    Liao, Lingxiang
    Lu, Jingqing
    Wang, Lutao
    Zhang, Yongqing
    Gao, Dongrui
    Wang, Manqing
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (10) : 3233 - 3247
  • [3] Object Detection Algorithm Based on CNN-Transformer Dual Modal Feature Fusion
    Yang Chen
    Hou Zhiqiang
    Li Xinyue
    Ma Sugang
    Yang Xiaobao
    ACTA PHOTONICA SINICA, 2024, 53 (03)
  • [4] Relating CNN-Transformer Fusion Network for Remote Sensing Change Detection
    Gao, Yuhao
    Pei, Gensheng
    Sheng, Mengmeng
    Sun, Zeren
    Chen, Tao
    Yao, Yazhou
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
  • [5] A semi-parallel CNN-transformer fusion network for semantic change detection
    Zou, Changzhong
    Wang, Ziyuan
    IMAGE AND VISION COMPUTING, 2024, 149
  • [6] A Hybrid CNN-Transformer Network for Object Detection in Optical Remote Sensing Images: Integrating Local and Global Feature Fusion
    Huang, Youxiang
    Jiao, Donglai
    Huang, Xingru
    Tang, Tiantian
    Gui, Guan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 241 - 254
  • [7] GhostFormer: Efficiently amalgamated CNN-transformer architecture for object detection
    Xie, Xin
    Wu, Dengquan
    Xie, Mingye
    Li, Zixi
    PATTERN RECOGNITION, 2024, 148
  • [8] A hierarchical CNN-Transformer model for network intrusion detection
    Luo, Sijie
    Zhao, Zhiheng
    Hu, Qiyuan
    Liu, Yang
    2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELLING, AND INTELLIGENT COMPUTING (CAMMIC 2022), 2022, 12259
  • [9] CTAFFNet: CNN-Transformer Adaptive Feature Fusion Object Detection Algorithm for Complex Traffic Scenarios
    Dong, Xinlong
    Shi, Peicheng
    Liang, Taonian
    Yang, Aixi
    TRANSPORTATION RESEARCH RECORD, 2024,
  • [10] DBCT-Net:A dual branch hybrid CNN-transformer network for remote sensing image fusion
    Wang, Quanli
    Jin, Xin
    Jiang, Qian
    Wu, Liwen
    Zhang, Yunchun
    Zhou, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233