Generative Transformer for Accurate and Reliable Salient Object Detection

被引:0
|
作者
Mao, Yuxin [1 ,2 ]
Zhang, Jing [3 ]
Wan, Zhexiong [1 ,2 ]
Tian, Xinyu [1 ,2 ]
Li, Aixuan [1 ,2 ]
Lv, Yunqiu [1 ,2 ]
Dai, Yuchao [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] Shaanxi Key Lab Informat Acquisit & Proc, Xian 710072, Peoples R China
[3] Australian Natl Univ, Sch Comp, Canberra, ACT 2601, Australia
基金
中国国家自然科学基金;
关键词
Transformers; Context modeling; Predictive models; Object detection; Accuracy; Reliability; Generative adversarial networks; Feature extraction; Decoding; Visualization; Vision transformer; salient object detection; inferential generative adversarial network; ATTENTION; NETWORK;
D O I
10.1109/TCSVT.2024.3469286
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We explore the impact of transformers on accurate and reliable salient object detection. For accuracy, we integrate the transformer with a deterministic model and delineate its advantages in structural modeling. Regarding reliability, we address the transformer's tendency to produce overly confident, incorrect predictions. To gauge reliability implicitly, we introduce a latent variable model within the transformer framework, termed the inferential generative adversarial network (iGAN). The stochastic nature of the latent variable facilitates the estimation of predictive uncertainty, which serves as an auxiliary measure of the model's prediction reliability. Different from the conventional GAN, which defines the distribution of the latent variable as fixed standard normal distribution N(0, I). The proposed iGAN infers the latent variable by gradient-based Markov Chain Monte Carlo (MCMC), namely Langevin dynamics, leading to an input-dependent latent variable model. We apply our proposed iGAN to fully supervised salient object detection, explaining that iGAN within the transformer framework leads to both accurate and reliable salient object detection.
引用
收藏
页码:1041 / 1054
页数:14
相关论文
共 50 条
  • [1] TENet: Accurate light-field salient object detection with a transformer embedding network
    Wang, Xingzheng
    Chen, Songwei
    Wei, Guoyao
    Liu, Jiehao
    IMAGE AND VISION COMPUTING, 2023, 129
  • [2] Collaborative compensative transformer network for salient object detection
    Chen, Jun
    Zhang, Heye
    Gong, Mingming
    Gao, Zhifan
    PATTERN RECOGNITION, 2024, 154
  • [3] SwinSOD: Salient object detection using swin-transformer
    Wu, Shuang
    Zhang, Guangjian
    Liu, Xuefeng
    IMAGE AND VISION COMPUTING, 2024, 146
  • [4] Distortion-aware Transformer in 360° Salient Object Detection
    Zhao, Yinjie
    Zhao, Lichen
    Yu, Qian
    Sheng, Lu
    Zhang, Jing
    Xu, Dong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 499 - 508
  • [5] Focal Perception Transformer for Light Field Salient Object Detection
    Zhao, Liming
    Zhang, Miao
    Pia, Yongri
    Yin, Jihao
    Lu, Huchuan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 3 - 18
  • [6] SDETR: ATTENTION-GUIDED SALIENT OBJECT DETECTION WITH TRANSFORMER
    Liu, Guanze
    Xu, Bo
    Huang, Han
    Lu, Cheng
    Guo, Yandong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1611 - 1615
  • [7] Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection
    Zhao, Zhirui
    Xia, Changqun
    Xie, Chenxi
    Li, Jia
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4967 - 4975
  • [8] SBN: Scale Balance Network for Accurate Salient Object Detection
    Tan, Zhenshan
    Gu, Xiaodong
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [9] Cascaded Partial Decoder for Fast and Accurate Salient Object Detection
    Wu, Zhe
    Su, Li
    Huang, Qingming
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3902 - 3911
  • [10] Sparse Reconstruction on Robust Dictionary for Accurate Salient Object Detection
    Wang, Jun
    Mao, Yi
    Wu, Guodong
    Wang, Hongjun
    Niu, Hehao
    Du, Lin
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 591 - 595