Adversarial Transformers for Weakly Supervised Object Localization

被引:5
|
作者
Meng, Meng [1 ]
Zhang, Tianzhu [1 ]
Zhang, Zhe [2 ,3 ]
Zhang, Yongdong [4 ]
Wu, Feng [4 ,5 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[3] Lunar Explorat & Space Engn Ctr CNSA, Beijing, Peoples R China
[4] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China
[5] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China
关键词
Perturbation methods; Adversarial training; transformers; weakly supervised object localization; SEMANTIC SEGMENTATION;
D O I
10.1109/TIP.2022.3220055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object localization (WSOL) aims at localizing objects with only image-level labels, which has better scalability and practicability than fully supervised methods. However, without pixel-level supervision, existing methods tend to generate rough localization maps, which hinders localization performance. To alleviate this problem, we propose an adversarial transformer network (ATNet), which aims to obtain a well-learned localization model with pixel-level pseudo labels. The proposed ATNet enjoys several merits. First, we design an object transformer ( $G$ ) that can generate localization maps and pseudo labels effectively and dynamically, and a part transformer ( $D$ ) to accurately discriminate detailed local differences between localization maps and pseudo labels. Second, we propose to train $G$ and $D$ via an adversarial process, where $G$ can generate more accurate localization maps approaching pseudo labels to fool $D$ . To the best of our knowledge, this is the first work to explore transformers with adversarial training to obtain a well-learned localization model for WSOL. Extensive experiments with four backbones on two standard benchmarks demonstrate that our ATNet achieves favorable performance against state-of-the-art WSOL methods. Besides, our adversarial training can provide higher robustness against adversarial attacks.
引用
收藏
页码:7130 / 7143
页数:14
相关论文
共 50 条
  • [1] Entropy guided adversarial model for weakly supervised object localization
    Benassou, Sabrina Narimene
    Shi, Wuzhen
    Jiang, Feng
    NEUROCOMPUTING, 2021, 429 : 60 - 68
  • [2] Improving weakly-supervised object localization using adversarial erasing and label
    Kang, Byeongkeun
    Cha, Sinhae
    Lee, Yeejin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [3] Improving weakly-supervised object localization using adversarial erasing and pseudo label
    Kang, Byeongkeun
    Cha, Sinhae
    Lee, Yeejin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [4] Improving weakly-supervised object localization using adversarial erasing and pseudo label
    Kang B.
    Cha S.
    Lee Y.
    Engineering Applications of Artificial Intelligence, 2024, 133
  • [5] Token Masking Transformer for Weakly Supervised Object Localization
    Xu, Wenhao
    Wang, Changwei
    Xu, Rongtao
    Xu, Shibiao
    Meng, Weiliang
    Zhang, Man
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 2059 - 2069
  • [6] DiPS: Discriminative pseudo-label sampling with self-supervised transformers for weakly supervised object localization
    Murtaza, Shakeeb
    Belharbi, Soufiane
    Pedersoli, Marco
    Sarraf, Aydin
    Granger, Eric
    IMAGE AND VISION COMPUTING, 2023, 140
  • [7] LBTR: local feature mining and background activation minimization in transformers for weakly supervised object localization
    Li, Bingfeng
    Chen, Qingshan
    Lv, Boxiang
    Wang, Keping
    Li, Xinwei
    JOURNAL OF ELECTRONIC IMAGING, 2025, 34 (01)
  • [8] SALIENCY AWARE: WEAKLY SUPERVISED OBJECT LOCALIZATION
    Chen, Yun-Chun
    Hsu, Winston H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1907 - 1911
  • [9] Adaptive attention augmentor for weakly supervised object localization
    Zhang, Longhao
    Yang, Huihua
    NEUROCOMPUTING, 2021, 454 : 474 - 482
  • [10] Hierarchical complementary learning for weakly supervised object localization
    Benassou, Sabrina Narimene
    Shi, Wuzhen
    Jiang, Feng
    Benzine, Abdallah
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 100