Adversarial Transformers for Weakly Supervised Object Localization

被引:5
|
作者
Meng, Meng [1 ]
Zhang, Tianzhu [1 ]
Zhang, Zhe [2 ,3 ]
Zhang, Yongdong [4 ]
Wu, Feng [4 ,5 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[3] Lunar Explorat & Space Engn Ctr CNSA, Beijing, Peoples R China
[4] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China
[5] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China
关键词
Perturbation methods; Adversarial training; transformers; weakly supervised object localization; SEMANTIC SEGMENTATION;
D O I
10.1109/TIP.2022.3220055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object localization (WSOL) aims at localizing objects with only image-level labels, which has better scalability and practicability than fully supervised methods. However, without pixel-level supervision, existing methods tend to generate rough localization maps, which hinders localization performance. To alleviate this problem, we propose an adversarial transformer network (ATNet), which aims to obtain a well-learned localization model with pixel-level pseudo labels. The proposed ATNet enjoys several merits. First, we design an object transformer ( $G$ ) that can generate localization maps and pseudo labels effectively and dynamically, and a part transformer ( $D$ ) to accurately discriminate detailed local differences between localization maps and pseudo labels. Second, we propose to train $G$ and $D$ via an adversarial process, where $G$ can generate more accurate localization maps approaching pseudo labels to fool $D$ . To the best of our knowledge, this is the first work to explore transformers with adversarial training to obtain a well-learned localization model for WSOL. Extensive experiments with four backbones on two standard benchmarks demonstrate that our ATNet achieves favorable performance against state-of-the-art WSOL methods. Besides, our adversarial training can provide higher robustness against adversarial attacks.
引用
收藏
页码:7130 / 7143
页数:14
相关论文
共 50 条
  • [21] Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization
    Wang, Changwei
    Xu, Rongtao
    Xu, Shibiao
    Meng, Weiliang
    Wang, Ruisheng
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1045 - 1058
  • [22] HiCT: Hierarchical Comprehend of Transformer for Weakly Supervised Object Localization
    Sun, Wanchun
    Feng, Xin
    Ma, Hui
    Liu, Jingyao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [23] Advanced wildfire detection using generative adversarial network-based augmented datasets and weakly supervised object localization
    Park, Minsoo
    Dai Quoc Tran
    Bak, Jinyeong
    Park, Seunghee
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 114
  • [24] Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization
    Lee, Jungbeom
    Kim, Eunji
    Mok, Jisoo
    Yoon, Sungroh
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1618 - 1634
  • [25] Proxy Probing Decoder for Weakly Supervised Object Localization: A Baseline Investigation
    Xu, Jingyuan
    Xie, Hongtao
    Liu, Chuanbin
    Zhang, Yongdong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4185 - 4193
  • [26] Dual-branch contrastive learning for weakly supervised object localization
    Guo, Zebin
    Li, Dong
    Du, Zhengjun
    Seng, Bingfeng
    APPLIED INTELLIGENCE, 2025, 55 (07)
  • [27] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Wei Zhai
    Pingyu Wu
    Kai Zhu
    Yang Cao
    Feng Wu
    Zheng-Jun Zha
    International Journal of Computer Vision, 2024, 132 (3) : 750 - 775
  • [28] Weakly Supervised Object Localization Based on Attention Mechanism and Categorical Hierarchy
    Feng X.
    Yang J.
    Zhou T.
    Gong C.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (10): : 4916 - 4929
  • [29] Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration
    Bai, Haotian
    Zhang, Ruimao
    Wang, Jiong
    Wan, Xiang
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 612 - 628
  • [30] RSMNet: A Regional Similar Module Network for Weakly Supervised Object Localization
    Ling, Zhigang
    Li, Liang
    Zhang, Aoran
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5079 - 5097