Adversarial Transformers for Weakly Supervised Object Localization

被引:5
|
作者
Meng, Meng [1 ]
Zhang, Tianzhu [1 ]
Zhang, Zhe [2 ,3 ]
Zhang, Yongdong [4 ]
Wu, Feng [4 ,5 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China
[3] Lunar Explorat & Space Engn Ctr CNSA, Beijing, Peoples R China
[4] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China
[5] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China
关键词
Perturbation methods; Adversarial training; transformers; weakly supervised object localization; SEMANTIC SEGMENTATION;
D O I
10.1109/TIP.2022.3220055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object localization (WSOL) aims at localizing objects with only image-level labels, which has better scalability and practicability than fully supervised methods. However, without pixel-level supervision, existing methods tend to generate rough localization maps, which hinders localization performance. To alleviate this problem, we propose an adversarial transformer network (ATNet), which aims to obtain a well-learned localization model with pixel-level pseudo labels. The proposed ATNet enjoys several merits. First, we design an object transformer ( $G$ ) that can generate localization maps and pseudo labels effectively and dynamically, and a part transformer ( $D$ ) to accurately discriminate detailed local differences between localization maps and pseudo labels. Second, we propose to train $G$ and $D$ via an adversarial process, where $G$ can generate more accurate localization maps approaching pseudo labels to fool $D$ . To the best of our knowledge, this is the first work to explore transformers with adversarial training to obtain a well-learned localization model for WSOL. Extensive experiments with four backbones on two standard benchmarks demonstrate that our ATNet achieves favorable performance against state-of-the-art WSOL methods. Besides, our adversarial training can provide higher robustness against adversarial attacks.
引用
收藏
页码:7130 / 7143
页数:14
相关论文
共 50 条
  • [31] RSMNet: A Regional Similar Module Network for Weakly Supervised Object Localization
    Zhigang Ling
    Liang Li
    Aoran Zhang
    Neural Processing Letters, 2022, 54 : 5079 - 5097
  • [32] Weakly-supervised object localization with gradient-pyramid feature
    Mao, Zhongjie
    Zhou, Yipeng
    Sun, Jun
    Wu, Hao
    Pan, Feng
    Ahmad, Bilal
    APPLIED INTELLIGENCE, 2023, 53 (03) : 2923 - 2935
  • [33] Weakly-supervised object localization with gradient-pyramid feature
    Zhongjie Mao
    Yipeng Zhou
    Jun Sun
    Hao Wu
    Feng Pan
    Bilal Ahmad
    Applied Intelligence, 2023, 53 : 2923 - 2935
  • [34] Exploring Pixel Alignment on Shallow Feature for Weakly Supervised Object Localization
    Cao, Xinzi
    Yang, Meng
    Sun, Guoyin
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [35] Attention-based Selection Strategy for Weakly Supervised Object Localization
    Zhang, Zhenfei
    Bui, Tien D.
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10305 - 10311
  • [36] Instance Annotation via Optimal BoW for Weakly Supervised Object Localization
    Wang, Liantao
    Meng, Deyu
    Hu, Xuelei
    Lu, Jianfeng
    Zhao, Ji
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (05) : 1313 - 1324
  • [37] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Zhai, Wei
    Wu, Pingyu
    Zhu, Kai
    Cao, Yang
    Wu, Feng
    Zha, Zheng-Jun
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 750 - 775
  • [38] Weakly supervised foreground learning for weakly supervised localization and detection
    Zhang, Chen -Lin
    Li, Yin
    Wu, Jianxin
    PATTERN RECOGNITION, 2023, 137
  • [39] Multi-Layer Decoupling Attention Network for Weakly Supervised Object Localization
    Zhang, Aoran
    Ling, Zhigang
    Wang, Yaonan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4469 - 4479
  • [40] Weakly-Supervised Object Localization by Cutting Background with Deep Reinforcement Learning
    Zheng, Wu
    Zhang, Zhaoxiang
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 210 - 218