Adversarial Transformers for Weakly Supervised Object Localization

被引：5

作者：

Meng, Meng ^{[1
]}

Zhang, Tianzhu ^{[1
]}

Zhang, Zhe ^{[2
,3
]}

Zhang, Yongdong ^{[4
]}

Wu, Feng ^{[4
,5
]}

机构：

[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China

[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Automat, Hefei, Peoples R China

[3] Lunar Explorat & Space Engn Ctr CNSA, Beijing, Peoples R China

[4] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China

[5] Univ Sci & Technol China, Sch Informat Sci & Technol, Dept Elect Engn & Informat Sci, Hefei, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2022年 / 31卷

关键词：

Perturbation methods; Adversarial training; transformers; weakly supervised object localization; SEMANTIC SEGMENTATION;

D O I：

10.1109/TIP.2022.3220055

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly supervised object localization (WSOL) aims at localizing objects with only image-level labels, which has better scalability and practicability than fully supervised methods. However, without pixel-level supervision, existing methods tend to generate rough localization maps, which hinders localization performance. To alleviate this problem, we propose an adversarial transformer network (ATNet), which aims to obtain a well-learned localization model with pixel-level pseudo labels. The proposed ATNet enjoys several merits. First, we design an object transformer ( $G$ ) that can generate localization maps and pseudo labels effectively and dynamically, and a part transformer ( $D$ ) to accurately discriminate detailed local differences between localization maps and pseudo labels. Second, we propose to train $G$ and $D$ via an adversarial process, where $G$ can generate more accurate localization maps approaching pseudo labels to fool $D$ . To the best of our knowledge, this is the first work to explore transformers with adversarial training to obtain a well-learned localization model for WSOL. Extensive experiments with four backbones on two standard benchmarks demonstrate that our ATNet achieves favorable performance against state-of-the-art WSOL methods. Besides, our adversarial training can provide higher robustness against adversarial attacks.

引用

页码：7130 / 7143

页数：14

共 50 条

[31] RSMNet: A Regional Similar Module Network for Weakly Supervised Object Localization
Zhigang Ling
Liang Li
Aoran Zhang
Neural Processing Letters, 2022, 54 : 5079 - 5097
[32] Weakly-supervised object localization with gradient-pyramid feature
Mao, Zhongjie
Zhou, Yipeng
Sun, Jun
Wu, Hao
Pan, Feng
Ahmad, Bilal
APPLIED INTELLIGENCE, 2023, 53 (03) : 2923 - 2935
[33] Weakly-supervised object localization with gradient-pyramid feature
Zhongjie Mao
Yipeng Zhou
Jun Sun
Hao Wu
Feng Pan
Bilal Ahmad
Applied Intelligence, 2023, 53 : 2923 - 2935
[34] Exploring Pixel Alignment on Shallow Feature for Weakly Supervised Object Localization
Cao, Xinzi
Yang, Meng
Sun, Guoyin
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[35] Attention-based Selection Strategy for Weakly Supervised Object Localization
Zhang, Zhenfei
Bui, Tien D.
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10305 - 10311
[36] Instance Annotation via Optimal BoW for Weakly Supervised Object Localization
Wang, Liantao
Meng, Deyu
Hu, Xuelei
Lu, Jianfeng
Zhao, Ji
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (05) : 1313 - 1324
[37] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
Zhai, Wei
Wu, Pingyu
Zhu, Kai
Cao, Yang
Wu, Feng
Zha, Zheng-Jun
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 750 - 775
[38] Weakly supervised foreground learning for weakly supervised localization and detection
Zhang, Chen -Lin
Li, Yin
Wu, Jianxin
PATTERN RECOGNITION, 2023, 137
[39] Multi-Layer Decoupling Attention Network for Weakly Supervised Object Localization
Zhang, Aoran
Ling, Zhigang
Wang, Yaonan
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4469 - 4479
[40] Weakly-Supervised Object Localization by Cutting Background with Deep Reinforcement Learning
Zheng, Wu
Zhang, Zhaoxiang
PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 210 - 218

← 1 2 3 4 5 →