Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization

被引：1

作者：

Wang, Changwei ^{[1
,2
,3
]}

Xu, Rongtao ^{[2
]}

Xu, Shibiao ^{[4
]}

Meng, Weiliang ^{[2
]}

Wang, Ruisheng ^{[5
]}

Zhang, Xiaopeng ^{[2
]}

机构：

[1] Qilu Univ Technol, Key Lab Comp Power Network & Informat Secur, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan,Shandong Acad Sci,Minist, Jinan 250014, Peoples R China

[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China

[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[4] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[5] Univ Calgary, Dept Geomat Engn, Calgary, AB T2N 1N4, Canada

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

基金：

北京市自然科学基金;

关键词：

Weakly supervised object localization; intrinsic discrimination and consistency; deep metric learning; geometric transformation consistency; IMAGE; FEATURES;

D O I：

10.1109/TIP.2024.3356174

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly supervised object localization (WSOL) is a challenging and promising task that aims to localize objects solely based on the supervision of image category labels. In the absence of annotated bounding boxes, WSOL methods must employ the intrinsic properties of the image classification task pipeline to generate object localizations. In this work, we propose a WSOL method for exploring the Intrinsic Discrimination and Consistency in the image classification task pipeline, and call it as IDC. First, we develop a Triplet Metrics Based Foreground Modeling (TMFM) framework to directly predict object foreground regions using intrinsic discrimination. Unlike Class Activation Map (CAM) based methods that also rely on intrinsic discrimination, our TMFM framework alleviates the problem of only focusing on the most discriminative parts by optimizing foreground and background regions synergistically. Second, we design a Dual Geometric Transformation Consistency Constraints (DGTC2) training strategy to introduce additional supervision and regularization constraints for WSOL by leveraging intrinsic geometric transformation consistency. The proposed pixel-wise and object-wise consistency constraint losses cost-effectively provide spontaneous supervision for WSOL. Extensive experiments show that our IDC method achieves significant and consistent performance gains compared to existing state-of-the-art WSOL approaches. Code is available at: https://github.com/vignywang/IDC.

引用

页码：1045 / 1058

页数：14

共 50 条

[1] A Consistency and Integration Model with Adaptive Thresholds for Weakly Supervised Object Localization
Su, Hao
Yang, Meng
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 1281 - 1289
[2] Exploring Pixel Alignment on Shallow Feature for Weakly Supervised Object Localization
Cao, Xinzi
Yang, Meng
Sun, Guoyin
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[3] Rethinking the Localization in Weakly Supervised Object Localization
Xu, Rui
Luo, Yong
Hu, Han
Du, Bo
Shen, Jialie
Wen, Yonggang
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5484 - 5494
[4] Generalized Weakly Supervised Object Localization
Zhang, Dingwen
Guo, Guangyu
Zeng, Wenyuan
Li, Lei
Han, Junwei
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5395 - 5406
[5] Weakly supervised object localization and segmentation in videos
Rochan, Mrigank
Rahman, Shafin
Bruce, Neil D. B.
Wang, Yang
IMAGE AND VISION COMPUTING, 2016, 56 : 1 - 12
[6] Weakly Supervised Object Localization as Domain Adaption
Zhu, Lei
She, Qi
Chen, Qian
You, Yunfei
Wang, Boyu
Lu, Yanye
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14617 - 14626
[7] Weakly Supervised Object Localization with Stable Segmentations
Galleguillos, Carolina
Babenko, Boris
Rabinovich, Andrew
Belongie, Serge
COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 : 193 - 207
[8] Adversarial Transformers for Weakly Supervised Object Localization
Meng, Meng
Zhang, Tianzhu
Zhang, Zhe
Zhang, Yongdong
Wu, Feng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 7130 - 7143
[9] Adversarial Transformers for Weakly Supervised Object Localization
Meng, Meng
Zhang, Tianzhu
Zhang, Zhe
Zhang, Yongdong
Wu, Feng
IEEE Transactions on Image Processing, 2022, 31 : 7130 - 7143
[10] SALIENCY AWARE: WEAKLY SUPERVISED OBJECT LOCALIZATION
Chen, Yun-Chun
Hsu, Winston H.
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1907 - 1911

← 1 2 3 4 5 →