Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization

被引:1
|
作者
Wang, Changwei [1 ,2 ,3 ]
Xu, Rongtao [2 ]
Xu, Shibiao [4 ]
Meng, Weiliang [2 ]
Wang, Ruisheng [5 ]
Zhang, Xiaopeng [2 ]
机构
[1] Qilu Univ Technol, Key Lab Comp Power Network & Informat Secur, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan,Shandong Acad Sci,Minist, Jinan 250014, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[5] Univ Calgary, Dept Geomat Engn, Calgary, AB T2N 1N4, Canada
基金
北京市自然科学基金;
关键词
Weakly supervised object localization; intrinsic discrimination and consistency; deep metric learning; geometric transformation consistency; IMAGE; FEATURES;
D O I
10.1109/TIP.2024.3356174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object localization (WSOL) is a challenging and promising task that aims to localize objects solely based on the supervision of image category labels. In the absence of annotated bounding boxes, WSOL methods must employ the intrinsic properties of the image classification task pipeline to generate object localizations. In this work, we propose a WSOL method for exploring the Intrinsic Discrimination and Consistency in the image classification task pipeline, and call it as IDC. First, we develop a Triplet Metrics Based Foreground Modeling (TMFM) framework to directly predict object foreground regions using intrinsic discrimination. Unlike Class Activation Map (CAM) based methods that also rely on intrinsic discrimination, our TMFM framework alleviates the problem of only focusing on the most discriminative parts by optimizing foreground and background regions synergistically. Second, we design a Dual Geometric Transformation Consistency Constraints (DGTC2) training strategy to introduce additional supervision and regularization constraints for WSOL by leveraging intrinsic geometric transformation consistency. The proposed pixel-wise and object-wise consistency constraint losses cost-effectively provide spontaneous supervision for WSOL. Extensive experiments show that our IDC method achieves significant and consistent performance gains compared to existing state-of-the-art WSOL approaches. Code is available at: https://github.com/vignywang/IDC.
引用
收藏
页码:1045 / 1058
页数:14
相关论文
共 50 条
  • [1] A Consistency and Integration Model with Adaptive Thresholds for Weakly Supervised Object Localization
    Su, Hao
    Yang, Meng
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 1281 - 1289
  • [2] Exploring Pixel Alignment on Shallow Feature for Weakly Supervised Object Localization
    Cao, Xinzi
    Yang, Meng
    Sun, Guoyin
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [3] Rethinking the Localization in Weakly Supervised Object Localization
    Xu, Rui
    Luo, Yong
    Hu, Han
    Du, Bo
    Shen, Jialie
    Wen, Yonggang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5484 - 5494
  • [4] Generalized Weakly Supervised Object Localization
    Zhang, Dingwen
    Guo, Guangyu
    Zeng, Wenyuan
    Li, Lei
    Han, Junwei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5395 - 5406
  • [5] Weakly supervised object localization and segmentation in videos
    Rochan, Mrigank
    Rahman, Shafin
    Bruce, Neil D. B.
    Wang, Yang
    IMAGE AND VISION COMPUTING, 2016, 56 : 1 - 12
  • [6] Weakly Supervised Object Localization as Domain Adaption
    Zhu, Lei
    She, Qi
    Chen, Qian
    You, Yunfei
    Wang, Boyu
    Lu, Yanye
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14617 - 14626
  • [7] Weakly Supervised Object Localization with Stable Segmentations
    Galleguillos, Carolina
    Babenko, Boris
    Rabinovich, Andrew
    Belongie, Serge
    COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 : 193 - 207
  • [8] Adversarial Transformers for Weakly Supervised Object Localization
    Meng, Meng
    Zhang, Tianzhu
    Zhang, Zhe
    Zhang, Yongdong
    Wu, Feng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 7130 - 7143
  • [9] Adversarial Transformers for Weakly Supervised Object Localization
    Meng, Meng
    Zhang, Tianzhu
    Zhang, Zhe
    Zhang, Yongdong
    Wu, Feng
    IEEE Transactions on Image Processing, 2022, 31 : 7130 - 7143
  • [10] SALIENCY AWARE: WEAKLY SUPERVISED OBJECT LOCALIZATION
    Chen, Yun-Chun
    Hsu, Winston H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1907 - 1911