Weakly Supervised Referring Expression Grounding via Dynamic Self-Knowledge Distillation

Cited by: 1
Authors
Mi, Jinpeng [1]
Chen, Zhiqian [1]
Zhang, Jianwei [2]
Affiliations
[1] Univ Shanghai Sci & Technol, Inst Machine Intelligence IMI, Shanghai, Peoples R China
[2] Univ Hamburg, Dept Informat, Tech Aspects Multimodal Syst TAMS, Hamburg, Germany
Source
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS | 2023
Funding
U.S. National Science Foundation;
Keywords
RECONSTRUCTION;
DOI
10.1109/IROS55552.2023.10341909
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Weakly supervised referring expression grounding (WREG) is an attractive and challenging task: grounding target regions in images by understanding given referring expressions. WREG learns to ground target objects without manual annotations linking image regions to referring expressions during model training. In contrast to the predominant grounding pattern of existing models, which locate target objects by reconstructing the region-expression correspondence, we investigate WREG from a novel perspective and enrich the prevailing pattern with self-knowledge distillation. Specifically, we propose a target-guided self-knowledge distillation approach that adopts the target prediction knowledge learned in previous training iterations as the teacher to guide the subsequent training procedure. To avoid being misled by teacher knowledge with low prediction confidence, we present an uncertainty-aware knowledge refinement strategy that adaptively rectifies the teacher knowledge by learning dynamic threshold values based on the model's prediction uncertainty. To validate the proposed approach, we conduct extensive experiments on three benchmark datasets: RefCOCO, RefCOCO+, and RefCOCOg. Our approach achieves new state-of-the-art results on several splits of the benchmark datasets, showcasing the advantage of the proposed framework for WREG. The implementation code and trained models are available at: https://github.com/dami23/WREG Self KD.
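The abstract does not give formulas, so the following is only a minimal sketch of how a teacher distribution from an earlier training iteration might be filtered by an uncertainty-dependent threshold before being used in a distillation loss. Every name here (`refine_teacher`, `distillation_loss`, `base_threshold`) and the specific rules (normalized entropy as the uncertainty measure, a threshold that rises linearly with uncertainty, KL divergence as the distillation term) are assumptions for illustration, not the authors' implementation.

```python
import math


def softmax(scores):
    """Convert raw region scores into a probability distribution."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def normalized_entropy(probs):
    """Entropy scaled to [0, 1]; an assumed stand-in for prediction uncertainty."""
    if len(probs) < 2:
        return 0.0
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(len(probs))


def refine_teacher(teacher_probs, base_threshold=0.5):
    """Keep the teacher only if its top probability clears a dynamic threshold.

    Hypothetical rule: the threshold grows from base_threshold toward 1.0
    as the teacher's own uncertainty grows. Returns None when rejected.
    """
    uncertainty = normalized_entropy(teacher_probs)
    threshold = base_threshold + (1.0 - base_threshold) * uncertainty
    return teacher_probs if max(teacher_probs) >= threshold else None


def kl_divergence(teacher_probs, student_probs, eps=1e-12):
    """KL(teacher || student), with clamping to avoid log(0)."""
    return sum(
        t * math.log(t / max(s, eps))
        for t, s in zip(teacher_probs, student_probs)
        if t > 0.0
    )


def distillation_loss(student_scores, teacher_probs, base_threshold=0.5):
    """Distillation term for one sample; 0.0 if the teacher is rejected."""
    teacher = refine_teacher(teacher_probs, base_threshold)
    if teacher is None:
        return 0.0
    return kl_divergence(teacher, softmax(student_scores))
```

In this sketch, a confident teacher such as `[0.9, 0.05, 0.05]` is retained and contributes a positive loss against an undecided student, while a uniform teacher is discarded and contributes nothing.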
Pages: 1254 - 1260
Page count: 7
Related papers
50 in total
  • [31] RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension
    Jin, Lei
    Luo, Gen
    Zhou, Yiyi
    Sun, Xiaoshuai
    Jiang, Guannan
    Shu, Annan
    Ji, Rongrong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2681 - 2690
  • [32] Weakly supervised video object segmentation initialized with referring expression
    Bu, Xiaoqing
    Sun, Yukuan
    Wang, Jianming
    Liu, Kunliang
    Liang, Jiayu
    Jin, Guanghao
    Chung, Tae-Sun
    NEUROCOMPUTING, 2021, 453 : 754 - 765
  • [33] Transparency, expression, and self-knowledge
    Bar-On, Dorit
    PHILOSOPHICAL EXPLORATIONS, 2015, 18 (02) : 134 - 152
  • [34] Enhancing deep feature representation in self-knowledge distillation via pyramid feature refinement
    Yu, Hao
    Feng, Xin
    Wang, Yunlong
    PATTERN RECOGNITION LETTERS, 2024, 178 : 35 - 42
  • [35] Personalized federated learning via decoupling self-knowledge distillation and global adaptive aggregation
    Tang, Zhiwei
    Xu, Shuwei
    Jin, Haozhe
    Liu, Shichong
    Zhai, Rui
    Lu, Ke
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [36] Weakly-Supervised Domain Adaptation of Deep Regression Trackers via Reinforced Knowledge Distillation
    Dunnhofer, Matteo
    Martinel, Niki
    Micheloni, Christian
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) : 5016 - 5023
  • [37] Weakly supervised object localization via knowledge distillation based on foreground-background contrast
    Ma, Siteng
    Hou, Biao
    Li, Zhihao
    Wu, Zitong
    Guo, Xianpeng
    Yang, Chen
    Jiao, Licheng
    NEUROCOMPUTING, 2024, 576
  • [38] SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
    Nag, Sayan
    Goswami, Koustava
    Karanam, Srikrishna
    COMPUTER VISION-ECCV 2024, PT XLIV, 2025, 15102 : 485 - 503
  • [39] Bar-On on Self-Knowledge and Expression
    Boyle, Matthew
    ACTA ANALYTICA-INTERNATIONAL PERIODICAL FOR PHILOSOPHY IN THE ANALYTICAL TRADITION, 2010, 25 (01) : 9 - 20