Learning Unsupervised Visual Grounding Through Semantic Self-Supervision

被引:0
|
作者
Javed, Syed Ashar [1 ]
Saxena, Shreyas
Gandhi, Vineet [2 ]
机构
[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[2] IIIT Hyderabad, CVIT, Kohli Ctr Intelligent Syst KCIS, Hyderabad, India
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Localizing natural language phrases in images is a challenging problem that requires joint understanding of both the textual and visual modalities. In the unsupervised setting, lack of supervisory signals exacerbate this difficulty. In this paper, we propose a novel framework for unsupervised visual grounding which uses concept learning as a proxy task to obtain self-supervision. The intuition behind this idea is to encourage the model to localize to regions which can explain some semantic property in the data, in our case, the property being the presence of a concept in a set of images We present thorough quantitative and qualitative experiments to demonstrate the efficacy of our approach and show a 5.6% improvement over the current state of the art on Visual Genome dataset, a 5.8% improvement on the ReferItGame dataset and comparable to state-of-art performance on the Flickr30k dataset.
引用
收藏
页码:796 / 802
页数:7
相关论文
共 50 条
  • [41] Towards Generalized Manipulation Learning Through Grasp Mechanics-Based Features and Self-Supervision
    Morgan, Andrew S.
    Bircher, Walter G.
    Dollar, Aaron M.
    IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (05) : 1553 - 1569
  • [42] Improving Semi-Supervised Learning for Remaining Useful Lifetime Estimation Through Self-Supervision
    Krokotsch, Tilman
    Knaak, Mirko
    Guehmann, Clemens
    INTERNATIONAL JOURNAL OF PROGNOSTICS AND HEALTH MANAGEMENT, 2022, 13 (01) : 1 - 19
  • [43] FedGL: Federated graph learning framework with global self-supervision
    Chen, Chuan
    Xu, Ziyue
    Hu, Weibo
    Zheng, Zibin
    Zhang, Jie
    INFORMATION SCIENCES, 2024, 657
  • [44] DoubleMatch: Improving Semi-Supervised Learning with Self-Supervision
    Wallin, Erik
    Svensson, Lennart
    Kahl, Fredrik
    Hammarstrand, Lars
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2871 - 2877
  • [45] Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision
    Scholz, Julien
    Weber, Cornelius
    Hafez, Muhammad Burhan
    Wermter, Stefan
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [46] DEEP VIDEO INPAINTING GUIDED BY AUDIO-VISUAL SELF-SUPERVISION
    Kim, Kyuyeon
    Jung, Junsik
    Kim, Woo Jae
    Yoon, Sung-Eui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1970 - 1974
  • [47] Unsupervised Adaptation of Polyp Segmentation Models via Coarse-to-Fine Self-Supervision
    Wang, Jiexiang
    Chen, Chaoqi
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2023, 2023, 13939 : 250 - 262
  • [48] Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision
    Weng, Zhenzhen
    Ogut, Mehmet Giray
    Limonchik, Shai
    Yeung, Serena
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2603 - 2612
  • [49] Task-specific image summaries using semantic information and self-supervision
    Sharma, Deepak Kumar
    Singh, Anurag
    Sharma, Sudhir Kumar
    Srivastava, Gautam
    Lin, Jerry Chun-Wei
    SOFT COMPUTING, 2022, 26 (16) : 7581 - 7594
  • [50] Self-Supervision: Psychodynamic Strategies
    Brenner, Ira
    JOURNAL OF THE AMERICAN PSYCHOANALYTIC ASSOCIATION, 2024, 72 (02)