Improving Predicate Representation in Scene Graph Generation by Self-Supervised Learning

Authors
Hasegawa, So [1]
Hiromoto, Masayuki [1]
Nakagawa, Akira [1]
Umeda, Yuhei [1]
Affiliations
[1] Fujitsu Ltd, Tokyo, Japan
DOI
10.1109/WACV56688.2023.00276
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Scene graph generation (SGG) aims to understand sophisticated visual information by detecting triplets of subject, object, and their relationship (predicate). Since the predicate labels are heavily imbalanced, existing supervised methods struggle to improve accuracy for the rare predicates due to insufficient labeled data. In this paper, we propose SePiR, a novel self-supervised learning method for SGG to improve the representation of rare predicates. We first train a relational encoder by contrastive learning without using predicate labels, and then fine-tune a predicate classifier with labeled data. To apply contrastive learning to SGG, we propose a new data augmentation in which subject-object pairs are augmented by replacing their visual features with those from other images having the same object labels. This augmentation increases the variation of the visual features while preserving the relationship between the objects. Comprehensive experimental results on the Visual Genome dataset show that the SGG performance of SePiR is comparable to the state of the art, and that, especially with a limited labeled dataset, our method significantly outperforms the existing supervised methods. Moreover, SePiR's improved representation allows a simpler model architecture, resulting in 3.6x and 6.3x reductions in the number of parameters and the inference time, respectively, relative to the existing method.
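
The augmentation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`build_feature_bank`, `swap_feature`, `augment_pair`), the dictionary layout of a pair, and the use of plain lists as visual features are all assumptions made for illustration. The sketch only shows the stated idea, namely that the visual features of a subject-object pair are replaced with features of objects carrying the same labels in other images, while the labels themselves stay unchanged.

```python
import random


def build_feature_bank(objects):
    """Group visual features by object label.

    `objects` is an iterable of (image_id, label, feature) tuples
    (a hypothetical layout chosen for this sketch).
    """
    bank = {}
    for image_id, label, feature in objects:
        bank.setdefault(label, []).append((image_id, feature))
    return bank


def swap_feature(image_id, label, feature, bank, rng):
    """Pick a feature with the same label from a different image.

    Falls back to the original feature if no other image has that label.
    """
    candidates = [f for img, f in bank.get(label, []) if img != image_id]
    return rng.choice(candidates) if candidates else feature


def augment_pair(pair, bank, rng):
    """Return an augmented copy of a subject-object pair.

    Labels are kept; only the visual features are replaced.
    """
    augmented = dict(pair)
    augmented["subj_feat"] = swap_feature(
        pair["image_id"], pair["subj_label"], pair["subj_feat"], bank, rng)
    augmented["obj_feat"] = swap_feature(
        pair["image_id"], pair["obj_label"], pair["obj_feat"], bank, rng)
    return augmented


# Toy data: two images, each containing a person and a horse.
objects = [
    ("img1", "person", [0.1, 0.2]),
    ("img1", "horse", [0.3, 0.4]),
    ("img2", "person", [0.5, 0.6]),
    ("img2", "horse", [0.7, 0.8]),
]
bank = build_feature_bank(objects)
pair = {
    "image_id": "img1",
    "subj_label": "person", "subj_feat": [0.1, 0.2],
    "obj_label": "horse", "obj_feat": [0.3, 0.4],
}
augmented = augment_pair(pair, bank, random.Random(0))
```

In a contrastive setup, the original pair and its augmented copy would presumably serve as two views of the same relationship. How SePiR treats other inputs, such as bounding-box geometry, is not stated in the abstract and is left out of this sketch.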
Pages: 2739-2748
Number of pages: 10