Unbiased Scene Graph Generation Using Predicate Similarities

被引:0
|
作者
Matsui, Yusuke [1 ]
Ohashi, Misaki [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat & Commun Engn, Bunkyo Ku, Tokyo 1138656, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Task analysis; Knowledge transfer; Feature extraction; Visualization; Training; Computer vision; Transfer learning; Bioinformatics; Genomics; Classification algorithms; Scene classification; Scene graph; unbiased generation; predicate similarities; transfer learning; long-tailed distribution; SMOTE;
D O I
10.1109/ACCESS.2024.3424230
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene Graphs are widely applied in computer vision as a graphical representation of relationships between objects shown in images. However, these applications have not yet reached a practical stage of development owing to biased training caused by long-tailed predicate distributions. In recent years, many studies have tackled this problem. In contrast, relatively few works have considered predicate similarities as a unique dataset feature which also leads to the biased prediction. Due to the feature, infrequent predicates (e.g., "parked on", "covered in") are easily misclassified as closely-related frequent predicates (e.g., "on", "in"). Utilizing predicate similarities, we propose a new classification scheme that branches the process to several fine-grained classifiers for similar predicate groups. The classifiers aim to capture the differences among similar predicates in detail. We also introduce the idea of transfer learning to enhance the features for the predicates which lack sufficient training samples to learn the descriptive representations. Our target here is to improve the average precision scores even for the instances with the tail predicators. The results of extensive experiments on the Visual Genome dataset show that the combination of our method and an existing debiasing approach greatly improves performance on tail predicates in challenging SGCls/SGDet tasks. Nonetheless, the overall performance of the proposed approach does not reach that of the current state of the art, so further analysis remains necessary as future work.
引用
收藏
页码:95507 / 95516
页数:10
相关论文
共 50 条
  • [1] Label Semantic Knowledge Distillation for Unbiased Scene Graph Generation
    Li, Lin
    Xiao, Jun
    Shi, Hanrong
    Wang, Wenxiao
    Shao, Jian
    Liu, An-An
    Yang, Yi
    Chen, Long
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 195 - 206
  • [2] DBiased-P: Dual-Biased Predicate Predictor for Unbiased Scene Graph Generation
    Han, Xianjing
    Song, Xuemeng
    Dong, Xingning
    Wei, Yinwei
    Liu, Meng
    Nie, Liqiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5319 - 5329
  • [3] Predicate Correlation Learning for Scene Graph Generation
    Tao, Leitian
    Mi, Li
    Li, Nannan
    Cheng, Xianhang
    Hu, Yaosi
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4173 - 4185
  • [4] TEMPLATE-GUIDED DATA AUGMENTATION FOR UNBIASED SCENE GRAPH GENERATION
    Zang, Yujie
    Li, Yaochen
    Cao, Luguang
    Lu, Ruitao
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3510 - 3514
  • [5] Scene Graph Generation Using Depth, Spatial, and Visual Cues in 2D Images
    Kumar, Aiswarya S.
    Nair, Jyothisha J.
    IEEE ACCESS, 2022, 10 : 1968 - 1978
  • [6] MuRelSGG: Multimodal Relationship Prediction for Neurosymbolic Scene Graph Generation
    Khan, Muhammad Junaid
    Siddiqui, Adil Masood
    Khan, Hamid Saeed
    Akram, Faisal
    Khan, M. Jaleed
    IEEE ACCESS, 2025, 13 : 47042 - 47054
  • [7] Dual-Branch Hybrid Learning Network for Unbiased Scene Graph Generation
    Zheng, Chaofan
    Gao, Lianli
    Lyu, Xinyu
    Zeng, Pengpeng
    El Saddik, Abdulmotaleb
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1743 - 1756
  • [8] Dark Knowledge Balance Learning for Unbiased Scene Graph Generation
    Chen, Zhiqing
    Luo, Yawei
    Shao, Jian
    Yang, Yi
    Wang, Chunping
    Chen, Lei
    Xiao, Jun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4838 - 4847
  • [9] PANET: A CONTEXT BASED PREDICATE ASSOCIATION NETWORK FOR SCENE GRAPH GENERATION
    Chen, Yunian
    Wang, Yanjie
    Zhang, Yang
    Guo, Yanwen
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 508 - 513
  • [10] Relation-Specific Feature Augmentation for unbiased scene graph generation
    Liu, Zhihong
    Wang, Jianji
    Chen, Hui
    Ma, Yongqiang
    Zheng, Nanning
    PATTERN RECOGNITION, 2025, 157