A causality guided loss for imbalanced learning in scene graph generation

被引:1
|
作者
Peng, Ru [1 ]
Zhao, Chao [1 ]
Chen, Xingyu [1 ]
Wang, Ziru [1 ]
Liu, Yaxin [1 ]
Liu, Yulong [1 ]
Lan, Xuguang [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Natl Engn Res Ctr Visual Informat & Applicat, Natl Key Lab Human Machine Hybrid Augmented Intell, 28 West Xianning Rd, Xian 710049, Peoples R China
关键词
Causal learning; Deep long-tailed learning; Scene graph generation; Image classification;
D O I
10.1016/j.neucom.2024.128042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unbiased visual relation detection on long-tailed annotations is a critical challenge in scene graph generation (SGG). Imbalanced learning aims to tackle the problem of class distribution that is long-tailed in order to learn unbiased models from imbalanced data. Since long-tailed datasets are inevitable in the real world, obtaining a balanced dataset can be expensive or even impossible. However, training models on such data are easily biased towards head classes and underperform on tail classes. To overcome this challenge, existing methods focus more on utilizing label frequency as prior knowledge, but ignore the research on how imbalanced datasets lead to prediction bias, which is crucial for solving the long -tail problem. Therefore we propose a causal graph for the training process. This causal graph reveals the conventional loss serves as a confounder of the features and predictions during training. Guided by the causal graph, a degree -of -difficulty loss (DDloss) is designed which is a simple yet effective method to alleviate catering to the head. We demonstrate the effectiveness of DDloss through extensive experiments on SGG and test its expansibility on long-tailed image classification.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Multimodal graph inference network for scene graph generation
    Jingwen Duan
    Weidong Min
    Deyu Lin
    Jianfeng Xu
    Xin Xiong
    Applied Intelligence, 2021, 51 : 8768 - 8783
  • [22] Graph R-CNN for Scene Graph Generation
    Yang, Jianwei
    Lu, Jiasen
    Lee, Stefan
    Batra, Dhruv
    Parikh, Devi
    COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 690 - 706
  • [23] Review on scene graph generation methods
    Monesh, S.
    Senthilkumar, N. C.
    MULTIAGENT AND GRID SYSTEMS, 2024, 20 (02) : 129 - 160
  • [24] Adversarial Attacks on Scene Graph Generation
    Zhao, Mengnan
    Zhang, Lihe
    Wang, Wei
    Kong, Yuqiu
    Yin, Baocai
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 3210 - 3225
  • [25] Scene Graph Generation: A comprehensive survey
    Li, Hongsheng
    Zhu, Guangming
    Zhang, Liang
    Jiang, Youliang
    Dang, Yixuan
    Hou, Haoran
    Shen, Peiyi
    Zhao, Xia
    Shah, Syed Afaq Ali
    Bennamoun, Mohammed
    NEUROCOMPUTING, 2024, 566
  • [26] Multimodal graph inference network for scene graph generation
    Duan, Jingwen
    Min, Weidong
    Lin, Deyu
    Xu, Jianfeng
    Xiong, Xin
    APPLIED INTELLIGENCE, 2021, 51 (12) : 8768 - 8783
  • [27] Scene Graph Generation With Hierarchical Context
    Ren, Guanghui
    Ren, Lejian
    Liao, Yue
    Liu, Si
    Li, Bo
    Han, Jizhong
    Yan, Shuicheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 909 - 915
  • [28] PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation
    Yan, Shaotian
    Shen, Chen
    Jin, Zhongming
    Huang, Jianqiang
    Jiang, Rongxin
    Chen, Yaowu
    Hua, Xian-Sheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 265 - 273
  • [29] Gaussian Distribution-Aware Commonsense Knowledge Learning for Scene Graph Generation
    Tian, Hongshuo
    Xu, Ning
    Kankanhalli, Mohan
    Liu, An-An
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 13044 - 13057
  • [30] Attention redirection transformer with semantic oriented learning for unbiased scene graph generation
    Zhang, Ruonan
    An, Gaoyun
    Cen, Yigang
    Ruan, Qiuqi
    PATTERN RECOGNITION, 2025, 158