A Balanced Relation Prediction Framework for Scene Graph Generation

被引:1
作者
Xu, Kai [1 ]
Wang, Lichun [1 ]
Li, Shuang [1 ]
Zhang, Huiyong [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV | 2023年 / 14257卷
基金
国家重点研发计划;
关键词
Scene Graph Generation; Long-tailed Problem; Cumulative Learning;
D O I
10.1007/978-3-031-44216-2_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It has become a consensus that regular scene graph generation (SGG) is limited in actual applications due to the overfitting of head predicates. A series of debiasing methods, i.e. unbiased SGG, have been proposed to solve the problem. However, existing unbiased SGG methods have a tendency to fit the tail predicates, which is another type of bias. This paper aims to eliminate the one-way overfitting of head or tail predicates. In order to provide more balanced relationship prediction, we propose a new framework DCL (Dual-branch Cumulative Learning) which integrates regular relation prediction process and debiasing relation prediction process by employing cumulative learning mechanism. The learning process of DCL enhances the discrimination of tail predicates without reducing the discrimination performance of the model on head predicates. DCL is model-agnostic and compatible with existed different type of debiasing methods. Experiments on Visual Genome dataset show that, among all the model-agnostic methods, DCL achieves the best comprehensive performance while considering both R@K and mR@K.
引用
收藏
页码:216 / 228
页数:13
相关论文
共 23 条
[1]   Knowledge-Embedded Routing Network for Scene Graph Generation [J].
Chen, Tianshui ;
Yu, Weihao ;
Chen, Riquan ;
Lin, Liang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6156-6164
[2]  
Chen Zhanwen, 2023, WACV, P4023
[3]   Recovering the Unbiased Scene Graphs from the Biased Ones [J].
Chiou, Meng-Jiun ;
Ding, Henghui ;
Yan, Hanshu ;
Wang, Changhu ;
Zimmermann, Roger ;
Feng, Jiashi .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :1581-1590
[4]   Detecting Visual Relationships with Deep Relational Networks [J].
Dai, Bo ;
Zhang, Yuqi ;
Lin, Dahua .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3298-3308
[5]   BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation [J].
Dhingra, Naina ;
Ritter, Florian ;
Kunz, Andreas .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :2150-2159
[6]   Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation [J].
Dong, Xingning ;
Gan, Tian ;
Song, Xuemeng ;
Wu, Jianlong ;
Cheng, Yuan ;
Nie, Liqiang .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :19405-19414
[7]  
Koner R., 2020, arXiv
[8]   Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations [J].
Krishna, Ranjay ;
Zhu, Yuke ;
Groth, Oliver ;
Johnson, Justin ;
Hata, Kenji ;
Kravitz, Joshua ;
Chen, Stephanie ;
Kalantidis, Yannis ;
Li, Li-Jia ;
Shamma, David A. ;
Bernstein, Michael S. ;
Li Fei-Fei .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 123 (01) :32-73
[9]   The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation [J].
Li, Lin ;
Chen, Long ;
Huang, Yifeng ;
Zhang, Zhimeng ;
Zhang, Songyang ;
Xiao, Jun .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :18847-18856
[10]   Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation [J].
Li, Rongjie ;
Zhang, Songyang ;
Wan, Bo ;
He, Xuming .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11104-11114