Semantic scene graph generation based on an edge dual scene graph and message passing neural network☆

被引:0
作者
Kim, Hyeongjin [1 ]
Ko, Byoung Chul [1 ]
机构
[1] Keimyung Univ, 1095 Dalgubeol Daero, Daegu 42601, South Korea
基金
新加坡国家研究基金会;
关键词
Scene graph generation; Edge dual scene graph; Long-tail problem; Scene understanding;
D O I
10.1016/j.imavis.2025.105572
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Along with generative AI, interest in scene graph generation (SGG), which comprehensively captures the relationships and interactions between objects in an image and creates a structured graph-based representation, has significantly increased in recent years. However, relying on object-centric and dichotomous relationships, existing SGG methods have a limited ability to accurately predict detailed relationships. To solve these problems, a new approach to the modeling multi-object relationships, called edge dual scene graph generation (EdgeSGG), is proposed herein. EdgeSGG is based on an edge dual scene graph and object-relation centric message passing neural network (OR-MPNN), which can capture rich contextual interactions between unconstrained objects. To facilitate the learning of edge dual scene graphs with a symmetric graph structure, the proposed OR-MPNN learns both object-and relation-centric features for more accurately predicting relationaware contexts and allows fine-grained relational updates between objects. A comparative experiment with state-of-the-art (SoTA) methods was conducted using two public datasets for SGG operations and six metrics for three subtasks. Compared with SoTA approaches, the proposed model exhibited substantial performance improvements across all SGG subtasks. Furthermore, experiment on imbalanced class distributions revealed that incorporating the relationships between objects effectively mitigates existing long-tail problems. Our code is available at https://github.com/Chocolate-Love/EdgeSGG-pytorch.
引用
收藏
页数:9
相关论文
共 41 条
[1]   STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition [J].
Ahn, Dasom ;
Kim, Sangwon ;
Hong, Hyunsu ;
Ko, Byoung Chul .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :3319-3328
[2]   Learning about learning: Mining human brain sub-network biomarkers from fMRI data [J].
Bogdanov, Petko ;
Dereli, Nazli ;
Dang, Xuan-Hong ;
Bassett, Danielle S. ;
Wymbs, Nicholas F. ;
Grafton, Scott T. ;
Singh, Ambuj K. .
PLOS ONE, 2017, 12 (10)
[3]   Knowledge-Embedded Routing Network for Scene Graph Generation [J].
Chen, Tianshui ;
Yu, Weihao ;
Chen, Riquan ;
Lin, Liang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6156-6164
[4]   High Capacity and High-Rate NASICON-Na3.75V1.25Mn0.75(PO4)3 Cathode for Na-Ion Batteries via Modulating Electronic and Crystal Structures [J].
Ghosh, Subham ;
Barman, Nabadyuti ;
Mazumder, Madhulika ;
Pati, Swapan K. ;
Rousse, Gwenaelle ;
Senguttuvan, Premkumar .
ADVANCED ENERGY MATERIALS, 2020, 10 (06)
[5]   From General to Specific: Informative Scene Graph Generation via Balance Adjustment [J].
Guo, Yuyu ;
Gao, Lianli ;
Wang, Xuanhan ;
Hu, Yuxuan ;
Xu, Xing ;
Lu, Xu ;
Shen, Heng Tao ;
Song, Jingkuan .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :16363-16372
[6]   Fast Contextual Scene Graph Generation with Unbiased Context Augmentation [J].
Jin, Tianlei ;
Guo, Fangtai ;
Meng, Qiwei ;
Zhu, Shiqiang ;
Xi, Xiangming ;
Wang, Wen ;
Mu, Zonghao ;
Song, Wei .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :6302-6311
[7]   Devil's on the Edges: Selective Quad Attention for Scene Graph Generation [J].
Jung, Deunsol ;
Kim, Sanghyun ;
Kim, Won Hwa ;
Cho, Minsu .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :18664-18674
[8]  
Kim K., 2024, ICLR, P112
[9]   IS-GGT: Iterative Scene Graph Generation with Generative Transformers [J].
Kundu, Sanjoy ;
Aakur, Sathyanarayanan N. .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :6292-6301
[10]   The Open Images Dataset V4 Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale [J].
Kuznetsova, Alina ;
Rom, Hassan ;
Alldrin, Neil ;
Uijlings, Jasper ;
Krasin, Ivan ;
Pont-Tuset, Jordi ;
Kamali, Shahab ;
Popov, Stefan ;
Malloci, Matteo ;
Kolesnikov, Alexander ;
Duerig, Tom ;
Ferrari, Vittorio .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (07) :1956-1981