Multi-Granularity Sparse Relationship Matrix Prediction Network for End-to-End Scene Graph Generation

被引:0
作者
Wang, Lei [1 ]
Yuan, Zejian [1 ]
Chen, Badong [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China
来源
COMPUTER VISION-ECCV 2024, PT LXXXII | 2025年 / 15140卷
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Scene Graph Generation; End-to-End; Sparse Relationship Matrix; Multi-Granularity;
D O I
10.1007/978-3-031-73007-8_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current end-to-end Scene Graph Generation (SGG) relies solely on visual representations to separately detect sparse relations and entities in an image. This leads to the issue where the predictions of entities do not contribute to the prediction of relations, necessitating post-processing to assign corresponding subjects and objects to the predicted relations. In this paper, we introduce a sparse relationship matrix that bridges entity detection and relation detection. Our approach not only eliminates the need for relation matching, but also leverages the semantics and positional information of predicted entities to enhance relation prediction. Specifically, a multi-granularity sparse relationship matrix prediction network is proposed, which utilizes three gated pooling modules focusing on filtering negative samples at different granularities, thereby obtaining a sparse relationship matrix containing entity pairs most likely to form relations. Finally, a set of sparse, most probable subject-object pairs can be constructed and used for relation decoding. Experimental results on multiple datasets demonstrate that our method achieves a new state-of-the-art overall performance. Our code is available at https://github.com/wanglei0618/Mg-RMPN.
引用
收藏
页码:105 / 121
页数:17
相关论文
共 40 条
  • [21] AN END-TO-END MULTI-SCALE RESIDUAL RECONSTRUCTION NETWORK FOR IMAGE COMPRESSIVE SENSING
    Liu, Renhe
    Li, Sumei
    Hou, Chunping
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2070 - 2074
  • [22] PD-GATv2: positive difference second generation graph attention network based on multi-granularity in information systems to classification
    Fu, Yu
    Liu, Xindi
    Yu, Bin
    APPLIED INTELLIGENCE, 2024, 54 (06) : 5081 - 5096
  • [23] A Deep Neural Network Model for Rating Prediction Based on Multi-layer Prediction and Multi-granularity Latent Feature Vectors
    Yang, Bo
    Mu, Qilin
    Zou, Hairui
    Zeng, Yancheng
    Wong, Hau-San
    Li, Zesong
    Wang, Peng
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 227 - 236
  • [24] Multi-modal information fusion for multi-task end-to-end behavior prediction in autonomous driving
    Guo, Baicang
    Liu, Hao
    Yang, Xiao
    Cao, Yuan
    Jin, Lisheng
    Wang, Yinlin
    NEUROCOMPUTING, 2025, 634
  • [25] Boosting the performance of molecular property prediction via graph-text alignment and multi-granularity representation enhancement
    Zhao, Zhuoran
    Zhou, Qing
    Wu, Chengkai
    Su, Renbin
    Xiong, Weihong
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2024, 132
  • [26] MSARN: A Multi-scale Attention Residual Network for End-to-End Environmental Sound Classification
    Fucai Hu
    Peng Song
    Ruhan He
    Zhaoli Yan
    Yongsheng Yu
    Neural Processing Letters, 2023, 55 : 11449 - 11465
  • [27] MSARN: A Multi-scale Attention Residual Network for End-to-End Environmental Sound Classification
    Hu, Fucai
    Song, Peng
    He, Ruhan
    Yan, Zhaoli
    Yu, Yongsheng
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 11449 - 11465
  • [28] Implicit Filter-and-sum Network for End-to-end Multi-channel Speech Separation
    Luo, Yi
    Mesgarani, Nima
    INTERSPEECH 2021, 2021, : 3071 - 3075
  • [29] Dual-channel and multi-granularity gated graph attention network for aspect-based sentiment analysis
    Yong Wang
    Ningchuang Yang
    Duoqian Miao
    Qiuyi Chen
    Applied Intelligence, 2023, 53 : 13145 - 13157
  • [30] Dual-channel and multi-granularity gated graph attention network for aspect-based sentiment analysis
    Wang, Yong
    Yang, Ningchuang
    Miao, Duoqian
    Chen, Qiuyi
    APPLIED INTELLIGENCE, 2023, 53 (11) : 13145 - 13157