RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation

被引:3
|
作者
Liu, Hengyue [1 ]
Bhanu, Bir [1 ]
机构
[1] Univ Calif Riverside, Dept Elect & Comp Engn, Riverside, CA 92521 USA
基金
美国国家科学基金会;
关键词
Feature extraction; Visualization; Semantics; Task analysis; Detectors; Shape; Training; Scene graph generation; visual relationship detection; long-tailed learning; human-Object interaction;
D O I
10.1109/TPAMI.2024.3402143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Graph Generation (SGG) has achieved significant progress recently. However, most previous works rely heavily on fixed-size entity representations based on bounding box proposals, anchors, or learnable queries. As each representation's cardinality has different trade-offs between performance and computation overhead, extracting highly representative features efficiently and dynamically is both challenging and crucial for SGG. In this work, a novel architecture called RepSGG is proposed to address the aforementioned challenges, formulating a subject as queries, an object as keys, and their relationship as the maximum attention weight between pairwise queries and keys. With more fine-grained and flexible representation power for entities and relationships, RepSGG learns to sample semantically discriminative and representative points for relationship inference. Moreover, the long-tailed distribution also poses a significant challenge for generalization of SGG. A run-time performance-guided logit adjustment (PGLA) strategy is proposed such that the relationship logits are modified via affine transformations based on run-time performance during training. This strategy encourages a more balanced performance between dominant and rare classes. Experimental results show that RepSGG achieves the state-of-the-art or comparable performance on the Visual Genome and Open Images V6 datasets with fast inference speed, demonstrating the efficacy and efficiency of the proposed methods.
引用
收藏
页码:8018 / 8035
页数:18
相关论文
共 50 条
  • [1] Relation Regularized Scene Graph Generation
    Guo, Yuyu
    Gao, Lianli
    Song, Jingkuan
    Wang, Peng
    Sebe, Nicu
    Shen, Heng Tao
    Li, Xuelong
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 5961 - 5972
  • [2] A Novel Framework for Scene Graph Generation via Prior Knowledge
    Wang, Zhenghao
    Lian, Jing
    Li, Linhui
    Zhao, Jian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3768 - 3781
  • [3] Debiased Scene Graph Generation for Dual Imbalance Learning
    Zhou, Hao
    Zhang, Jun
    Luo, Tingjin
    Yang, Yazhou
    Lei, Jun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4274 - 4288
  • [4] Semantically Similarity-Wise Dual-Branch Network for Scene Graph Generation
    Zhao, Bowen
    Mao, Zhendong
    Fang, Shancheng
    Zang, Wenyu
    Zhang, Yongdong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4573 - 4583
  • [5] MLMG-SGG: Multilabel Scene Graph Generation With Multigrained Features
    Li, Xuewei
    Miao, Peihan
    Li, Songyuan
    Li, Xi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1549 - 1559
  • [6] Tackling the Challenges in Scene Graph Generation With Local-to-Global Interactions
    Woo, Sangmin
    Noh, Junhyug
    Kim, Kangil
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 9713 - 9726
  • [7] Toward a Unified Transformer-Based Framework for Scene Graph Generation and Human-Object Interaction Detection
    He, Tao
    Gao, Lianli
    Song, Jingkuan
    Li, Yuan-Fang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6274 - 6288
  • [8] Unbiased Scene Graph Generation Using Predicate Similarities
    Matsui, Yusuke
    Ohashi, Misaki
    IEEE ACCESS, 2024, 12 : 95507 - 95516
  • [9] Adversarial Attacks on Scene Graph Generation
    Zhao, Mengnan
    Zhang, Lihe
    Wang, Wei
    Kong, Yuqiu
    Yin, Baocai
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 3210 - 3225
  • [10] Scene Graph Generation With Hierarchical Context
    Ren, Guanghui
    Ren, Lejian
    Liao, Yue
    Liu, Si
    Li, Bo
    Han, Jizhong
    Yan, Shuicheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 909 - 915