Structured Neural Motifs: Scene Graph Parsing via Enhanced Context

被引:3
|
作者
Li, Yiming [1 ,4 ]
Yang, Xiaoshan [2 ,3 ,4 ]
Xu, Changsheng [1 ,2 ,3 ,4 ]
机构
[1] HeFei Univ Technol, Hefei, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
来源
MULTIMEDIA MODELING (MMM 2020), PT II | 2020年 / 11962卷
基金
中国国家自然科学基金;
关键词
Scene graph; Deep learning; LSTMs;
D O I
10.1007/978-3-030-37734-2_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene graph is one kind of structured representation of the visual content in an image. It is helpful for complex visual understanding tasks such as image captioning, visual question answering and semantic image retrieval. Since the real-world images always have multiple object instances and complex relationships, the context information is extremely important for scene graph generation. It has been noted that the context dependencies among different nodes in the scene graph are asymmetric, which meas it is highly possible to directly predict relationship labels based on object labels but not vice-versa. Based on this finding, the existing motifs network has successfully exploited the context patterns among object nodes and the dependencies between the object nodes and the relation nodes. However, the spatial information and the context dependencies among relation nodes are neglected. In this work, we propose Structured Motif Network (StrcMN) which predicts object labels and pairwise relationships by mining more complete global context features. The experiments show that our model significantly outperforms previous methods on the VRD and Visual Genome datasets.
引用
收藏
页码:175 / 188
页数:14
相关论文
共 50 条
  • [31] Improved Code Summarization via a Graph Neural Network
    LeClair, Alexander
    Haque, Sakib
    Wu, Lingfei
    McMillan, Collin
    2020 IEEE/ACM 28TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2020, : 184 - 195
  • [32] MoGE: Graph Context Enhanced Multi-Task Recommendation Method
    Zhou J.
    Hu B.-B.
    Zhang Z.-Q.
    Chen C.-C.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3377 - 3387
  • [33] Structured learning via convolutional neural networks for vehicle detection
    Maqueda, Ana I.
    del Blanco, Carlos R.
    Jaureguizar, Fernando
    Garcia, Narciso
    REAL-TIME IMAGE AND VIDEO PROCESSING 2017, 2017, 10223
  • [34] Context-Aware Knowledge Graph Framework for Traffic Speed Forecasting Using Graph Neural Network
    Zhang, Yatao
    Wang, Yi
    Gao, Song
    Raubal, Martin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 3885 - 3902
  • [35] Quantum-enhanced deep neural network architecture for image scene classification
    Chalumuri, Avinash
    Kune, Raghavendra
    Kannan, S.
    Manoj, B. S.
    QUANTUM INFORMATION PROCESSING, 2021, 20 (11)
  • [36] Quantum-enhanced deep neural network architecture for image scene classification
    Avinash Chalumuri
    Raghavendra Kune
    S. Kannan
    B. S. Manoj
    Quantum Information Processing, 2021, 20
  • [37] Trajectory-User Linking via Graph Neural Network
    Zhou, Fan
    Chen, Shupei
    Wu, Jin
    Cao, Chengtai
    Zhang, Shengming
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [38] Bayesian Graph Convolutional Neural Networks via Tempered MCMC
    Chandra, Rohitash
    Bhagat, Ayush
    Maharana, Manavendra
    Krivitsky, Pavel N.
    IEEE ACCESS, 2021, 9 : 130353 - 130365
  • [39] Vessel Segmentation via Link Prediction of Graph Neural Networks
    Yu, Hao
    Zhao, Jie
    Zhang, Li
    MULTISCALE MULTIMODAL MEDICAL IMAGING, MMMI 2022, 2022, 13594 : 34 - 43
  • [40] Learning Deep Graph Representations via Convolutional Neural Networks
    Ye, Wei
    Askarisichani, Omid
    Jones, Alex
    Singh, Ambuj
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2268 - 2279