Structured Neural Motifs: Scene Graph Parsing via Enhanced Context

被引:3
作者
Li, Yiming [1 ,4 ]
Yang, Xiaoshan [2 ,3 ,4 ]
Xu, Changsheng [1 ,2 ,3 ,4 ]
机构
[1] HeFei Univ Technol, Hefei, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
来源
MULTIMEDIA MODELING (MMM 2020), PT II | 2020年 / 11962卷
基金
中国国家自然科学基金;
关键词
Scene graph; Deep learning; LSTMs;
D O I
10.1007/978-3-030-37734-2_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene graph is one kind of structured representation of the visual content in an image. It is helpful for complex visual understanding tasks such as image captioning, visual question answering and semantic image retrieval. Since the real-world images always have multiple object instances and complex relationships, the context information is extremely important for scene graph generation. It has been noted that the context dependencies among different nodes in the scene graph are asymmetric, which meas it is highly possible to directly predict relationship labels based on object labels but not vice-versa. Based on this finding, the existing motifs network has successfully exploited the context patterns among object nodes and the dependencies between the object nodes and the relation nodes. However, the spatial information and the context dependencies among relation nodes are neglected. In this work, we propose Structured Motif Network (StrcMN) which predicts object labels and pairwise relationships by mining more complete global context features. The experiments show that our model significantly outperforms previous methods on the VRD and Visual Genome datasets.
引用
收藏
页码:175 / 188
页数:14
相关论文
共 50 条
  • [41] Marine Target Detection via SpatialTemporal Graph Neural Network
    Wang, Xiang
    Cui, Guolong
    Wang, Yumiao
    Zhao, Wenjing
    Xiong, Kui
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [42] Information Diffusion Prediction via Dynamic Graph Neural Networks
    Cao, Zongmai
    Han, Kai
    Zhu, Jianfu
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 1099 - 1104
  • [43] Graph-CAM: Explainable Image Features via Graph Neural Network in Fourier Domain
    Shaposhnikov, Vladimir
    Bespalov, Iaroslav
    Dylov, Dmitry V.
    IEEE ACCESS, 2025, 13 : 55456 - 55465
  • [44] A BERT-enhanced Graph Neural Network for Knowledge Base Population
    Lim, Heechul
    Kim, Min-Soo
    2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 81 - 84
  • [45] Temporal Enhanced Multimodal Graph Neural Networks for Fake News Detection
    Qu, Zhibo
    Zhou, Fuhui
    Song, Xi
    Ding, Rui
    Yuan, Lu
    Wu, Qihui
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, : 1 - 13
  • [46] Syntax-guided text generation via graph neural network
    Qipeng Guo
    Xipeng Qiu
    Xiangyang Xue
    Zheng Zhang
    Science China Information Sciences, 2021, 64
  • [47] Learning to Drop: Robust Graph Neural Network via Topological Denoising
    Luo, Dongsheng
    Cheng, Wei
    Yu, Wenchao
    Zong, Bo
    Ni, Jingchao
    Chen, Haifeng
    Zhang, Xiang
    WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 779 - 787
  • [48] FlowX: Towards Explainable Graph Neural Networks via Message Flows
    Gui, Shurui
    Yuan, Hao
    Wang, Jie
    Lao, Qicheng
    Li, Kang
    Ji, Shuiwang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (07) : 4567 - 4578
  • [49] Syntax-guided text generation via graph neural network
    Guo, Qipeng
    Qiu, Xipeng
    Xue, Xiangyang
    Zhang, Zheng
    SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (05)
  • [50] Solving the kidney exchange problem via graph neural networks with no supervision
    Pimenta P.F.
    Avelar P.H.C.
    Lamb L.C.
    Neural Computing and Applications, 2024, 36 (25) : 15373 - 15388