Multi-Level Contextual RNNs With Attention Model for Scene Labeling

被引:26
|
作者
Fan, Heng [1 ]
Mei, Xue [2 ]
Prokhorov, Danil [2 ]
Ling, Haibin [1 ]
机构
[1] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19121 USA
[2] Toyota Res Inst, Ann Arbor, MI 48105 USA
关键词
Scene labeling; scene understanding; contextual recurrent neural networks (CRNNs); attention model; intelligent transportation system; FEATURES; SEGMENTATION;
D O I
10.1109/TITS.2017.2775628
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Image context in image is crucial for improving scene labeling. While the existing methods only exploit local context generated from a small surrounding area of an image patch or a pixel, the long-range and global contextual information is often ignored. To handle this issue, we propose a novel approach for scene labeling by multi-level contextual recurrent neural networks (RNNs). We encode three kinds of contextual cues, viz., local context, global context, and image topic context in structural RNNs to model long-range local and global dependencies in an image. In this way, our method is able to "see" the image in terms of both long-range local and holistic views, and make a more reliable inference for image labeling. Besides, we integrate the proposed contextual RNNs into hierarchical convolutional neural networks, and exploit dependence relationships at multiple levels to provide rich spatial and semantic information. Moreover, we adopt an attention model to effectively merge multiple levels and show that it outperforms average- or max-pooling fusion strategies. Extensive experiments demonstrate that the proposed approach achieves improved results on the CamVid, KITTI, SiftFlow, Stanford Background, and Cityscapes data sets.
引用
收藏
页码:3475 / 3485
页数:11
相关论文
共 50 条
  • [21] Integration of multi-level semantics in PTMs with an attention model for question matching
    Ye, Zheng
    Che, Linwei
    Ge, Jun
    Qin, Jun
    Liu, Jing
    PLOS ONE, 2024, 19 (08):
  • [22] A Multi-level Mesh Mutual Attention Model for Visual Question Answering
    Lei, Zhi
    Zhang, Guixian
    Wu, Lijuan
    Zhang, Kui
    Liang, Rongjiao
    DATA SCIENCE AND ENGINEERING, 2022, 7 (04) : 339 - 353
  • [23] Multimodal Multi-Level Fusion using Contextual Information
    Vybornova, Olga
    Gemo, Monica
    Macq, Benoit
    ERCIM NEWS, 2007, (70): : 61 - 62
  • [24] Scene graph generation by multi-level semantic tasks
    Peng Tian
    Hongwei Mo
    Laihao Jiang
    Applied Intelligence, 2021, 51 : 7781 - 7793
  • [25] MULTI-LEVEL SCENE UNDERSTANDING VIA HIERARCHICAL CLASSIFICATION
    Clouse, Hamilton Scott
    Bian, Xiao
    Gentimis, Thanos
    Krim, Hamid
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 966 - 970
  • [26] Multi-level Adaptive Active Learning for Scene Classification
    Li, Xin
    Guo, Yuhong
    COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 : 234 - 249
  • [27] Scene graph generation by multi-level semantic tasks
    Tian, Peng
    Mo, Hongwei
    Jiang, Laihao
    APPLIED INTELLIGENCE, 2021, 51 (11) : 7781 - 7793
  • [28] An image inpainting model based on channel attention gated convolution and multi-level attention mechanism
    Zhao, Sihan
    Li, Chunmeng
    Zhang, Chenyang
    Yang, Xiaozhong
    DISPLAYS, 2025, 87
  • [29] Visual Relation Detection with Multi-Level Attention
    Zheng, Sipeng
    Chen, Shizhe
    Jin, Qin
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 121 - 129
  • [30] Attention as a multi-level system of weights and balances
    Narhi-Martinez, William
    Dube, Blaire
    Golomb, Julie D.
    WILEY INTERDISCIPLINARY REVIEWS-COGNITIVE SCIENCE, 2023, 14 (01)