Multi-Level Contextual RNNs With Attention Model for Scene Labeling

Cited by: 26
Authors
Fan, Heng [1]
Mei, Xue [2]
Prokhorov, Danil [2]
Ling, Haibin [1]
Affiliations
[1] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19121 USA
[2] Toyota Res Inst, Ann Arbor, MI 48105 USA
Keywords
Scene labeling; scene understanding; contextual recurrent neural networks (CRNNs); attention model; intelligent transportation system; FEATURES; SEGMENTATION
DOI
10.1109/TITS.2017.2775628
Chinese Library Classification (CLC) number
TU [Architectural Science]
Discipline classification code
0813
Abstract
Context in an image is crucial for improving scene labeling. Existing methods exploit only local context generated from a small area surrounding an image patch or a pixel, and often ignore long-range and global contextual information. To handle this issue, we propose a novel approach for scene labeling based on multi-level contextual recurrent neural networks (RNNs). We encode three kinds of contextual cues, viz., local context, global context, and image topic context, in structural RNNs to model long-range local and global dependencies in an image. In this way, our method is able to "see" the image from both long-range local and holistic views, and to make a more reliable inference for image labeling. In addition, we integrate the proposed contextual RNNs into hierarchical convolutional neural networks, and exploit dependence relationships at multiple levels to provide rich spatial and semantic information. Moreover, we adopt an attention model to effectively merge the multiple levels and show that it outperforms average- or max-pooling fusion strategies. Extensive experiments demonstrate that the proposed approach achieves improved results on the CamVid, KITTI, SiftFlow, Stanford Background, and Cityscapes data sets.
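The abstract contrasts attention-based merging of multiple levels with average- and max-pooling fusion. The following is a minimal sketch of that contrast for a single pixel, not the authors' implementation: the function name `fuse_levels`, the per-level class scores, and the attention logits are all hypothetical, and in a real model the logits would be predicted by a learned layer rather than passed in by hand.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_levels(level_scores, attention_logits=None, mode="attention"):
    """Fuse per-level class scores for one pixel (illustrative only).

    level_scores: list of L lists, each holding C class scores from one level.
    attention_logits: L floats; assumed to come from a learned attention
        layer in a real model, supplied directly here.
    mode: "attention", "average", or "max".
    """
    num_levels = len(level_scores)
    num_classes = len(level_scores[0])

    if mode == "max":
        # Max-pooling fusion: keep the largest score per class across levels.
        return [max(s[c] for s in level_scores) for c in range(num_classes)]

    if mode == "average":
        # Average fusion: every level gets the same fixed weight.
        weights = [1.0 / num_levels] * num_levels
    elif mode == "attention":
        # Attention fusion: weights depend on the input via the logits.
        weights = softmax(attention_logits)
    else:
        raise ValueError(f"unknown fusion mode: {mode}")

    return [
        sum(w * s[c] for w, s in zip(weights, level_scores))
        for c in range(num_classes)
    ]


# Two levels, two classes, with made-up scores.
scores = [[1.0, 3.0], [3.0, 5.0]]
avg_fused = fuse_levels(scores, mode="average")
max_fused = fuse_levels(scores, mode="max")
# Strongly favouring level 0 pulls the result towards that level's scores.
att_fused = fuse_levels(scores, attention_logits=[10.0, 0.0])
```

With equal logits the attention variant reduces to averaging; the point of attention fusion is that the weights can differ from pixel to pixel, whereas average and max pooling apply the same fixed rule everywhere.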
Pages: 3475-3485
Page count: 11