Multi-Level Contextual RNNs With Attention Model for Scene Labeling

被引:26
|
作者
Fan, Heng [1 ]
Mei, Xue [2 ]
Prokhorov, Danil [2 ]
Ling, Haibin [1 ]
机构
[1] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19121 USA
[2] Toyota Res Inst, Ann Arbor, MI 48105 USA
关键词
Scene labeling; scene understanding; contextual recurrent neural networks (CRNNs); attention model; intelligent transportation system; FEATURES; SEGMENTATION;
D O I
10.1109/TITS.2017.2775628
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Image context in image is crucial for improving scene labeling. While the existing methods only exploit local context generated from a small surrounding area of an image patch or a pixel, the long-range and global contextual information is often ignored. To handle this issue, we propose a novel approach for scene labeling by multi-level contextual recurrent neural networks (RNNs). We encode three kinds of contextual cues, viz., local context, global context, and image topic context in structural RNNs to model long-range local and global dependencies in an image. In this way, our method is able to "see" the image in terms of both long-range local and holistic views, and make a more reliable inference for image labeling. Besides, we integrate the proposed contextual RNNs into hierarchical convolutional neural networks, and exploit dependence relationships at multiple levels to provide rich spatial and semantic information. Moreover, we adopt an attention model to effectively merge multiple levels and show that it outperforms average- or max-pooling fusion strategies. Extensive experiments demonstrate that the proposed approach achieves improved results on the CamVid, KITTI, SiftFlow, Stanford Background, and Cityscapes data sets.
引用
收藏
页码:3475 / 3485
页数:11
相关论文
共 50 条
  • [1] MULTI-LEVEL ATTENTION MODEL WITH DEEP SCATTERING SPECTRUM FOR ACOUSTIC SCENE CLASSIFICATION
    Li, Zhitong
    Hou, Yuanbo
    Xie, Xiang
    Li, Shengchen
    Zhang, Liqiang
    Du, Shixuan
    Liu, Wei
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 396 - 401
  • [2] A Multi-level Attention Model for Text Matching
    Sun, Qiang
    Wu, Yue
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 142 - 153
  • [3] Multi-Scale Multi-Level Generative Model in Scene Classification
    Xie, Wenjie
    Xu, De
    Tang, Yingjun
    Cui, Geng
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (01): : 167 - 170
  • [4] MLFC-net: A multi-level feature combination attention model for remote sensing scene classification
    Wang, Deyi
    Zhang, Chengkun
    Han, Min
    COMPUTERS & GEOSCIENCES, 2022, 160
  • [5] A Multi-Level Contextual Model For Person Recognition in Photo Albums
    Li, Haoxiang
    Brandt, Jonathan
    Lin, Zhe
    Shen, Xiaohui
    Hua, Gang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1297 - 1305
  • [6] Multi-level Contextual Type Theory
    Boespflug, Mathieu
    Pientka, Brigitte
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2011, (71): : 29 - 43
  • [7] When CNNs meet random RNNs: Towards multi-level analysis for RGB-D object and scene recognition
    Caglayan, Ali
    Imamoglu, Nevrez
    Can, Ahmet Burak
    Nakamura, Ryosuke
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 217
  • [8] A Multi-Level Attention Model for Remote Sensing Image Captions
    Li, Yangyang
    Fang, Shuangkang
    Jiao, Licheng
    Liu, Ruijiao
    Shang, Ronghua
    REMOTE SENSING, 2020, 12 (06)
  • [9] Multi-level Stereo Attention Model for Center Channel Extraction
    Lim, Wootaek
    Beack, Seungkwon
    Lee, Taejin
    2019 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2019,
  • [10] Multi-level attention model for person re-identification
    Yan, Yichao
    Ni, Bingbing
    Liu, Jinxian
    Yang, Xiaokang
    PATTERN RECOGNITION LETTERS, 2019, 127 : 156 - 164