Pyramid Scene Parsing Network

Cited by: 9345
Authors
Zhao, Hengshuang [1]
Shi, Jianping [2]
Qi, Xiaojuan [1]
Wang, Xiaogang [1]
Jia, Jiaya [1]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[2] SenseTime Grp Ltd, Beijing, Peoples R China
Source
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017
DOI
10.1109/CVPR.2017.660
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Scene parsing is challenging for unrestricted open vocabulary and diverse scenes. In this paper, we exploit the capability of global context information by different-region-based context aggregation through our pyramid pooling module together with the proposed pyramid scene parsing network (PSPNet). Our global prior representation is effective to produce good quality results on the scene parsing task, while PSPNet provides a superior framework for pixel-level prediction. The proposed approach achieves state-of-the-art performance on various datasets. It came first in ImageNet scene parsing challenge 2016, PASCAL VOC 2012 benchmark and Cityscapes benchmark. A single PSPNet yields the new record of mIoU accuracy 85.4% on PASCAL VOC 2012 and accuracy 80.2% on Cityscapes.
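The abstract reports results as mIoU (mean intersection-over-union). As a rough illustration of that metric only, and not of the benchmarks' official evaluation code, the sketch below computes mIoU over flat lists of per-pixel class labels. The function name `mean_iou`, its arguments, and the toy labels are assumptions made for this example; the choice to skip classes absent from both prediction and ground truth is also an assumption.

```python
from collections import Counter


def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union over classes (hypothetical helper).

    pred and target are equal-length sequences of integer class labels,
    one per pixel. Classes that appear in neither sequence are skipped
    (an assumption of this sketch).
    """
    inter = Counter()         # pixels where prediction and truth agree, per class
    pred_count = Counter()    # pixels predicted as each class
    target_count = Counter()  # pixels labelled as each class in the ground truth
    for p, t in zip(pred, target):
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            inter[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - inter[c]
        if union == 0:
            continue  # class absent from both prediction and ground truth
        ious.append(inter[c] / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes and six pixels (made-up labels).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")
```

In this toy case the per-class IoUs are 1/3, 2/3 and 1/2, so the mean is 0.5. Benchmark evaluations typically accumulate these counts over the whole test set before averaging, rather than per image.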
Pages: 6230 - 6239
Page count: 10
Related papers
50 in total
  • [21] Multistage Shallow Pyramid Parsing for Road Scene Understanding Based on Semantic Segmentation
    Nurhadiyatna, Adi
    Loncaric, Sven
    PROCEEDINGS OF THE 2019 11TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2019), 2019, : 198 - 203
  • [22] Fully Contextual Network for Hyperspectral Scene Parsing
    Wang, Di
    Du, Bo
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [23] Efficient Light Deep Network for Street Scene Parsing
    Wang, ZheHui
    Zhao, Sanyuan
    Shen, Jianbing
    Lei, Zhengchao
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 42 - 45
  • [24] Attentive Temporal Pyramid Network for Dynamic Scene Classification
    Huang, Yuanjun
    Cao, Xianbin
    Zhen, Xiantong
    Han, Jungong
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8497 - 8504
  • [25] Scene Text Detection with Supervised Pyramid Context Network
    Xie, Enze
    Zang, Yuhang
    Shao, Shuai
    Yu, Gang
    Yao, Cong
    Li, Guangyao
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9038 - 9045
  • [26] A New Feature Pyramid Network For Road Scene Segmentation
    Zhan, Wujing
    Chen, Jiaxing
    Fan, Lei
    Ou, Xinqi
    Chen, Long
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 3487 - 3492
  • [27] Pyramid scene parsing network in 3D: Improving semantic segmentation of point clouds with multi-scale contextual information
    Fang, Hao
    Lafarge, Florent
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 154 : 246 - 258
  • [28] DSCA-PSPNet: Dynamic spatial-channel attention pyramid scene parsing network for sugarcane field segmentation in satellite imagery
    Yuan, Yujian
    Yang, Lina
    Chang, Kan
    Huang, Youju
    Yang, Haoyan
    Wang, Jiale
    FRONTIERS IN PLANT SCIENCE, 2024, 14
  • [29] Interpretable Model Based on Pyramid Scene Parsing Features for Brain Tumor MRI Image Segmentation
    Zhao, Mingyang
    Xin, Junchang
    Wang, Zhongyang
    Wang, Xinlei
    Wang, Zhiqiong
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [30] Global-Guided Selective Context Network for Scene Parsing
    Jiang, Jie
    Liu, Jing
    Fu, Jun
    Zhu, Xinxin
    Li, Zechao
    Lu, Hanqing
    IEEE Transactions on Neural Networks and Learning Systems, 2022, 33 (04): 1752 - 1764