Deep Hierarchical Semantic Segmentation

被引:80
|
作者
Li, Liulei [1 ,5 ]
Zhou, Tianfei [2 ]
Wang, Wenguan [3 ]
Li, Jianwu [1 ]
Yang, Yi [4 ]
机构
[1] Beijing Inst Technol, Beijing, Peoples R China
[2] Swiss Fed Inst Technol, Zurich, Switzerland
[3] Univ Technol Sydney, AAII, ReLER, Sydney, NSW, Australia
[4] Zhejiang Univ, CCAI, Hangzhou, Peoples R China
[5] Baidu Res, Beijing, Peoples R China
基金
北京市自然科学基金; 澳大利亚研究理事会;
关键词
CLASSIFICATION;
D O I
10.1109/CVPR52688.2022.00131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans are able to recognize structured relations in observation, allowing us to decompose complex scenes into simpler parts and abstract the visual world in multiple levels. However, such hierarchical reasoning ability of human perception remains largely unexplored in current literature of semantic segmentation. Existing work is often aware of flatten labels and predicts target classes exclusively for each pixel. In this paper, we instead address hierarchical semantic segmentation (HSS), which aims at structured, pixel-wise description of visual observation in terms of a class hierarchy. We devise H SSN , a general HSS framework that tackles two critical issues in this task: i) how to efficiently adapt existing hierarchy-agnostic segmentation networks to the HSS setting, and ii) how to leverage the hierarchy information to regularize HSS network learning. To address i), HSSN directly casts HSS as a pixel-wise multi-label classification task, only bringing minimal architecture change to current segmentation models. To solve ii), HSSN first explores inherent properties of the hierarchy as a training objective, which enforces segmentation predictions to obey the hierarchy structure. Further, with hierarchy-induced margin constraints, HSSNreshapes the pixel embedding space, so as to generate well-structured pixel representations and improve segmentation eventually. We conduct experiments on four semantic segmentation datasets (i.e., Mapillary Vistas 2.0, City-scapes, LIP, and PASCAL-Person-Part), with different class hierarchies, segmentation network architectures and backbones, showing the generalization and superiority of HSSN.
引用
收藏
页码:1236 / 1247
页数:12
相关论文
共 50 条
  • [41] Hierarchically Gated Deep Networks for Semantic Segmentation
    Qi, Guo-Jun
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2267 - 2275
  • [42] Position attention optimized deep semantic segmentation
    Rui Zhao
    Xiaoyan Yu
    Xianwei Rong
    Multimedia Tools and Applications, 2024, 83 : 29531 - 29545
  • [43] Efficient and robust deep networks for semantic segmentation
    Oliveira, Gabriel L.
    Bollen, Claas
    Burgard, Wolfram
    Brox, Thomas
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (4-5): : 472 - 491
  • [44] Image Classification and Semantic Segmentation with Deep Learning
    Quazi, Saiman
    Musa, Sarhan M.
    6TH IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2021,
  • [45] Semantic Image Segmentation With Propagating Deep Aggregation
    Ji, Jian
    Li, Sitong
    Xiong, Jian
    Chen, Peng
    Miao, Qiguang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (12) : 9732 - 9742
  • [46] How deep learning is empowering semantic segmentationTraditional and deep learning techniques for semantic segmentation: A comparison
    Uroosa Sehar
    Muhammad Luqman Naseem
    Multimedia Tools and Applications, 2022, 81 : 30519 - 30544
  • [47] Blood Cell Images Segmentation using Deep Learning Semantic Segmentation
    Thanh Tran
    Kwon, Oh-Heum
    Kwon, Ki-Ryong
    Lee, Suk-Hwan
    Kang, Kyung-Won
    2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION ENGINEERING (ICECE 2018), 2018, : 13 - 16
  • [48] Wildfire Segmentation using Deep-RegSeg Semantic Segmentation Architecture
    Ghali, Rafik
    Akhloufi, Moulay A.
    Mseddi, Wided Souidene
    Jmal, Marwa
    19TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2022, 2022, : 149 - 154
  • [49] Litchi Flower and Leaf Segmentation and Recognition Based on Deep Semantic Segmentation
    Xiong J.
    Liu B.
    Zhong Z.
    Chen S.
    Zheng Z.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2021, 52 (06): : 252 - 258
  • [50] Hierarchical segmentation using latent semantic indexing in scale space
    Slaney, M
    Ponceleon, D
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 1437 - 1440