BiEPNet: Bilateral Edge-perceiving Network for High-Resolution Human Parsing

被引:0
作者
Gong, Qiqi [1 ]
Wei, Yunchao [1 ]
Zhao, Yao [1 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing Key Lab Adv Informat Sci & Network, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, ICDSP 2024 | 2024年
基金
国家重点研发计划;
关键词
BiEPNet; Human parsing; High resolution; Computer vision;
D O I
10.1145/3653876.3653898
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Human parsing is a fundamental task aimed at segmenting human images into distinct body parts and holds vast potential applications. Nowadays, the advancement of image-capturing devices has led to a growing number of high-resolution human images. Receptive field, detail loss and memory usage are a triplet of contradictions in high-resolution scenarios. Existing human parsing methods designed for low-resolution inputs struggle to process high-resolution images efficiently due to their massive demands for computation and memory. Some methods save resources by overwhelmingly downsampling or encoding high-resolution inputs at the cost of poor performance on details. To resolve the issues above, we propose the Bilateral Edge-Perceiving Network (BiEPNet), consisting of a resources-friendly semantic-perceiving branch to acquire sufficient global information and a simple yet effective edge-perceiving branch used to refine details. The attention mechanism is utilized to simultaneously enhance the perception of context and details, leading to better performance on the boundary regions. To verify the effectiveness of BiEPNet, we contribute a high-resolution human parsing dataset, Human4K, containing 4,000 images with more than five million pixels. Extensive experiments on Human4K demonstrate that our method effectively outperforms the state-of-the-art methods.
引用
收藏
页码:197 / 204
页数:8
相关论文
共 42 条
  • [2] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
  • [3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [4] Attention to Scale: Scale-aware Semantic Image Segmentation
    Chen, Liang-Chieh
    Yang, Yi
    Wang, Jiang
    Xu, Wei
    Yuille, Alan L.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649
  • [5] Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images
    Chen, Wuyang
    Jiang, Ziyu
    Wang, Zhangyang
    Cui, Kexin
    Qian, Xiaoning
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8916 - 8925
  • [6] Boundary IoU: Improving Object-Centric Image Segmentation Evaluation
    Cheng, Bowen
    Girshick, Ross
    Dollar, Piotr
    Berg, Alexander C.
    Kirillov, Alexander
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15329 - 15337
  • [7] SPGNet: Semantic Prediction Guidance for Scene Parsing
    Cheng, Bowen
    Chen, Liang-Chieh
    Wei, Yunchao
    Zhu, Yukun
    Huang, Zilong
    Xiong, Jinjun
    Huang, Thomas S.
    Hwu, Wen-Mei
    Shi, Honghui
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5217 - 5227
  • [8] Cheng HK, 2020, PROC CVPR IEEE, P8887, DOI 10.1109/CVPR42600.2020.00891
  • [9] Progressive Semantic Segmentation
    Chuong Huynh
    Anh Tuan Tran
    Khoa Luu
    Minh Hoai
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16750 - 16759
  • [10] Contributors M., 2020, MMSegmentation: Openmmlab semantic segmentation toolbox and benchmark