BiEPNet: Bilateral Edge-perceiving Network for High-Resolution Human Parsing

Cited by: 0

Authors:

Gong, Qiqi ^{[1]}

Wei, Yunchao ^{[1]}

Zhao, Yao ^{[1]}

Affiliations:

[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing Key Lab Adv Informat Sci & Network, Beijing, Peoples R China

Source:

PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, ICDSP 2024 | 2024

Funding:

National Key Research and Development Program of China;

Keywords:

BiEPNet; Human parsing; High resolution; Computer vision;

DOI:

10.1145/3653876.3653898

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Human parsing is a fundamental task that segments human images into distinct body parts and has broad potential applications. Advances in image-capturing devices have led to a growing number of high-resolution human images. In high-resolution scenarios, receptive field, detail loss, and memory usage form a three-way trade-off. Existing human parsing methods designed for low-resolution inputs struggle to process high-resolution images efficiently because of their heavy computation and memory demands. Some methods save resources by aggressively downsampling or encoding high-resolution inputs, at the cost of poor performance on details. To resolve these issues, we propose the Bilateral Edge-Perceiving Network (BiEPNet), consisting of a resource-friendly semantic-perceiving branch that acquires sufficient global information and a simple yet effective edge-perceiving branch that refines details. An attention mechanism is used to enhance the perception of context and details simultaneously, leading to better performance in boundary regions. To verify the effectiveness of BiEPNet, we contribute a high-resolution human parsing dataset, Human4K, containing 4,000 images with more than five million pixels. Extensive experiments on Human4K demonstrate that our method outperforms state-of-the-art methods.

References

Pages: 197-204

Page count: 8

42 entries in total

[1] A COMPUTATIONAL APPROACH TO EDGE-DETECTION
CANNY, J
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (06) : 679 - 698
[2] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[4] Attention to Scale: Scale-aware Semantic Image Segmentation
Chen, Liang-Chieh
Yang, Yi
Wang, Jiang
Xu, Wei
Yuille, Alan L.
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649
[5] Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images
Chen, Wuyang
Jiang, Ziyu
Wang, Zhangyang
Cui, Kexin
Qian, Xiaoning
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8916 - 8925
[6] Boundary IoU: Improving Object-Centric Image Segmentation Evaluation
Cheng, Bowen
Girshick, Ross
Dollar, Piotr
Berg, Alexander C.
Kirillov, Alexander
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15329 - 15337
[7] SPGNet: Semantic Prediction Guidance for Scene Parsing
Cheng, Bowen
Chen, Liang-Chieh
Wei, Yunchao
Zhu, Yukun
Huang, Zilong
Xiong, Jinjun
Huang, Thomas S.
Hwu, Wen-Mei
Shi, Honghui
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5217 - 5227
[8] Cheng HK, 2020, PROC CVPR IEEE, P8887, DOI 10.1109/CVPR42600.2020.00891
[9] Progressive Semantic Segmentation
Chuong Huynh
Anh Tuan Tran
Khoa Luu
Minh Hoai
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16750 - 16759
[10] Contributors M., 2020, MMSegmentation: Openmmlab semantic segmentation toolbox and benchmark