FCGNet: Foreground and Class Guided Network for human parsing

被引：0

作者：

Jang, Jaehyuk ^{[1
]}

Wang, Yooseung ^{[1
]}

Kim, Changick ^{[1
]}

机构：

[1] Korea Adv Inst Sci & Technol KAIST, Sch Elect Engn, Daejeon 34141, South Korea

来源：

PATTERN RECOGNITION | 2025年 / 157卷

关键词：

Human parsing; Semantic segmentation; Graph convolutional network;

D O I：

10.1016/j.patcog.2024.110879

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Understanding the inherent hierarchical human structure is key to human parsing. To capture the human- specific characteristic, it is necessary to focus on the spatial and class information corresponding to the foreground (i.e., human) in an image. Inspired by these insights, we introduce two supervision signals, spatial foreground information and existent class information in the image. By utilizing foreground information as guidance, the network is guided to generate a human-focused feature map and capture the pixel-wise hierarchical characteristics by computing correlations between pixels. Furthermore, we guide the network to consider class information in the image at the feature level and capture the class-wise relationship by calculating correlations between channels. Moreover, during the training phase, we prevent the network from misclassifying pixels into confusing classes by providing the existent class information in the image to the network at the prediction level. Our model achieves state-of-the-art performance with significantly reduced parameters and Multiply-Accumulate Operations (MACs) in three public benchmarks.

引用

页数：12

共 50 条

[41] BiEPNet: Bilateral Edge-perceiving Network for High-Resolution Human Parsing
Gong, Qiqi
Wei, Yunchao
Zhao, Yao
PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, ICDSP 2024, 2024, : 197 - 204
[42] PGNet: Panoptic parsing guided deep stereo matching
Chen, Shuya
Xiang, Zhiyu
Qiao, Chengyu
Chen, Yiman
Bai, Tingming
NEUROCOMPUTING, 2021, 463 : 609 - 622
[43] Parsing Based on Parselets: A Unified Deformable Mixture Model for Human Parsing
Dong, Jian
Chen, Qiang
Huang, Zhongyang
Yang, Jianchao
Yan, Shuicheng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (01) : 88 - 101
[44] Learning rebalanced human parsing model from imbalanced datasets
Huang, Enbo
Su, Zhuo
Zhou, Fan
Wang, Ruomei
IMAGE AND VISION COMPUTING, 2020, 99
[45] Human parsing by weak structural label
Chen, Zhiyong
Liu, Si
Zhai, Yanlong
Lin, Jia
Cao, Xiaochun
Yang, Liang
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (15) : 19795 - 19809
[46] Phase Contour Enhancement Network for Clothing Parsing
Yu, Feng
Zhang, Ying
Li, Huiyin
Du, Chenghu
Liu, Li
Jiang, Minghua
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 2784 - 2793
[47] Self-Correction for Human Parsing
Li, Peike
Xu, Yunqiu
Wei, Yunchao
Yang, Yi
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 3260 - 3271
[48] Human parsing by weak structural label
Zhiyong Chen
Si Liu
Yanlong Zhai
Jia Lin
Xiaochun Cao
Liang Yang
Multimedia Tools and Applications, 2018, 77 : 19795 - 19809
[49] EHANet: An Effective Hierarchical Aggregation Network for Face Parsing
Luo, Ling
Xue, Dingyu
Feng, Xinglong
APPLIED SCIENCES-BASEL, 2020, 10 (09):
[50] Efficient Light Deep Network for Street Scene Parsing
Wang, ZheHui
Zhao, Sanyuan
Shen, Jianbing
Lei, Zhengchao
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 42 - 45

← 1 2 3 4 5 →