FCGNet: Foreground and Class Guided Network for human parsing

Cited by: 0

Authors:

Jang, Jaehyuk [1]

Wang, Yooseung [1]

Kim, Changick [1]

Affiliations:

[1] Korea Advanced Institute of Science and Technology (KAIST), School of Electrical Engineering, Daejeon 34141, South Korea

Source:

Pattern Recognition | 2025 / Vol. 157

Keywords:

Human parsing; Semantic segmentation; Graph convolutional network

DOI:

10.1016/j.patcog.2024.110879

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Understanding the inherent hierarchical human structure is key to human parsing. To capture the human-specific characteristic, it is necessary to focus on the spatial and class information corresponding to the foreground (i.e., human) in an image. Inspired by these insights, we introduce two supervision signals: spatial foreground information and existent class information in the image. By utilizing foreground information as guidance, the network is guided to generate a human-focused feature map and capture the pixel-wise hierarchical characteristics by computing correlations between pixels. Furthermore, we guide the network to consider class information in the image at the feature level and capture the class-wise relationship by calculating correlations between channels. Moreover, during the training phase, we prevent the network from misclassifying pixels into confusing classes by providing the existent class information in the image to the network at the prediction level. Our model achieves state-of-the-art performance with significantly reduced parameters and Multiply-Accumulate Operations (MACs) on three public benchmarks.
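
The three mechanisms named in the abstract (correlations between pixels under foreground guidance, correlations between channels, and use of existent class information at the prediction level) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no architectural details, so the function names, the softmax-normalized affinity matrices, and the `-1e9` masking constant below are all assumptions chosen only to show one plausible reading of each idea.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def pixel_correlation(feat, fg_mask):
    """Hypothetical pixel-wise correlation with foreground guidance.

    feat: (C, H, W) feature map; fg_mask: (H, W) values in [0, 1].
    """
    c, h, w = feat.shape
    # Weight features by the foreground mask, then flatten to (C, N).
    x = (feat * fg_mask[None]).reshape(c, h * w)
    # (N, N) affinity: row i is a distribution over pixels correlated with pixel i.
    affinity = softmax(x.T @ x, axis=-1)
    # Each pixel aggregates the features of the pixels it correlates with.
    return (x @ affinity.T).reshape(c, h, w)

def channel_correlation(feat):
    """Hypothetical channel-wise correlation. feat: (C, H, W)."""
    c, h, w = feat.shape
    x = feat.reshape(c, h * w)
    # (C, C) affinity between channels.
    affinity = softmax(x @ x.T, axis=-1)
    return (affinity @ x).reshape(c, h, w)

def mask_absent_classes(logits, class_present):
    """Assumed prediction-level guidance: suppress classes not in the image.

    logits: (K, H, W); class_present: (K,) boolean vector.
    """
    masked = logits.copy()
    masked[~class_present] = -1e9  # assumed large negative constant
    return masked
```

In this sketch, taking the per-pixel argmax of the masked logits can never select a class marked absent, which is one simple way to keep pixels out of confusing classes that do not occur in the image.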


Pages: 12

