FCGNet: Foreground and Class Guided Network for human parsing

被引:0
|
作者
Jang, Jaehyuk [1 ]
Wang, Yooseung [1 ]
Kim, Changick [1 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Sch Elect Engn, Daejeon 34141, South Korea
关键词
Human parsing; Semantic segmentation; Graph convolutional network;
D O I
10.1016/j.patcog.2024.110879
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding the inherent hierarchical human structure is key to human parsing. To capture the human- specific characteristic, it is necessary to focus on the spatial and class information corresponding to the foreground (i.e., human) in an image. Inspired by these insights, we introduce two supervision signals, spatial foreground information and existent class information in the image. By utilizing foreground information as guidance, the network is guided to generate a human-focused feature map and capture the pixel-wise hierarchical characteristics by computing correlations between pixels. Furthermore, we guide the network to consider class information in the image at the feature level and capture the class-wise relationship by calculating correlations between channels. Moreover, during the training phase, we prevent the network from misclassifying pixels into confusing classes by providing the existent class information in the image to the network at the prediction level. Our model achieves state-of-the-art performance with significantly reduced parameters and Multiply-Accumulate Operations (MACs) in three public benchmarks.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] TAO: A TRILATERAL AWARENESS OPERATION FOR HUMAN PARSING
    Huang, Enbo
    Su, Zhuo
    Zhou, Fan
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [32] Enhanced Context Learning with Transformer for Human Parsing
    Song, Jingya
    Shi, Qingxuan
    Li, Yihang
    Yang, Fang
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [33] A Universal Decoupled Training Framework for Human Parsing
    Li, Yang
    Zuo, Huahong
    Han, Ping
    SENSORS, 2022, 22 (16)
  • [34] EGA-Net: Edge Guided Attention Network With Label Refinement for Parsing of Animal Body Parts
    Raghavendra, S.
    Abhilash, S. K.
    Nookala, Venu Madhav
    Girisha, S.
    Adesh, N. D.
    IEEE ACCESS, 2024, 12 : 149162 - 149172
  • [35] A Part-Based Deep Neural Network Cascade Model for Human Parsing
    Zhou, Yanghong
    Mok, P. Y.
    Zhou, Shijie
    IEEE ACCESS, 2019, 7 : 160101 - 160111
  • [36] Clicking Matters: Towards Interactive Human Parsing
    Gao, Yutong
    Liang, Liqian
    Lang, Congyan
    Feng, Songhe
    Li, Yidong
    Wei, Yunchao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3190 - 3203
  • [37] Lightweight cross-guided contextual perceptive network for visible-infrared urban road scene parsing
    Liu, Jinfu
    Zhou, Wujie
    Fang, Meixin
    Mao, Shanshan
    Yang, Rongwang
    INFRARED PHYSICS & TECHNOLOGY, 2024, 137
  • [38] A Review on Deep Learning Techniques Applied to Human Parsing
    Shao J.
    Huang X.
    Cao K.-T.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2019, 48 (05): : 644 - 654
  • [39] MCFNet: Multi-Attentional Class Feature Augmentation Network for Real-Time Scene Parsing
    Wang, Xizhong
    Liu, Rui
    Yang, Xin
    Zhang, Qiang
    Zhou, Dongsheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (06)
  • [40] Class-Guided Feature Decoupling Network for Airborne Image Segmentation
    Zhou, Feng
    Hang, Renlong
    Liu, Qingshan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (03): : 2245 - 2255