FCGNet: Foreground and Class Guided Network for human parsing

被引:0
|
作者
Jang, Jaehyuk [1 ]
Wang, Yooseung [1 ]
Kim, Changick [1 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Sch Elect Engn, Daejeon 34141, South Korea
关键词
Human parsing; Semantic segmentation; Graph convolutional network;
D O I
10.1016/j.patcog.2024.110879
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding the inherent hierarchical human structure is key to human parsing. To capture the human- specific characteristic, it is necessary to focus on the spatial and class information corresponding to the foreground (i.e., human) in an image. Inspired by these insights, we introduce two supervision signals, spatial foreground information and existent class information in the image. By utilizing foreground information as guidance, the network is guided to generate a human-focused feature map and capture the pixel-wise hierarchical characteristics by computing correlations between pixels. Furthermore, we guide the network to consider class information in the image at the feature level and capture the class-wise relationship by calculating correlations between channels. Moreover, during the training phase, we prevent the network from misclassifying pixels into confusing classes by providing the existent class information in the image to the network at the prediction level. Our model achieves state-of-the-art performance with significantly reduced parameters and Multiply-Accumulate Operations (MACs) in three public benchmarks.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] BiEPNet: Bilateral Edge-perceiving Network for High-Resolution Human Parsing
    Gong, Qiqi
    Wei, Yunchao
    Zhao, Yao
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, ICDSP 2024, 2024, : 197 - 204
  • [42] PGNet: Panoptic parsing guided deep stereo matching
    Chen, Shuya
    Xiang, Zhiyu
    Qiao, Chengyu
    Chen, Yiman
    Bai, Tingming
    NEUROCOMPUTING, 2021, 463 : 609 - 622
  • [43] Parsing Based on Parselets: A Unified Deformable Mixture Model for Human Parsing
    Dong, Jian
    Chen, Qiang
    Huang, Zhongyang
    Yang, Jianchao
    Yan, Shuicheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (01) : 88 - 101
  • [44] Learning rebalanced human parsing model from imbalanced datasets
    Huang, Enbo
    Su, Zhuo
    Zhou, Fan
    Wang, Ruomei
    IMAGE AND VISION COMPUTING, 2020, 99
  • [45] Human parsing by weak structural label
    Chen, Zhiyong
    Liu, Si
    Zhai, Yanlong
    Lin, Jia
    Cao, Xiaochun
    Yang, Liang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (15) : 19795 - 19809
  • [46] Phase Contour Enhancement Network for Clothing Parsing
    Yu, Feng
    Zhang, Ying
    Li, Huiyin
    Du, Chenghu
    Liu, Li
    Jiang, Minghua
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 2784 - 2793
  • [47] Self-Correction for Human Parsing
    Li, Peike
    Xu, Yunqiu
    Wei, Yunchao
    Yang, Yi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 3260 - 3271
  • [48] Human parsing by weak structural label
    Zhiyong Chen
    Si Liu
    Yanlong Zhai
    Jia Lin
    Xiaochun Cao
    Liang Yang
    Multimedia Tools and Applications, 2018, 77 : 19795 - 19809
  • [49] EHANet: An Effective Hierarchical Aggregation Network for Face Parsing
    Luo, Ling
    Xue, Dingyu
    Feng, Xinglong
    APPLIED SCIENCES-BASEL, 2020, 10 (09):
  • [50] Efficient Light Deep Network for Street Scene Parsing
    Wang, ZheHui
    Zhao, Sanyuan
    Shen, Jianbing
    Lei, Zhengchao
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 42 - 45