EGA-Net: Edge Guided Attention Network With Label Refinement for Parsing of Animal Body Parts

被引:0
|
作者
Raghavendra, S. [1 ]
Abhilash, S. K. [2 ]
Nookala, Venu Madhav [2 ]
Girisha, S. [3 ]
Adesh, N. D. [1 ]
机构
[1] Manipal Inst Technol, Manipal Acad Higher Educ, Dept Informat & Commun Technol, Manipal 576104, India
[2] KPIT Technol, Bengaluru 560103, India
[3] Manipal Inst Technol, Manipal Acad Higher Educ, Dept Data Sci & Comp Applicat, Manipal 576104, India
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Accuracy; Animals; Semantics; Feature extraction; Quadrupedal robots; Semantic segmentation; Image edge detection; Training; Decoding; Edge-guided; part segmentation; quadruped animals; semantic segmentation; transformer;
D O I
10.1109/ACCESS.2024.3471948
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In computer vision, semantic segmentation precisely delineates objects at the pixel level. This fundamental idea is constantly evolving by adding new modules and adjustments to suit the unique characteristics of different object classes. Pixel-level semantic segmentation is an intricate and computationally intensive task, especially within the context of part-based approaches. The study proposes a transformer-based attention network that is edge-guided and developed for the precise partitioning of different parts of quadruped animals. The process of labeling masks at the pixel level is a challenging task for various object categories, owing to its inherent complexity, which often results in inaccurate annotations. An additional mechanism is used to enhance pixel-level accuracy between classes, which iteratively refines labels. The model is evaluated using the PascalPart and PartImageNet datasets, using various scales of transformer architectures. Performance is evaluated using metrics such as mean Intersection-over-Union (mIoU), Pixel Accuracy (PA), and mean Accuracy (mA). Ablation studies are conducted to evaluate the model's performance based on network parameters, while the effectiveness of each component is assessed using Class Activation Maps (CAM). The results show a notable 8% improvement in mIoU scores over existing state-of-the-art architectures, indicating the effectiveness of the proposed model in achieving fine-grained part segmentation, particularly in the context of quadruped animals.
引用
收藏
页码:149162 / 149172
页数:11
相关论文
共 1 条
  • [1] Attention-Guided Label Refinement Network for Semantic Segmentation of Very High Resolution Aerial Orthoimages
    Huang, Jianfeng
    Zhang, Xinchang
    Sun, Ying
    Xin, Qinchuan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 4490 - 4503