Multi-task Learning Based Skin Segmentation

Cited by: 0
Authors
Tan, Taizhe [1,2]
Shan, Zhenghao [1]
Affiliations
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Heyuan Bay Area Digital Econ Technol Innovat Ctr, Heyuan 517001, Peoples R China
Source
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2023 | 2023 / Vol. 14119
Keywords
Skin segmentation; query-based; multi-task learning; encoder-decoder; deep learning
DOI
10.1007/978-3-031-40289-0_29
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Skin segmentation is a critical task in computer vision that has diverse applications in several fields such as biometrics, medical imaging, and video surveillance. Despite its importance, the acquisition of high-quality data remains a significant challenge in skin segmentation research. In this paper, we propose a novel skin segmentation algorithm for single-person images by utilizing a dual-task neural network built on the multi-task learning framework. Specifically, the algorithm employs an encoder-decoder architecture consisting of a shared backbone, two dynamic encoders, and a decoder. The dynamic encoders use dynamic convolution to extract more spatial location information, while the decoder utilizes a query-based dual-task approach that allows each task to utilize the information generated by the other one efficiently. The experimental results indicate that the proposed skin segmentation algorithm outperforms or matches the current state-of-the-art techniques on the benchmark test set.
Pages: 360-369
Page count: 10