Clothing Parsing using Extended U-Net

被引：4

作者：

Vozarikova, Gabriela ^{[1
]}

Stana, Richard ^{[1
]}

Semanisin, Gabriel ^{[1
]}

机构：

[1] Pavol Jozef Safarik Univ Kosice, Inst Comp Sci, Jesenna 5, Kosice, Slovakia

来源：

VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP | 2021年

关键词：

U-Net; Clothing Parsing; Segmentation; Computer Vision; Multitask Learning; Deep Learning; Fully-convolutional Network;

D O I：

10.5220/0010177700150024

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper focuses on the task of clothing parsing, which is a special case of the more general object segmentation task well known in the field of computer vision. Each pixel is to be assigned to one of the clothing categories or background. Due to complexity of the problem and lack of data (until recently) performance of the modern state-of-the-art clothing parsing models expressed in terms of mean Intersection over Union metric (IoU) does not exceed 55%. In this paper, we propose a novel multitask network by extending fully-convolutional neural network U-Net with two side branches - one solves a multilabel classification task and the other predicts bounding boxes of clothing instances. We trained this network using a large-scaled iMaterialist dataset (Visipedia, 2019), which we refined. Compared to well performing segmentation architectures FPN, DeepLabV3, DeepLabV3+ and plain U-Net, our model achieves the best experimental results.

引用

页码：15 / 24

页数：10

共 20 条

[1]

Aoki R, 2019, IEEE ICCE, P289, DOI [10.1109/icce-berlin47944.2019.8966159, 10.1109/ICCE-Berlin47944.2019.8966159]

[2]

Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)

[3] DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images [J].

Ge, Yuying ;

Zhang, Ruimao ;

Wang, Xiaogang ;

Tang, Xiaoou ;

Luo, Ping .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5332-5340

[4] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[5] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[6]

Howard J., 2018, fastai

[7]

Khurana T, 2018, IEEE IMAGE PROC, P2102, DOI 10.1109/ICIP.2018.8451281

[8] Fashion Parsing With Weak Color-Category Labels [J].

Liu, Si ;

Feng, Jiashi ;

Domokos, Csaba ;

Xu, Hui ;

Huang, Junshi ;

Hu, Zhenzhen ;

Yan, Shuicheng .

IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (01) :253-265

[9]

Loshchilov Ilya, 2018, Fixing weight decay regularization in adam

[10] Semantic Segmentation of Fashion Images Using Feature Pyramid Networks [J].

Martinsson, John ;

Mogren, Olof .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :3133-3136

← 1 2 →