Semantic Segmentation of Fashion Images Using Feature Pyramid Networks

被引：15

作者：

Martinsson, John ^{[1
]}

Mogren, Olof ^{[1
]}

机构：

[1] RISE Res Inst Sweden, Gothenburg, Sweden

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) | 2019年

关键词：

D O I：

10.1109/ICCVW.2019.00382

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work, we approach the problem of semantically segmenting fashion images into different categories of clothing. This problem poses particular challenges because of the importance of both textural information and cues from shapes and context. To this end, we propose a fully convolutional neural network based on feature pyramid networks (FPN), together with a backbone consisting of the ResNeXt architecture. Our experimental evaluation shows that the proposed model achieves state-of-the-art results on two standard fashion benchmark datasets, and a qualitative study verifies its effectiveness when applied to typical fashion images. The approach has a modest memory footprint and can be used without a conditional random field (CRF) without much degradation of quality which makes our model preferable from a computational perspective. When comparing all methods without a CRF, our approach outperforms all state-of-the-art models on both datasets by a clear margin in all evaluated metrics. In fact, our approach achieves a higher accuracy without the CRF than the state-of-the-art models using CRFs.

引用

页码：3133 / 3136

页数：4

共 13 条

[1]

[Anonymous], 2017, FASHION FORWARD FORE

[2]

[Anonymous], 2017, LOOKING OUTFIT PARSE

[3] Densely Connected Convolutional Networks [J].

Huang, Gao ;

Liu, Zhuang ;

van der Maaten, Laurens ;

Weinberger, Kilian Q. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269

[4]

Ji W., 2018, P 27 INT JOINT C ART

[5]

Khurana T., 2018, 2018 25 IEEE INT C I

[6]

Kingma DP, 2014, ARXIV

[7]

Krahenbuhl P., 2011, Adv. Neural Inf. Process. Syst., V24

[8]

Lin T.-Y., 2016, FEATURE PYRAMID NETW

[9] Fashion Parsing With Weak Color-Category Labels [J].

Liu, Si ;

Feng, Jiashi ;

Domokos, Csaba ;

Xu, Hui ;

Huang, Junshi ;

Hu, Zhenzhen ;

Yan, Shuicheng .

IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (01) :253-265

[10] Decoder Network over Lightweight Reconstructed Feature for Fast Semantic Style Transfer [J].

Lu, Ming ;

Zhao, Hao ;

Yao, Anbang ;

Xu, Feng ;

Chen, Yurong ;

Zhang, Li .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2488-2496

← 1 2 →