DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images

被引：189

作者：

Ge, Yuying ^{[1
]}

Zhang, Ruimao ^{[1
]}

Wang, Xiaogang ^{[1
]}

Tang, Xiaoou ^{[1
]}

Luo, Ping ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00548

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Understanding fashion images has been advanced by benchmarks with rich annotations such as DeepFashion, whose labels include clothing categories, landmarks, and consumer-commercial image pairs. However, DeepFashion has nonnegligible issues such as single clothing-item per image, sparse landmarks (4 similar to 8 only), and no per-pixel masks, making it had significant gap from real-world scenarios. We fill in the gap by presenting DeepFashion2 to address these issues. It is a versatile benchmark of four tasks including clothes detection, pose estimation, segmentation, and retrieval. It has 801K clothing items where each item has rich annotations such as style, scale, view- point, occlusion, bounding box, dense landmarks (e.g. 39 for 'long sleeve outwear' and 15 for 'vest'), and masks. There are also 873K Commercial-Consumer clothes pairs. The annotations of DeepFashion2 are much larger than its counterparts such as 8x of FashionAI Global Challenge. A strong baseline is proposed, called Match R-CNN, which builds upon Mask R-CNN to solve the above four tasks in an end-to-end manner. Extensive evaluations are conducted with different criterions in Deep- Fashion2.

引用

页码：5332 / 5340

页数：9

共 20 条

[1]

[Anonymous], ACM MULTIMEDIA

[2]

[Anonymous], 2015, ICCV

[3]

[Anonymous], 2014, CVPR

[4]

[Anonymous], 2015, CVPR

[5]

[Anonymous], 2016, ECCV

[6]

[Anonymous], 2015, ICCV

[7]

[Anonymous], 2017, P IEEE C COMPUTER VI

[8]

[Anonymous], 2012, ECCV

[9]

[Anonymous], 2013, ICCV

[10]

[Anonymous], 2018, CVPR

← 1 2 →