Learning Fashion Similarity Based on Hierarchical Attribute Embedding

被引：6

作者：

Yan, Cairong ^{[1
]}

Ding, Anan ^{[1
]}

Zhang, Yanting ^{[1
]}

Wang, Zijian ^{[1
]}

机构：

[1] Donghua Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China

来源：

2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA) | 2021年

关键词：

Fashion Retrieval; Similarity Learning; Attribute-aware Embedding; Attention Mechanism; FEATURES;

D O I：

10.1109/DSAA53316.2021.9564236

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Embedding items directly into a common feature space, and then measuring the similarity by calculating the feature distance in this space, has become the main method for similarity learning in current fashion retrieval tasks. The method is simple and efficient, but it ignores the correlation among fashion attributes and the impact of these correlations on the feature space, thereby reducing the accuracy of retrieval. Since the number of fashion attributes is large and the semantic granularity is also different, how to capture the relationship between fashion attributes and perform refined embedding to accurately represent fashion items is a challenge. In this paper, by constructing an attribute tree, we propose a hierarchical attribute embedding method for representing fashion items to enhance the relationship between attributes and use masking technology to disentangle different attributes. Based on these modules, we propose a hierarchical attribute-aware embedding network (HAEN) which takes images and attributes as input, learns multiple attribute-specific embedding spaces, and measures fine-grained similarity in the corresponding spaces. The extensive experimental result on two fashion-related public datasets FashionAI and DARN shows the superiority (+5.11% and +3.09% in MAP, respectively) of our proposed HAEN compared with state-of-the-art methods.

引用

页数：8

共 30 条

[1] Learning Attribute Representations with Localization for Flexible Fashion Search
Ak, Kenan E.
Kassim, Ashraf A.
Lim, Joo Hwee
Tham, Jo Yew
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7708 - 7717
[2] Efficient Multi-Attribute Similarity Learning Towards Attribute-based Fashion Search
Ak, Kenan E.
Lim, Joo Hwee
Tham, Jo Yew
Kassim, Ashraf A.
[J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1671 - 1679
[3] Label-Embedding for Image Classification
Akata, Zeynep
Perronnin, Florent
Harchaoui, Zaid
Schmid, Cordelia
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) : 1425 - 1438
[4] Histograms of oriented gradients for human detection
Dalal, N
Triggs, B
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
[5] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6] Dual Encoding for Zero-Example Video Retrieval
Dong, Jianfeng
Li, Xirong
Xu, Chaoxi
Ji, Shouling
He, Yuan
Yang, Gang
Wang, Xun
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9338 - 9347
[7] Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild
Dong, Jianfeng
Li, Xirong
Xu, Duanqing
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) : 2371 - 2384
[8] Prototype-guided Attribute-wise Interpretable Scheme for Clothing Matching
Han, Xianjing
Song, Xuemeng
Yin, Jianhua
Wang, Yinglong
Nie, Liqiang
[J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 785 - 794
[9] Learning Fashion Compatibility with Bidirectional LSTMs
Han, Xintong
Wu, Zuxuan
Jiang, Yu-Gang
Davis, Larry S.
[J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1078 - 1086
[10] Automatic Spatially-aware Fashion Concept Discovery
Han, Xintong
Wu, Zuxuan
Huang, Phoenix X.
Zhang, Xiao
Zhu, Menglong
Li, Yuan
Zhao, Yang
Davis, Larry S.
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1472 - 1480

← 1 2 3 →