Learning Fashion Similarity Based on Hierarchical Attribute Embedding

被引:6
作者
Yan, Cairong [1 ]
Ding, Anan [1 ]
Zhang, Yanting [1 ]
Wang, Zijian [1 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
来源
2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA) | 2021年
关键词
Fashion Retrieval; Similarity Learning; Attribute-aware Embedding; Attention Mechanism; FEATURES;
D O I
10.1109/DSAA53316.2021.9564236
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Embedding items directly into a common feature space, and then measuring the similarity by calculating the feature distance in this space, has become the main method for similarity learning in current fashion retrieval tasks. The method is simple and efficient, but it ignores the correlation among fashion attributes and the impact of these correlations on the feature space, thereby reducing the accuracy of retrieval. Since the number of fashion attributes is large and the semantic granularity is also different, how to capture the relationship between fashion attributes and perform refined embedding to accurately represent fashion items is a challenge. In this paper, by constructing an attribute tree, we propose a hierarchical attribute embedding method for representing fashion items to enhance the relationship between attributes and use masking technology to disentangle different attributes. Based on these modules, we propose a hierarchical attribute-aware embedding network (HAEN) which takes images and attributes as input, learns multiple attribute-specific embedding spaces, and measures fine-grained similarity in the corresponding spaces. The extensive experimental result on two fashion-related public datasets FashionAI and DARN shows the superiority (+5.11% and +3.09% in MAP, respectively) of our proposed HAEN compared with state-of-the-art methods.
引用
收藏
页数:8
相关论文
共 30 条
  • [1] Learning Attribute Representations with Localization for Flexible Fashion Search
    Ak, Kenan E.
    Kassim, Ashraf A.
    Lim, Joo Hwee
    Tham, Jo Yew
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7708 - 7717
  • [2] Efficient Multi-Attribute Similarity Learning Towards Attribute-based Fashion Search
    Ak, Kenan E.
    Lim, Joo Hwee
    Tham, Jo Yew
    Kassim, Ashraf A.
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1671 - 1679
  • [3] Label-Embedding for Image Classification
    Akata, Zeynep
    Perronnin, Florent
    Harchaoui, Zaid
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) : 1425 - 1438
  • [4] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [5] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [6] Dual Encoding for Zero-Example Video Retrieval
    Dong, Jianfeng
    Li, Xirong
    Xu, Chaoxi
    Ji, Shouling
    He, Yuan
    Yang, Gang
    Wang, Xun
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9338 - 9347
  • [7] Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild
    Dong, Jianfeng
    Li, Xirong
    Xu, Duanqing
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) : 2371 - 2384
  • [8] Prototype-guided Attribute-wise Interpretable Scheme for Clothing Matching
    Han, Xianjing
    Song, Xuemeng
    Yin, Jianhua
    Wang, Yinglong
    Nie, Liqiang
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 785 - 794
  • [9] Learning Fashion Compatibility with Bidirectional LSTMs
    Han, Xintong
    Wu, Zuxuan
    Jiang, Yu-Gang
    Davis, Larry S.
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1078 - 1086
  • [10] Automatic Spatially-aware Fashion Concept Discovery
    Han, Xintong
    Wu, Zuxuan
    Huang, Phoenix X.
    Zhang, Xiao
    Zhu, Menglong
    Li, Yuan
    Zhao, Yang
    Davis, Larry S.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1472 - 1480