Semantic granularity metric learning for visual search

被引:6
|
作者
Manandhar, Dipu [1 ,3 ]
Bastan, Muhammet [2 ,4 ]
Yap, Kim-Hui [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
[2] Amazon, Palo Alto, CA USA
[3] Univ Surrey, Guildford, Surrey, England
[4] Nanyang Technol Univ, Singapore, Singapore
关键词
Deep learnin; Metric learning; Metric loss functions; Semantic similarity; Visual search; IMAGE SIMILARITY; DEEP; REPRESENTATION;
D O I
10.1016/j.jvcir.2020.102871
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing metric learning methods often do not consider different granularly in visual similarly. However, in many domains, images exhibit similarly at multiple granularities with visual semantic concepts, e.g. fashion demonstrates similarly ranging from clothing of the exact same instance to similar looks/design or common category. Therefore, training image triplets/pairs inherently possess different degree of information. Nevertheless, the existing methods often treat them with equal importance which hinder capturing underlying granularities in image similarly. In view of this, we propose a new semantic granularly metric learning (SGML) that develops a novel idea of detecting and leveraging attribute semantic space and integrating it into deep metric learning to capture multiple granularities of similarly. The proposed framework simultaneously learns image attributes and embeddings with multitask-CNN where the tasks are linked by semantic granularly similarly mapping to leverage correlations between the tasks. To this end, we propose a new soft-binomial deviance loss that effectively integrates informativeness of training samples into metric-learning on-the-fly during training. Compared to recent ensemble-based methods, SGML is conceptually elegant, computationally simple yet effective. Extensive experiments on benchmark datasets demonstrate its superiorly e.g., 1-4.5%-Recall@1 improvement over the state-of-the-arts (Kim a al., 2018; Cakir a al., 2019) on DeepFashion-Inshop
引用
收藏
页数:11
相关论文
共 50 条
  • [1] DYNAMICALLY MODULATED DEEP METRIC LEARNING FOR VISUAL SEARCH
    Manandhar, Dipu
    Bastan, Muhammet
    Yap, Kim-Hui
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2408 - 2412
  • [2] Impact of Semantic Granularity on Geographic Information Search Support
    Mauro, N.
    Ardissono, L.
    Di Rocco, L.
    Guerrini, G.
    Bertolotto, M.
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 323 - 328
  • [3] Semantic Segmentation based on Multiple Granularity Learning
    Wu, Kebin
    Bawazir, Ameera
    Xiao, Xiaofei
    Avula, Sai Bhargav
    Almazrouei, Ebtesam
    Roura, Eloy
    Debbah, Merouane
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9065 - 9070
  • [4] Semantic granularity for the semantic web
    Albertoni, Riccardo
    Camossi, Elena
    De Martino, Monica
    Giannini, Franca
    Monti, Marina
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2006: OTM 2006 WORKSHOPS, PT 2, PROCEEDINGS, 2006, 4278 : 1863 - +
  • [5] Multi-Granularity Semantic Collaborative Reasoning Network for Visual Dialog
    Zhang, Hongwei
    Wang, Xiaojie
    Jiang, Si
    Li, Xuefeng
    APPLIED SCIENCES-BASEL, 2022, 12 (18):
  • [6] PSVMA plus : Exploring Multi-Granularity Semantic-Visual Adaption for Generalized Zero-Shot Learning
    Liu, Man
    Bai, Huihui
    Li, Feng
    Zhang, Chunjie
    Wei, Yunchao
    Wang, Meng
    Chua, Tat-Seng
    Zhao, Yao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (01) : 51 - 66
  • [7] Sparse semantic metric learning for image retrieval
    Jing Liu
    Zechao Li
    Hanqing Lu
    Multimedia Systems, 2014, 20 : 635 - 643
  • [8] Sparse semantic metric learning for image retrieval
    Liu, Jing
    Li, Zechao
    Lu, Hanqing
    MULTIMEDIA SYSTEMS, 2014, 20 (06) : 635 - 643
  • [9] Semantic preserving distance metric learning and applications
    Yu, Jun
    Tao, Dapeng
    Li, Jonathan
    Cheng, Jun
    INFORMATION SCIENCES, 2014, 281 : 674 - 686
  • [10] Semantic Frame Induction with Deep Metric Learning
    Yamada, Kosuke
    Sasano, Ryohei
    Takeda, Koichi
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1833 - 1845