Learning to Parameterize Visual Attributes for Open-set Fine-grained Retrieval

Authors
Wang, Shijie [1]
Chang, Jianlong [2]
Li, Haojie [3]
Wang, Zhihui [1]
Ouyang, Wanli [4]
Tian, Qi [2]
Affiliations
[1] Dalian Univ Technol, Int Sch Informat Sci & Engn, Dalian, Peoples R China
[2] Huawei Cloud & AI, Shenzhen, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Comp & Engn, Qingdao, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
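Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract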
Open-set fine-grained retrieval is an emerging and challenging task that requires retrieving unknown categories beyond the training set. The best solution for handling unknown categories is to represent them using a set of visual attributes learnt from known categories, as is widely done in zero-shot learning. Though important, attribute modeling usually requires significant manual annotation and is thus labor-intensive. It is therefore worth investigating how to transform retrieval models trained with image-level supervision from category semantic extraction to attribute modeling. To this end, we propose a novel Visual Attribute Parameterization Network (VAPNet) that learns visual attributes from known categories and parameterizes them into the retrieval model without any attribute annotations. In this way, VAPNet can use its parameters to parse a set of visual attributes from unknown categories and represent them precisely. Technically, VAPNet explicitly extracts detail-rich semantics from local image patches and distills visual attributes from these discovered semantics. It also integrates online refinement of these visual attributes into the training process to iteratively improve their quality. Simultaneously, VAPNet treats these attributes as supervisory signals to tune the retrieval model, thereby achieving attribute parameterization. Extensive experiments on open-set fine-grained retrieval datasets validate the superior performance of VAPNet over existing solutions.
Pages: 14