Learning to Parameterize Visual Attributes for Open-set Fine-grained Retrieval

被引:0
|
作者
Wang, Shijie [1 ]
Chang, Jianlong [2 ]
Li, Haojie [3 ]
Wang, Zhihui [1 ]
Ouyang, Wanli [4 ]
Tian, Qi [2 ]
机构
[1] Dalian Univ Technol, Int Sch Informat Sci & Engn, Dalian, Peoples R China
[2] Huawei Cloud & AI, Shenzhen, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Comp & Engn, Qingdao, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Open-set fine-grained retrieval is an emerging challenging task that allows to retrieve unknown categories beyond the training set. The best solution for handling unknown categories is to represent them using a set of visual attributes learnt from known categories, as widely used in zero-shot learning. Though important, attribute modeling usually requires significant manual annotations and thus is labor-intensive. Therefore, it is worth to investigate how to transform retrieval models trained by image-level supervision from category semantic extraction to attribute modeling. To this end, we propose a novel Visual Attribute Parameterization Network (VAPNet) to learn visual attributes from known categories and parameterize them into the retrieval model, without the involvement of any attribute annotations. In this way, VAPNet could utilize its parameters to parse a set of visual attributes from unknown categories and precisely represent them. Technically, VAPNet explicitly attains some semantics with rich details via making use of local image patches and distills the visual attributes from these discovered semantics. Additionally, it integrates the online refinement of these visual attributes into the training process to iteratively enhance their quality. Simultaneously, VAPNet treats these attributes as supervisory signals to tune the retrieval models, thereby achieving attribute parameterization. Extensive experiments on open-set fine-grained retrieval datasets validate the superior performance of our VAPNet over existing solutions.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Universal Fine-Grained Visual Categorization by Concept Guided Learning
    Bi, Qi
    Zhou, Beichen
    Ji, Wei
    Xia, Gui-Song
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 394 - 409
  • [32] Cross-X Learning for Fine-Grained Visual Categorization
    Luo, Wei
    Yang, Xitong
    Mo, Xianjie
    Lu, Yuheng
    Davis, Larry S.
    Li, Jun
    Yang, Jian
    Lim, Ser-Nam
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8241 - 8250
  • [33] A survey of fine-grained visual categorization based on deep learning
    XIE Yuxiang
    GONG Quanzhi
    LUAN Xidao
    YAN Jie
    ZHANG Jiahui
    Journal of Systems Engineering and Electronics, 2024, 35 (06) : 1337 - 1356
  • [34] A survey of fine-grained visual categorization based on deep learning
    Xie Yuxiang
    Gong Quanzhi
    Luan Xidao
    Yan Jie
    Zhang Jiahui
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023,
  • [35] Plenty is Plague: Fine-Grained Learning for Visual Question Answering
    Zhou, Yiyi
    Ji, Rongrong
    Sun, Xiaoshuai
    Su, Jinsong
    Meng, Deyu
    Gao, Yue
    Shen, Chunhua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) : 697 - 709
  • [36] Discovering Localized Attributes for Fine-grained Recognition
    Duan, Kun
    Parikh, Devi
    Crandall, David
    Grauman, Kristen
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3474 - 3481
  • [37] Authentic attributes with fine-grained anonymity protection
    Stubblebine, SG
    Syverson, PF
    FINANCIAL CRYPTOGRAPHY, PROCEEDINGS, 2001, 1962 : 276 - 294
  • [38] Learning Feature Embedding with Strong Neural Activations for Fine-Grained Retrieval
    Shen, Chen
    Zhou, Chang
    Jin, Zhongming
    Chu, Wenqing
    Jiang, Rongxin
    Chen, Yaowu
    Hua, Xian-Sheng
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 424 - 432
  • [39] LoopNet for fine-grained fashion attributes editing
    Zou, Xingxing
    Zhu, Shumin
    Wong, Wai Keung
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [40] Understanding Objects in Detail with Fine-grained Attributes
    Vedaldi, Andrea
    Mahendran, Siddharth
    Tsogkas, Stavros
    Maji, Subhransu
    Girshick, Ross
    Kannala, Juho
    Rahtu, Esa
    Kokkinos, Iasonas
    Blaschko, Matthew B.
    Weiss, David
    Taskar, Ben
    Simonyan, Karen
    Saphra, Naomi
    Mohamed, Sammy
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3622 - 3629