Learning to Parameterize Visual Attributes for Open-set Fine-grained Retrieval

Cited by: 0
Authors
Wang, Shijie [1]
Chang, Jianlong [2]
Li, Haojie [3]
Wang, Zhihui [1]
Ouyang, Wanli [4]
Tian, Qi [2]
Affiliations
[1] Dalian Univ Technol, Int Sch Informat Sci & Engn, Dalian, Peoples R China
[2] Huawei Cloud & AI, Shenzhen, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Comp & Engn, Qingdao, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Open-set fine-grained retrieval is an emerging and challenging task that requires retrieving unknown categories beyond the training set. The best solution for handling unknown categories is to represent them using a set of visual attributes learnt from known categories, as widely used in zero-shot learning. Though important, attribute modeling usually requires significant manual annotations and thus is labor-intensive. Therefore, it is worth investigating how to transform retrieval models trained with image-level supervision from category semantic extraction to attribute modeling. To this end, we propose a novel Visual Attribute Parameterization Network (VAPNet) that learns visual attributes from known categories and parameterizes them into the retrieval model, without any attribute annotations. In this way, VAPNet can use its parameters to parse a set of visual attributes from unknown categories and represent those categories precisely. Technically, VAPNet explicitly captures detail-rich semantics from local image patches and distills visual attributes from these discovered semantics. Additionally, it integrates online refinement of these visual attributes into the training process to iteratively improve their quality. Simultaneously, VAPNet treats these attributes as supervisory signals to tune the retrieval model, thereby achieving attribute parameterization. Extensive experiments on open-set fine-grained retrieval datasets validate the superior performance of VAPNet over existing solutions.
Pages: 14