Set of Diverse Queries With Uncertainty Regularization for Composed Image Retrieval

被引:0
作者
Xu, Yahui [1 ,2 ]
Wei, Jiwei [1 ,2 ]
Bin, Yi [3 ]
Yang, Yang [4 ,5 ]
Ma, Zeyu [1 ,2 ]
Shen, Heng Tao [4 ,5 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] Natl Univ Singapore, Inst Data Sci, Singapore 119077, Singapore
[4] Univ Elect Sci & Technol China UESTC, Ctr Future Multimedia, Chengdu 611731, Peoples R China
[5] Univ Elect Sci & Technol China UESTC, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Uncertainty; Semantics; Image retrieval; Probabilistic logic; Task analysis; Fuses; Loss measurement; Composed image retrieval; multi-modal learning; image retrieval;
D O I
10.1109/TCSVT.2024.3401006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Composed image retrieval aims to search a target image by concurrently understanding the composed inputs with a reference image and the complementary modification text. It aims to find a shared latent space where the representation of the composed inputs is close to the desired target image. Most previous methods capture the one-to-one correspondence between the composed inputs and target image, which encodes the composed inputs and the target image into single points in the feature space. However, the one-to-one correspondence cannot effectively handle this task due to the inherent ambiguity problem arising from the various semantic meanings and data uncertainty. Specifically, the composed inputs and target image always exhibit various semantic meanings, affecting the retrieval results. Moreover, given the composed inputs (resp. target image), there are multiple target images (resp. composed inputs) that equally make sense. In this paper, we propose a novel method termed Set of Diverse Queries with Uncertainty Regularization (SDQUR) to solve such inherent ambiguity problem. First, we utilize diverse queries to adaptively aggregate the composed inputs and target image into multiple deterministic embeddings that capture different semantic meanings in the triplet affecting the retrieval process. It can exploit the deterministic many-to-many correspondence within each triple through these set-based queries. Moreover, we provide an uncertainty regularization module to encode the composed inputs and target image into gaussian distribution. Multiple potential positive candidates are sampled from the distribution for probabilistic many-to-many correspondence. Through the complementary deterministic and probabilistic many-to-many correspondence manner, we achieve consistent improvements on the standard FashionIQ, CIRR, and Shoes benchmarks, surpassing the state-of-the-art methods by a large margin.
引用
收藏
页码:10494 / 10506
页数:13
相关论文
共 50 条
  • [41] Image Retrieval Approach Based on Intuitive Fuzzy Set Combined with Genetic Algorithm
    王潇茵
    徐卫华
    胡昌振
    JournalofBeijingInstituteofTechnology, 2009, 18 (01) : 60 - 64
  • [42] Object-Based Image Retrieval System Based On Rough Set Theory
    Sharawy, Gaber A.
    Ghali, Neveen I.
    Ghoneim, Wafaa A.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (01): : 160 - 165
  • [43] Heterogeneous Feature Fusion and Cross-modal Alignment for Composed Image Retrieval
    Zhang, Gangjian
    Wei, Shikui
    Pang, Huaxin
    Zhao, Yao
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5353 - 5362
  • [44] SHAF: Semantic-Guided Hierarchical Alignment and Fusion for Composed Image Retrieval
    Yan, Cairong
    Yang, Erhe
    Tao, Ran
    Wan, Yongquan
    Ai, Derun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14879 : 447 - 459
  • [45] A Novel Image Retrieval Model Based on Semantic Information and Probability Rough Set Analysis Method
    Zhu, Yilin
    Jing, Naihuan
    Chen, Wenquan
    2017 2ND ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS (ACIRS), 2017, : 23 - 27
  • [46] Composed image retrieval: a survey on recent research and developmentComposed image retrieval: a survey on recent research and developmentY. Wan et al.
    Yongquan Wan
    Guobing Zou
    Bofeng Zhang
    Applied Intelligence, 2025, 55 (7)
  • [47] Retrieval of Chlorophyll-a concentration and associated product uncertainty in optically diverse lakes and reservoirs
    Liu, Xiaohan
    Steele, Christopher
    Simis, Stefan
    Warren, Mark
    Tyler, Andrew
    Spyrakos, Evangelos
    Selmes, Nick
    Hunter, Peter
    REMOTE SENSING OF ENVIRONMENT, 2021, 267
  • [48] Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
    Wen, Haokun
    Song, Xuemeng
    Chen, Xiaolin
    Wei, Yinwei
    Nie, Liqiang
    Chua, Tat-Seng
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 229 - 239
  • [49] Research on the Multiple Feature Fusion Image Retrieval Algorithm based on Texture Feature and Rough Set Theory
    Shi, Xiaojie
    Shao, Yijun
    PROCEEDINGS OF THE 2016 5TH INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS AND COMPUTER SCIENCE, 2016, 80 : 288 - 292
  • [50] Unsupervised query-adaptive implicit subtopic discovery for diverse image retrieval based on intrinsic cluster quality
    Lima Figueredo, Jose Solenir
    Calumby, Rodrigo Tripodi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (30) : 42991 - 43011