You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval

被引:2
作者
Koley, Subhadeep [1 ,2 ]
Bhunia, Ayan Kumar [1 ]
Sahli, Aneeshan [1 ]
Chowdhury, Pinaki Nath [1 ]
Xiang, Tao [1 ,2 ]
Song, Yi-Zhe [1 ,2 ]
机构
[1] Univ Surrey, CVSSP, SketchX, Guildford, Surrey, England
[2] iFlyTek Surrey Joint Res Ctr Artificial Intellige, Guildford, Surrey, England
来源
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2024年
关键词
D O I
10.1109/CVPR52733.2024.01562
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Two primary input modalities prevail in image retrieval: sketch and text. While text is widely used for inter-category retrieval tasks, sketches have been established as the sole preferred modality for fine-grained image retrieval due to their ability to capture intricate visual details. In this paper, we question the reliance on sketches alone for fine-grained image retrieval by simultaneously exploring the fine-grained representation capabilities of both sketch and text, orchestrating a duet between the two. The end result enables precise retrievals previously unattainable, allowing users to pose ever-finer queries and incorporate attributes like colour and contextual cues from text. For this purpose, we introduce a novel compositionality framework, effectively combining sketches and text using pre-trained CLIP models, while eliminating the need for extensive fine-grained textual descriptions. Last but not least, our system extends to novel applications in composed image retrieval, domain attribute transfer, and fine-grained generation, providing solutions for various real-world scenarios.
引用
收藏
页码:16509 / 16519
页数:11
相关论文
共 74 条
  • [1] [Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00299
  • [2] [Anonymous], 2021, CVPR, DOI DOI 10.1109/CVPR46437.2021.00840
  • [3] Baldrati Alberto, 2022, CVPR
  • [4] Baldrati Alberto, 2023, ICCV
  • [5] Bhunia A. K., 2022, CVPR, P2293
  • [6] Bhunia AK, 2020, PROC CVPR IEEE, P9776, DOI 10.1109/CVPR42600.2020.00980
  • [7] Bhunia Ayan Kumar, 2022, ECCV
  • [8] Brown TB, 2020, ADV NEUR IN, V33
  • [9] Bulat Adrian, 2023, CVPR
  • [10] Castrejon Lluis, 2016, CVPR