Leveraging Style and Content features for Text Conditioned Image Retrieval

被引：4

作者：

Chawla, Pranit ^{[1
]}

Jandial, Surgan ^{[2
]}

Badjatiya, Pinkesh ^{[3
]}

Chopra, Ayush ^{[4
]}

Sarkar, Mausoom ^{[3
]}

Krishnamurthy, Balaji ^{[3
]}

机构：

[1] IIT Kharagpur, Kharagpur, W Bengal, India

[2] IIT Hyderabad, Kandi, Telangana, India

[3] Adobe, Media & Data Sci Res Lab, San Jose, CA USA

[4] MIT, Cambridge, MA 02139 USA

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 | 2021年

关键词：

D O I：

10.1109/CVPRW53098.2021.00448

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image Search is a fundamental task playing a significant role in the success of wide variety of frameworks and applications. However, with the increasing sizes of product catalogues and the number of attributes per product, it has become difficult for users to express their needs effectively. Therefore, we focus on the problem of Image Retrieval with Text Feedback, which involves retrieving modified images according to the natural language feedback provided by users. In this work, we hypothesise that since an image can be delineated by its content and style features, modifications to the image can also take place in the two sub spaces respectively. Hence, we decompose an input image into its corresponding style and content features, apply modification of the text feedback individually in both the style and content spaces and finally fuse them for retrieval. Our experiments show that our approach outperforms a recent state of the art method in this task, TIRG, that seeks to use a single vector in contrast to leveraging the modification via text over style and content spaces separately.

引用

页码：3973 / 3977

页数：5

共 50 条

[1] CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback
Lee, Seungmin
Kim, Dongwan
Han, Bohyung
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 802 - 812
[2] Interactive Image Retrieval Using Text and Image Content
Dinakaran, B.
Annapurna, J.
Kumar, Ch. Aswani
CYBERNETICS AND INFORMATION TECHNOLOGIES, 2010, 10 (03) : 20 - 30
[3] Query expansion by text and image features in image retrieval
Zhou, H
Chan, SY
Kok, FL
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1998, 9 (04) : 287 - 299
[4] Text-Image Retrieval With Salient Features
Feng, Xia
Hu, Zhiyi
Liu, Caihua
Ip, W. H.
Chen, Huiying
JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 1 - 13
[5] Image-retrieval agent: integrating image content and text
Favela, J
Meza, V
IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1999, 14 (05): : 36 - 39
[6] Combining Image and Text Features for Medicinal Plants Image Retrieval
Madam, Oki
Herdiyeni, Yeni
2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2013, : 273 - 277
[7] LEVERAGING IMPLICIT SPATIAL INFORMATION IN GLOBAL FEATURES FOR IMAGE RETRIEVAL
Jacob, Pierre
Picard, David
Histace, Aymeric
Klein, Edouard
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2002 - 2006
[8] SAC: Semantic Attention Composition for Text-Conditioned Image Retrieval
Jandial, Surgan
Badjatiya, Pinkesh
Chawla, Pranit
Chopra, Ayush
Sarkar, Mausoom
Krishnamurthy, Balaji
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 597 - 606
[9] Weighted Semantic Fusion of Text and Content for Image Retrieval
Goel, Nidhi
Sehgal, Priti
2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 681 - 687
[10] Image Features Optimizing for Content-Based Image Retrieval
Shi, Zhiping
Liu, Xi
He, Qing
Shi, Zhongzhi
2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 4, 2009, : 260 - 264

← 1 2 3 4 5 →