Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback

被引:89
作者
Wu, Hui [1 ,2 ]
Gao, Yupeng [2 ]
Guo, Xiaoxiao [1 ,2 ]
Al-Halah, Ziad [3 ]
Rennie, Steven [4 ]
Grauman, Kristen [3 ]
Feris, Rogerio [1 ,2 ]
机构
[1] MIT IBM Watson AI Lab, Cambridge, MA 02142 USA
[2] IBM Res, Armonk, NY 10504 USA
[3] UT Austin, Austin, TX USA
[4] Pryon, New York, NY USA
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.01115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conversational interfaces for the detail-oriented retail fashion domain are more natural, expressive, and user friendly than classical keyword-based search interfaces. In this paper, we introduce the Fashion IQ dataset to support and advance research on interactive fashion image retrieval. Fashion IQ is the first fashion dataset to provide human-generated captions that distinguish similar pairs of garment images together with side-information consisting of real-world product descriptions and derived visual attribute labels for these images. We provide a detailed analysis of the characteristics of the Fashion IQ data, and present a transformer-based user simulator and interactive image retriever that can seamlessly integrate visual attributes with image features, user feedback, and dialog history, leading to improved performance over the state of the art in dialogbased image retrieval. We believe that our dataset will encourage further work on developing more natural and realworld applicable conversational shopping assistants.(1)
引用
收藏
页码:11302 / 11312
页数:11
相关论文
共 75 条
  • [1] Al-Halah Z., 2017, ICCV
  • [2] Al-Halah Ziad, 2020, CVPR
  • [3] [Anonymous], 2012, CVPR
  • [4] [Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.126
  • [5] [Anonymous], 2013, ICCV
  • [6] [Anonymous], 2015, CVPR
  • [7] [Anonymous], 2016, CVPR, DOI DOI 10.1109/CVPR.2016.39
  • [8] [Anonymous], 2019, NEURIPS
  • [9] [Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.551
  • [10] [Anonymous], 2014, CVPR