Diverse image search with explanations

被引:0
|
作者
Xinying Zhu
Linhu Liu
机构
[1] Nanchang Institute of Technology,College of Business Administration
[2] Lenovo Research,undefined
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Image retrieval; Sentence generation; Relationship detection; Graph matching;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose a novel content based image search framework with explanations, which can not only compare the similarity among images from different perspectives, but also describe the commonalities of two images with language. Specifically, we develop a graph matching method to calculate the similarity of two images and locate their commonalities, where each graph includes perceptual information, conceptual information and relational information. Furthermore, we utilize a language model based method to generate sentences to describe the similarities of two images. Comparing with different perspectives, we follow the principle that rich structured representations are more important than simple ones. To evaluate this principle, we conduct the experiment on the Visual Genome dataset, where each image contains lots of objects and multiple object relationships. The experimental results demonstrate the effectiveness of the principle. We also evaluate our method in the explanation of similar images, and the experimental results demonstrate that our method can obtain comparable performance.
引用
收藏
页码:23067 / 23082
页数:15
相关论文
共 50 条
  • [31] MQSearch: Image Search by Multi-Class Query
    Luo, Yiwen
    Liu, Wei
    Liu, Jianzhuang
    Tang, Xiaoou
    CHI 2008: 26TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2008, : 49 - 52
  • [32] ADVERSARIAL LEARNING FOR FINE-GRAINED IMAGE SEARCH
    Lin, Kevin
    Yang, Fan
    Wang, Qiaosong
    Piramuthu, Robinson
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 490 - 495
  • [33] Computing Tag-Diversity for Social Image Search
    Kim, Eunggyo
    Yamamoto, Takehiro
    Tanaka, Katsumi
    EMERGENCE OF DIGITAL LIBRARIES - RESEARCH AND PRACTICES, 2014, 8839 : 328 - 335
  • [34] Computing tag-diversity for social image search
    Kim, Eunggyo
    Yamamoto, Takehiro
    Tanaka, Katsumi
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8839 : 328 - 335
  • [35] COMPACT TENSOR BASED IMAGE REPRESENTATION FOR SIMILARITY SEARCH
    Negrel, Romain
    Picard, David
    Gosselin, Philippe-Henri
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 2425 - 2428
  • [36] Improving image tags by exploiting web search results
    Zhang, Xiaoming
    Li, Zhoujun
    Chao, Wenhan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 62 (03) : 601 - 631
  • [37] DeepSearch: A Fast Image Search Framework for Mobile Devices
    Wang, Peisong
    Hu, Qinghao
    Fang, Zhiwei
    Zhao, Chaoyang
    Cheng, Jian
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (01)
  • [38] Patterns of semantic relations to improve image content search
    Hollink, Laura
    Schreiber, Guus
    Wielinga, Bob
    JOURNAL OF WEB SEMANTICS, 2007, 5 (03): : 195 - 203
  • [39] Mobile Visual Search Using Image and Text Features
    Tsai, Sam S.
    Chen, Huizhong
    Chen, David
    Vedantham, Ramakrishna
    Grzeszczuk, Radek
    Girod, Bernd
    2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 845 - 849
  • [40] Fast Democratic Aggregation and Query Fusion for Image Search
    Gao, Zhanning
    Xue, Jianru
    Zhou, Wengang
    Pang, Shanmin
    Tian, Qi
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 35 - 42