Zero-Shot Transfer Learning Based on Visual and Textual Resemblance

Cited by: 2
Authors
Yang, Gang [1]
Xu, Jieping [1]
Affiliation
[1] Renmin Univ China, Key Lab Data Engn & Knowledge Engn, Beijing, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
Transfer learning; Zero-shot learning; Deep learning;
DOI
10.1007/978-3-030-36718-3_30
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Existing image search engines, whose ranking functions are built on labeled images or surrounding text, perform poorly on queries containing new or low-frequency keywords. In this paper, we propose zero-shot transfer learning (ZSTL), which aims to transfer networks from given classifiers to new zero-shot classifiers at little cost and helps image search perform better on new or low-frequency words. ZSTL can also enhance content-based queries (i.e., queries in which images are ranked not only by their visual appearance but also by their content). ZSTL was proposed after we found a resemblance between photographic composition and the description of objects in natural language. Both composition and description highlight the object by stressing its particularity, so we posit that a resemblance exists between the visual and textual spaces. We provide several ways to transfer visual features into textual ones. Applying deep learning and Word2Vec models to Wikipedia yielded impressive results. Our experiments provide evidence for the resemblance between composition and description and show the feasibility and effectiveness of transferring zero-shot classifiers. With these transferred zero-shot classifiers, the problem of ranking images for queries with low-frequency or new words can be solved. The proposed image search engine adopts cosine distance ranking as its ranking algorithm. Experiments on image search show the superior performance of ZSTL.
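The abstract names cosine distance ranking as the search engine's ranking algorithm. The following is a minimal sketch of that general technique, not the authors' implementation. It assumes that the query word and each image have already been mapped to vectors in a shared textual space; the function names, image names, and toy vectors are hypothetical. Sorting by descending cosine similarity is equivalent to sorting by ascending cosine distance (1 minus the similarity).

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # A zero vector has no direction; treat it as unrelated to everything.
        return 0.0
    return dot / (norm_u * norm_v)


def rank_images(query_vec, image_vecs):
    """Rank images by descending cosine similarity to the query vector.

    image_vecs maps a (hypothetical) image name to its embedding.
    """
    scored = [
        (name, cosine_similarity(query_vec, vec))
        for name, vec in image_vecs.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy data: three-dimensional vectors stand in for real embeddings.
query = [1.0, 0.0, 1.0]
images = {
    "img_a": [1.0, 0.0, 1.0],  # same direction as the query
    "img_b": [0.0, 1.0, 0.0],  # orthogonal to the query
    "img_c": [1.0, 1.0, 0.0],  # partially aligned with the query
}
ranking = rank_images(query, images)
for name, score in ranking:
    print(name, round(score, 3))
```

Because cosine similarity ignores vector length, an image embedding with a large norm is not favored over a shorter one pointing in the same direction.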
Pages: 353-362
Number of pages: 10