CROSS-MODAL LEARNING TO RANK WITH ADAPTIVE LISTWISE CONSTRAINT

被引:0
|
作者
Qu, Guangzhuo [1 ]
Xiao, Jing [1 ]
Zhu, Jia [1 ]
Cao, Yang [1 ]
Huang, Changqin [2 ]
机构
[1] South China Normal Univ, Sch Comp Sci, Guangzhou, Guangdong, Peoples R China
[2] South China Normal Univ, Sch Informat Technol Educ, Guangzhou, Guangdong, Peoples R China
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval; common space; adaptive listwise theory; cross-modal learning to rank;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multi-modal data lies on heterogeneous feature spaces, which brings a significant challenge to cross-modal retrieval. Some works have been proposed to cope with this problem by learning a common subspace. However, previous methods often learn the common subspace by enhancing the relation between embedded features and relevant class labels but ignore the relation between embedded features and irrelevant class labels. Additionally, most methods assume that irrelevant samples are of equal importance. Considering this, we propose to train an optimal common embedding space via cross-modal learning to rank with adaptive listwise constraint (CMAL(2)R) based on two-branch neural networks. The listwise loss function in CMAL(2)R adaptively assigns larger margins to harder irrelevant samples, strengthening the relation between embedded features and irrelevant class labels. Experiments on Wikipedia and Pascal datasets demonstrate the effectiveness for bi-directional image-text retrieval.
引用
收藏
页码:1658 / 1662
页数:5
相关论文
共 50 条
  • [1] Semantic consistency cross-modal dictionary learning with rank constraint
    Shang, Fei
    Zhang, Huaxiang
    Sun, Jiande
    Liu, Li
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 259 - 266
  • [2] Learning to rank with relational graph and pointwise constraint for cross-modal retrieval
    Qingzhen Xu
    Miao Li
    Mengjing Yu
    Soft Computing, 2019, 23 : 9413 - 9427
  • [3] Learning to rank with relational graph and pointwise constraint for cross-modal retrieval
    Xu, Qingzhen
    Li, Miao
    Yu, Mengjing
    SOFT COMPUTING, 2019, 23 (19) : 9413 - 9427
  • [4] Simple to complex cross-modal learning to rank
    Luo, Minnan
    Chang, Xiaojun
    Li, Zhihui
    Nie, Liqiang
    Hauptmann, Alexander G.
    Zheng, Qinghua
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 163 : 67 - 77
  • [5] Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval
    Wu, Yiling
    Wang, Shuhui
    Huang, Qingming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) : 1310 - 1322
  • [6] Listwise learning to rank with extreme order sensitive constraint via cross-correntropy
    Liu, Dezheng
    Li, Zhongyu
    Ma, Yuanyuan
    Zhang, Yulong
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (22):
  • [7] Cross-Modal Learning to Rank via Latent Joint Representation
    Wu, Fei
    Jiang, Xinyang
    Li, Xi
    Tang, Siliang
    Lu, Weiming
    Zhang, Zhongfei
    Zhuang, Yueting
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (05) : 1497 - 1509
  • [8] Adaptive Adversarial Learning based cross-modal retrieval
    Li, Zhuoyi
    Lu, Huibin
    Fu, Hao
    Wang, Zhongrui
    Gu, Guanghun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [9] Adaptive Cross-Modal Few-shot Learning
    Xing, Chen
    Rostamzadeh, Negar
    Oreshkin, Boris N.
    Pinheiro, Pedro O.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [10] ONLINE LOW-RANK SIMILARITY FUNCTION LEARNING WITH ADAPTIVE RELATIVE MARGIN FOR CROSS-MODAL RETRIEVAL
    Wu, Yiling
    Wang, Shuhui
    Zhang, Weigang
    Huang, Qingming
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 823 - 828