CROSS-MODAL LEARNING TO RANK WITH ADAPTIVE LISTWISE CONSTRAINT

被引：0

作者：

Qu, Guangzhuo ^{[1
]}

Xiao, Jing ^{[1
]}

Zhu, Jia ^{[1
]}

Cao, Yang ^{[1
]}

Huang, Changqin ^{[2
]}

机构：

[1] South China Normal Univ, Sch Comp Sci, Guangzhou, Guangdong, Peoples R China

[2] South China Normal Univ, Sch Informat Technol Educ, Guangzhou, Guangdong, Peoples R China

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

基金：

中国国家自然科学基金;

关键词：

Cross-modal retrieval; common space; adaptive listwise theory; cross-modal learning to rank;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Multi-modal data lies on heterogeneous feature spaces, which brings a significant challenge to cross-modal retrieval. Some works have been proposed to cope with this problem by learning a common subspace. However, previous methods often learn the common subspace by enhancing the relation between embedded features and relevant class labels but ignore the relation between embedded features and irrelevant class labels. Additionally, most methods assume that irrelevant samples are of equal importance. Considering this, we propose to train an optimal common embedding space via cross-modal learning to rank with adaptive listwise constraint (CMAL(2)R) based on two-branch neural networks. The listwise loss function in CMAL(2)R adaptively assigns larger margins to harder irrelevant samples, strengthening the relation between embedded features and irrelevant class labels. Experiments on Wikipedia and Pascal datasets demonstrate the effectiveness for bi-directional image-text retrieval.

引用

页码：1658 / 1662

页数：5

共 50 条

[1] Semantic consistency cross-modal dictionary learning with rank constraint
Shang, Fei
Zhang, Huaxiang
Sun, Jiande
Liu, Li
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 259 - 266
[2] Learning to rank with relational graph and pointwise constraint for cross-modal retrieval
Qingzhen Xu
Miao Li
Mengjing Yu
Soft Computing, 2019, 23 : 9413 - 9427
[3] Learning to rank with relational graph and pointwise constraint for cross-modal retrieval
Xu, Qingzhen
Li, Miao
Yu, Mengjing
SOFT COMPUTING, 2019, 23 (19) : 9413 - 9427
[4] Simple to complex cross-modal learning to rank
Luo, Minnan
Chang, Xiaojun
Li, Zhihui
Nie, Liqiang
Hauptmann, Alexander G.
Zheng, Qinghua
COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 163 : 67 - 77
[5] Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval
Wu, Yiling
Wang, Shuhui
Huang, Qingming
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) : 1310 - 1322
[6] Listwise learning to rank with extreme order sensitive constraint via cross-correntropy
Liu, Dezheng
Li, Zhongyu
Ma, Yuanyuan
Zhang, Yulong
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (22):
[7] Cross-Modal Learning to Rank via Latent Joint Representation
Wu, Fei
Jiang, Xinyang
Li, Xi
Tang, Siliang
Lu, Weiming
Zhang, Zhongfei
Zhuang, Yueting
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (05) : 1497 - 1509
[8] Adaptive Adversarial Learning based cross-modal retrieval
Li, Zhuoyi
Lu, Huibin
Fu, Hao
Wang, Zhongrui
Gu, Guanghun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
[9] Adaptive Cross-Modal Few-shot Learning
Xing, Chen
Rostamzadeh, Negar
Oreshkin, Boris N.
Pinheiro, Pedro O.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[10] ONLINE LOW-RANK SIMILARITY FUNCTION LEARNING WITH ADAPTIVE RELATIVE MARGIN FOR CROSS-MODAL RETRIEVAL
Wu, Yiling
Wang, Shuhui
Zhang, Weigang
Huang, Qingming
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 823 - 828

← 1 2 3 4 5 →