Multimedia Feature Mapping and Correlation Learning for Cross-Modal Retrieval

被引：4

作者：

Yuan, Xu ^{[1
]}

Zhong, Hua ^{[1
]}

Chen, Zhikui ^{[1
]}

Zhong, Fangming ^{[1
]}

Hu, Yueming ^{[2
]}

机构：

[1] Dalian Univ Technol, Sch Software Technol, Dalian, Peoples R China

[2] South China Agr Univ, Coll Nat Resources & Environm, Guangzhou, Guangdong, Peoples R China

来源：

INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING | 2018年 / 10卷 / 03期

关键词：

Correlation Learning; Cross-Modal Retrieval; Multimedia; Semantic Feature; Text and Image;

D O I：

10.4018/IJGHPC.2018070103

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This article describes how with the rapid increasing of multimedia content on the Internet, the need for effective cross-modal retrieval has attracted much attention recently. Many related works ignore the latent semantic correlations of modalities in the non-linear space and the extraction of high-level modality features, which only focuses on the semantic mapping of modalities in linear space and the use of low-level artificial features as modality feature representation. To solve these issues, the authors first utilizes convolutional neural networks and topic modal to obtain a high-level semantic feature of various modalities. Sequentially, they propose a supervised learning algorithm based on a kernel with partial least squares that can capture semantic correlations across modalities. Finally, the joint model of different modalities is learnt by the training set. Extensive experiments are conducted on three benchmark datasets that include Wikipedia, Pascal and MIRFlickr. The results show that the proposed approach achieves better retrieval performance over several state-of-the-art approaches.

引用

页码：29 / 45

页数：17

共 50 条

[1] Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval
Kang, Cuicui
Xiang, Shiming
Liao, Shengcai
Xu, Changsheng
Pan, Chunhong
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (03) : 370 - 381
[2] COUPLED DICTIONARY LEARNING AND FEATURE MAPPING FOR CROSS-MODAL RETRIEVAL
Xu, Xing
Shimada, Atsushi
Taniguchi, Rin-ichiro
He, Li
2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
[3] Cross-Modal Retrieval with Correlation Feature Propagation
Zhang L.
Cao F.
Liang X.
Qian Y.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (09): : 1993 - 2002
[4] On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval
Costa Pereira, Jose
Coviello, Emanuele
Doyle, Gabriel
Rasiwasia, Nikhil
Lanckriet, Gert R. G.
Levy, Roger
Vasconcelos, Nuno
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) : 521 - 535
[5] Combining Link and Content Correlation Learning for Cross-Modal Retrieval in Social Multimedia
Zhang, Longtao
Liu, Fangfang
Zeng, Zhimin
HUMAN CENTERED COMPUTING, HCC 2017, 2018, 10745 : 516 - 526
[6] Deep Semantic Correlation Learning based Hashing for Multimedia Cross-Modal Retrieval
Gong, Xiaolong
Huang, Linpeng
Wang, Fuwei
2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 117 - 126
[7] Heterogeneous Metric Learning for Cross-Modal Multimedia Retrieval
Deng, Jun
Du, Liang
Shen, Yi-Dong
WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT I, 2013, 8180 : 43 - 56
[8] Topic correlation model for cross-modal multimedia information retrieval
Qin, Zengchang
Yu, Jing
Cong, Yonghui
Wan, Tao
PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (04) : 1007 - 1022
[9] Topic correlation model for cross-modal multimedia information retrieval
Zengchang Qin
Jing Yu
Yonghui Cong
Tao Wan
Pattern Analysis and Applications, 2016, 19 : 1007 - 1022
[10] Discriminative Latent Feature Space Learning for Cross-Modal Retrieval
Tang, Xu
Deng, Cheng
Gao, Xinbo
ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 507 - 510

← 1 2 3 4 5 →