Multimedia Feature Mapping and Correlation Learning for Cross-Modal Retrieval

被引:4
|
作者
Yuan, Xu [1 ]
Zhong, Hua [1 ]
Chen, Zhikui [1 ]
Zhong, Fangming [1 ]
Hu, Yueming [2 ]
机构
[1] Dalian Univ Technol, Sch Software Technol, Dalian, Peoples R China
[2] South China Agr Univ, Coll Nat Resources & Environm, Guangzhou, Guangdong, Peoples R China
关键词
Correlation Learning; Cross-Modal Retrieval; Multimedia; Semantic Feature; Text and Image;
D O I
10.4018/IJGHPC.2018070103
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This article describes how with the rapid increasing of multimedia content on the Internet, the need for effective cross-modal retrieval has attracted much attention recently. Many related works ignore the latent semantic correlations of modalities in the non-linear space and the extraction of high-level modality features, which only focuses on the semantic mapping of modalities in linear space and the use of low-level artificial features as modality feature representation. To solve these issues, the authors first utilizes convolutional neural networks and topic modal to obtain a high-level semantic feature of various modalities. Sequentially, they propose a supervised learning algorithm based on a kernel with partial least squares that can capture semantic correlations across modalities. Finally, the joint model of different modalities is learnt by the training set. Extensive experiments are conducted on three benchmark datasets that include Wikipedia, Pascal and MIRFlickr. The results show that the proposed approach achieves better retrieval performance over several state-of-the-art approaches.
引用
收藏
页码:29 / 45
页数:17
相关论文
共 50 条
  • [1] Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval
    Kang, Cuicui
    Xiang, Shiming
    Liao, Shengcai
    Xu, Changsheng
    Pan, Chunhong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (03) : 370 - 381
  • [2] COUPLED DICTIONARY LEARNING AND FEATURE MAPPING FOR CROSS-MODAL RETRIEVAL
    Xu, Xing
    Shimada, Atsushi
    Taniguchi, Rin-ichiro
    He, Li
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [3] Cross-Modal Retrieval with Correlation Feature Propagation
    Zhang L.
    Cao F.
    Liang X.
    Qian Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (09): : 1993 - 2002
  • [4] On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval
    Costa Pereira, Jose
    Coviello, Emanuele
    Doyle, Gabriel
    Rasiwasia, Nikhil
    Lanckriet, Gert R. G.
    Levy, Roger
    Vasconcelos, Nuno
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) : 521 - 535
  • [5] Combining Link and Content Correlation Learning for Cross-Modal Retrieval in Social Multimedia
    Zhang, Longtao
    Liu, Fangfang
    Zeng, Zhimin
    HUMAN CENTERED COMPUTING, HCC 2017, 2018, 10745 : 516 - 526
  • [6] Deep Semantic Correlation Learning based Hashing for Multimedia Cross-Modal Retrieval
    Gong, Xiaolong
    Huang, Linpeng
    Wang, Fuwei
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 117 - 126
  • [7] Heterogeneous Metric Learning for Cross-Modal Multimedia Retrieval
    Deng, Jun
    Du, Liang
    Shen, Yi-Dong
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT I, 2013, 8180 : 43 - 56
  • [8] Topic correlation model for cross-modal multimedia information retrieval
    Qin, Zengchang
    Yu, Jing
    Cong, Yonghui
    Wan, Tao
    PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (04) : 1007 - 1022
  • [9] Topic correlation model for cross-modal multimedia information retrieval
    Zengchang Qin
    Jing Yu
    Yonghui Cong
    Tao Wan
    Pattern Analysis and Applications, 2016, 19 : 1007 - 1022
  • [10] Discriminative Latent Feature Space Learning for Cross-Modal Retrieval
    Tang, Xu
    Deng, Cheng
    Gao, Xinbo
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 507 - 510