Combining Generic and Specific Information for Cross-modal Retrieval

被引:0
作者
Thi Quynh Nhi Tran [1 ]
Le Borgne, Nerve [1 ]
Crucianu, Michel [2 ]
机构
[1] CEA LIST, Vis & Content Engn Lab, Gif Sur Yvette, France
[2] CNAM, CEDRIC, Paris, France
来源
ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL | 2015年
关键词
Cross-modal retrieval; text illustration; canonical correlation analysis; IMAGES;
D O I
10.1145/2671188.2749348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-modal retrieval increasingly relies on joint statistical models built from large amounts of data represented according to several modalities. However, some information that is poorly represented by these models can be very significant for a retrieval task. We show that, by appropriately identifying and taking such information into account, the results of cross-modal retrieval can be strongly improved. We apply our model to three benchmarks for the text illustration task and find that the more data has misrepresented information, the more our model is comparatively effective.
引用
收藏
页码:551 / 554
页数:4
相关论文
共 9 条
[1]  
[Anonymous], P ACM INT C MULT MM
[2]   On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval [J].
Costa Pereira, Jose ;
Coviello, Emanuele ;
Doyle, Gabriel ;
Rasiwasia, Nikhil ;
Lanckriet, Gert R. G. ;
Levy, Roger ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) :521-535
[3]  
Feng Y., 2010, Proceedings of Human Language Technologies: The 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010), P831
[4]   A Multi-View Embedding Space for Modeling Internet Images, Tags, and Their Semantics [J].
Gong, Yunchao ;
Ke, Qifa ;
Isard, Michael ;
Lazebnik, Svetlana .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 106 (02) :210-233
[5]   Canonical correlation analysis: An overview with application to learning methods [J].
Hardoon, DR ;
Szedmak, S ;
Shawe-Taylor, J .
NEURAL COMPUTATION, 2004, 16 (12) :2639-2664
[6]   Relations between two sets of variates [J].
Hotelling, H .
BIOMETRIKA, 1936, 28 :321-377
[7]   Learning the Relative Importance of Objects from Tagged Images for Retrieval and Cross-Modal Search [J].
Hwang, Sung Ju ;
Grauman, Kristen .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 100 (02) :134-153
[8]  
Mao X, 2013, P 21 ACM MULT
[9]  
Sermanet Pierre, 2013, ABS13126229 CORR