An Overview of Cross-Media Retrieval: Concepts, Methodologies, Benchmarks, and Challenges

被引:227
作者
Peng, Yuxin [1 ]
Huang, Xin [1 ]
Zhao, Yunzhen [1 ]
机构
[1] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-media retrieval; overview; concepts; methodologies; benchmarks; challenges; RANK; REPRESENTATION; IMAGES; SPARSE;
D O I
10.1109/TCSVT.2017.2705068
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multimedia retrieval plays an indispensable role in big data utilization. Past efforts mainly focused on single-media retrieval. However, the requirements of users are highly flexible, such as retrieving the relevant audio clips with one query of image. So challenges stemming from the "media gap," which means that representations of different media types are inconsistent, have attracted increasing attention. Cross-media retrieval is designed for the scenarios where the queries and retrieval results are of different media types. As a relatively new research topic, its concepts, methodologies, and benchmarks are still not clear in the literature. To address these issues, we review more than 100 references, give an overview including the concepts, methodologies, major challenges, and open issues, as well as build up the benchmarks, including data sets and experimental results. Researchers can directly adopt the benchmarks to promptly evaluate their proposed methods. This will help them to focus on algorithm design, rather than the time-consuming compared methods and results. It is noted that we have constructed a new data set XMedia, which is the first publicly available data set with up to five media types (text, image, video, audio, and 3-D model). We believe this overview will attract more researchers to focus on cross-media retrieval and be helpful to them.
引用
收藏
页码:2372 / 2385
页数:14
相关论文
共 107 条
[1]  
Akaho S., 2006, ARXIVCS0609071
[2]  
Andrew Galen, 2010, INT C MACH LEARN, P3408
[3]  
[Anonymous], 2016, INT JOINT C ART INT
[4]  
[Anonymous], 2013, ICML
[5]  
[Anonymous], 2017, Nips 2016 tutorial: Generative adversarial networks
[6]  
[Anonymous], 2010, P 18 ACM INT C MULT
[7]  
[Anonymous], 2003, P 26 ANN INT ACM SIG
[8]  
[Anonymous], 2013, P ACM INT C MULTIMED
[9]  
[Anonymous], 2013, PROC 27 AAAI C ARTIF
[10]  
[Anonymous], MSRTR201086