Towards large-scale multimedia retrieval enriched by knowledge about human interpretationRetrospective survey

被引:0
作者
Kimiaki Shirahama
Marcin Grzegorzek
机构
[1] University of Siegen,Pattern Recognition Group
来源
Multimedia Tools and Applications | 2016年 / 75卷
关键词
Large-scale multimedia retrieval; Human-machine cooperation; Machine-based methods; Human-based methods;
D O I
暂无
中图分类号
学科分类号
摘要
Recent Large-Scale Multimedia Retrieval (LSMR) methods seem to heavily rely on analysing a large amount of data using high-performance machines. This paper aims to warn this research trend. We advocate that the above methods are useful only for recognising certain primitive meanings, knowledge about human interpretation is necessary to derive high-level meanings from primitive ones. We emphasise this by conducting a retrospective survey on machine-based methods which build classifiers based on features, and human-based methods which exploit user annotation and interaction. Our survey reveals that due to prioritising the generality and scalability for large-scale data, knowledge about human interpretation is left out by recent methods, while it was fully used in classical methods. Thus, we defend the importance of human-machine cooperation which incorporates the above knowledge into LSMR. In particular, we define its three future directions (cognition-based, ontology-based and adaptive learning) depending on types of knowledge, and suggest to explore each direction by considering its relation to the others.
引用
收藏
页码:297 / 331
页数:34
相关论文
共 5 条
  • [1] Towards large-scale multimedia retrieval enriched by knowledge about human interpretation
    Shirahama, Kimiaki
    Grzegorzek, Marcin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (01) : 297 - 331
  • [2] Human-Machine Cooperation in Large-Scale Multimedia Retrieval: A Survey
    Shirahama, Kimiaki
    Grzegorzek, Marcin
    Indurkhya, Bipin
    JOURNAL OF PROBLEM SOLVING, 2015, 8 (01): : 36 - 63
  • [3] Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval
    Lu, Xu
    Zhu, Lei
    Cheng, Zhiyong
    Li, Jingjing
    Nie, Xiushan
    Zhang, Huaxiang
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1129 - 1137
  • [4] An Adaptive Search Path Traverse for Large-scale Video Frame Retrieval
    Diep Thi-Ngoc Nguyen
    Kiyoki, Yasushi
    INFORMATION MODELLING AND KNOWLEDGE BASES XXVI, 2014, 272 : 324 - 342
  • [5] Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval
    Liu, Song
    Qian, Shengsheng
    Guan, Yang
    Zhan, Jiawei
    Ying, Long
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1379 - 1388