Towards large-scale multimedia retrieval enriched by knowledge about human interpretation

被引:14
作者
Shirahama, Kimiaki [1 ]
Grzegorzek, Marcin [1 ]
机构
[1] Univ Siegen, Pattern Recognit Grp, D-57076 Siegen, Germany
关键词
Large-scale multimedia retrieval; Human-machine cooperation; Machine-based methods; Human-based methods; RELEVANCE FEEDBACK; IMAGE RETRIEVAL; EVENT DETECTION; SEARCH METHOD; VIDEO DATA; OBJECT; QUERY; INFORMATION; DISCOVERY; FRAMEWORK;
D O I
10.1007/s11042-014-2292-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent Large-Scale Multimedia Retrieval (LSMR) methods seem to heavily rely on analysing a large amount of data using high-performance machines. This paper aims to warn this research trend. We advocate that the above methods are useful only for recognising certain primitive meanings, knowledge about human interpretation is necessary to derive high-level meanings from primitive ones. We emphasise this by conducting a retrospective survey on machine-based methods which build classifiers based on features, and human-based methods which exploit user annotation and interaction. Our survey reveals that due to prioritising the generality and scalability for large-scale data, knowledge about human interpretation is left out by recent methods, while it was fully used in classical methods. Thus, we defend the importance of human-machine cooperation which incorporates the above knowledge into LSMR. In particular, we define its three future directions (cognition-based, ontology-based and adaptive learning) depending on types of knowledge, and suggest to explore each direction by considering its relation to the others.
引用
收藏
页码:297 / 331
页数:35
相关论文
共 50 条
  • [1] Towards large-scale multimedia retrieval enriched by knowledge about human interpretationRetrospective survey
    Kimiaki Shirahama
    Marcin Grzegorzek
    Multimedia Tools and Applications, 2016, 75 : 297 - 331
  • [2] Human-Machine Cooperation in Large-Scale Multimedia Retrieval: A Survey
    Shirahama, Kimiaki
    Grzegorzek, Marcin
    Indurkhya, Bipin
    JOURNAL OF PROBLEM SOLVING, 2015, 8 (01): : 36 - 63
  • [3] Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval
    Lu, Xu
    Zhu, Lei
    Cheng, Zhiyong
    Li, Jingjing
    Nie, Xiushan
    Zhang, Huaxiang
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1129 - 1137
  • [4] A gradual approach to knowledge distillation in deep supervised hashing for large-scale image retrieval
    Hussain, Abid
    Li, Heng-Chao
    Hussain, Mehboob
    Ali, Muqadar
    Abbas, Shaheen
    Ali, Danish
    Rehman, Amir
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120
  • [5] Region Division for Large-scale Image Retrieval
    Rao, Yunbo
    Liu, Wei
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (10) : 5197 - 5218
  • [6] Manhattan Hashing for Large-Scale Image Retrieval
    Kong, Weihao
    Li, Wu-Jun
    Guo, Minyi
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 45 - 54
  • [7] Large-scale image retrieval using transductive support vector machines
    Cevikalp, Hakan
    Elmas, Merve
    Ozkan, Savas
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 173 : 2 - 12
  • [8] Large-scale image retrieval with supervised sparse hashing
    Xu, Yan
    Shen, Fumin
    Xu, Xing
    Gao, Lianli
    Wang, Yuan
    Tan, Xiao
    NEUROCOMPUTING, 2017, 229 : 45 - 53
  • [9] Coupled Binary Embedding for Large-Scale Image Retrieval
    Zheng, Liang
    Wang, Shengjin
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (08) : 3368 - 3380
  • [10] Large-scale Image Retrieval with Sparse Binary Projections
    Ma, Changyi
    Gu, Chonglin
    Li, Wenye
    Cui, Shuguang
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1817 - 1820