Towards large-scale multimedia retrieval enriched by knowledge about human interpretation: Retrospective survey

Cited by: 0

Authors:

Kimiaki Shirahama

Marcin Grzegorzek

Affiliations:

[1] University of Siegen, Pattern Recognition Group

Source:

Multimedia Tools and Applications | 2016 / Vol. 75

Keywords:

Large-scale multimedia retrieval; Human-machine cooperation; Machine-based methods; Human-based methods

DOI:

Not available

Abstract:

Recent Large-Scale Multimedia Retrieval (LSMR) methods seem to rely heavily on analysing a large amount of data using high-performance machines. This paper aims to caution against this research trend. We argue that the above methods are useful only for recognising certain primitive meanings, and that knowledge about human interpretation is necessary to derive high-level meanings from primitive ones. We emphasise this by conducting a retrospective survey of machine-based methods, which build classifiers based on features, and human-based methods, which exploit user annotation and interaction. Our survey reveals that, because recent methods prioritise generality and scalability for large-scale data, they leave out knowledge about human interpretation, whereas classical methods used it fully. Thus, we defend the importance of human-machine cooperation, which incorporates the above knowledge into LSMR. In particular, we define three future directions for it (cognition-based, ontology-based and adaptive learning) depending on the type of knowledge, and suggest exploring each direction by considering its relation to the others.

Pages: 297-331

Page count: 34

5 records in total

[1] Towards large-scale multimedia retrieval enriched by knowledge about human interpretation
Shirahama, Kimiaki
Grzegorzek, Marcin
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (01) : 297 - 331
[2] Human-Machine Cooperation in Large-Scale Multimedia Retrieval: A Survey
Shirahama, Kimiaki
Grzegorzek, Marcin
Indurkhya, Bipin
JOURNAL OF PROBLEM SOLVING, 2015, 8 (01) : 36 - 63
[3] Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval
Lu, Xu
Zhu, Lei
Cheng, Zhiyong
Li, Jingjing
Nie, Xiushan
Zhang, Huaxiang
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019 : 1129 - 1137
[4] An Adaptive Search Path Traverse for Large-scale Video Frame Retrieval
Diep Thi-Ngoc Nguyen
Kiyoki, Yasushi
INFORMATION MODELLING AND KNOWLEDGE BASES XXVI, 2014, 272 : 324 - 342
[5] Joint-modal Distribution-based Similarity Hashing for Large-scale Unsupervised Deep Cross-modal Retrieval
Liu, Song
Qian, Shengsheng
Guan, Yang
Zhan, Jiawei
Ying, Long
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020 : 1379 - 1388