A survey on automatic image annotation

被引:20
作者
Chen, Yilu [1 ]
Zeng, Xiaojun [1 ]
Chen, Xing [1 ]
Guo, Wenzhong [1 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Fujian, Peoples R China
关键词
Computer vision; Image annotation; Tag assignment; Image retrieval; MAXIMUM-LIKELIHOOD-ESTIMATION; MACHINE TRANSLATION; MODEL; SHAPE;
D O I
10.1007/s10489-020-01696-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic image annotation is a crucial area in computer vision, which plays a significant role in image retrieval, image description, and so on. Along with the internet technique developing, there are numerous images posted on the web, resulting in the fact that it is a challenge to annotate images only by humans. Hence, many computer vision researchers are interested in automatic image annotation and make a great effort in optimizing its performance. Automatic image annotation is a task that assigns several tags in a limited vocabulary to describe an image. There are many algorithms proposed to tackle this problem and all achieve great performance. In this paper, we review seven algorithms for automatic image annotation and evaluate these algorithms leveraging different image features, such as color histograms and Gist descriptor. Our goal is to provide insights into the automatic image annotation. A lot of comprehensive experiments, which are based on Corel5K, IAPR TC-12, and ESP Game datasets, are designed to compare the performance of these algorithms. We also compare the performance of traditional algorithms employing deep learning features. Considering that not all associated labels are annotated by human annotators, we leverage the DIA metrics on IAPR TC-12 and ESP Game datasets.
引用
收藏
页码:3412 / 3428
页数:17
相关论文
共 66 条
[1]  
Adeniyi DA, 2016, APPL COMPUT INFORM, V12, pS221083271400026X
[2]  
Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
[3]  
[Anonymous], FOUND TRENDS MACH LE
[4]  
[Anonymous], 2012, ADV NEURAL INFORM PR
[5]  
[Anonymous], P C COMP VIS PATT RE
[6]   Building and using fuzzy multimedia ontologies for semantic image annotation [J].
Bannour, Hichem ;
Hudelot, Celine .
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 72 (03) :2107-2141
[7]   Robust Maximum Likelihood Estimation [J].
Bertsimas, Dimitris ;
Nohadani, Omid .
INFORMS JOURNAL ON COMPUTING, 2019, 31 (03) :445-458
[8]  
Brinker K, 2007, P INT JOINT C ART IN
[9]   Shape annotation for intelligent image retrieval [J].
Castellano, Giovanna ;
Fanelli, Anna M. ;
Sforza, Gianluca ;
Torsello, M. Alessandra .
APPLIED INTELLIGENCE, 2016, 44 (01) :179-195
[10]   The devil is in the details: an evaluation of recent feature encoding methods [J].
Chatfield, Ken ;
Lempitsky, Victor ;
Vedaldi, Andrea ;
Zisserman, Andrew .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,