A survey and analysis on automatic image annotation

被引:75
作者
Cheng, Qimin [1 ]
Zhang, Qian [1 ]
Fu, Peng [2 ]
Tu, Conghuan [1 ]
Li, Sen [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Hubei, Peoples R China
[2] Indiana State Univ, Dept Earth & Environm Syst, Terre Haute, IN 47809 USA
基金
中国国家自然科学基金;
关键词
Automatic image annotation; Generative model; Nearest-neighbor model; Discriminative model; Tag-completion; Deep learning; TAG COMPLETION; MODEL; GRAPH; RANK; RETRIEVAL;
D O I
10.1016/j.patcog.2018.02.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, image annotation has attracted extensive attention due to the explosive growth of image data. With the capability of describing images at the semantic level, image annotation has many applications not only in image analysis and understanding but also in some relative disciplines, such as urban management and biomedical engineering. Because of the inherent weaknesses of manual image annotation, Automatic Image Annotation (AIA) has been raised since the late 1990s. In this paper, a deep review of state-of-the-art AIA methods is presented by synthesizing 138 literatures published during the past two decades. We classify AIA methods into five categories: 1) Generative model-based image annotation, 2) Nearest neighbor-based image annotation, 3) Discriminative model-based image annotation, and 4) Tag completion-based image annotation, 5) Deep Learning-based image annotation. Comparisons of the five types of AIA methods are made on the basis of the underlying idea, main contribution, model framework, computational complexity, computation time, and annotation accuracy. We also give an overview of five publicly available image datasets and four standard evaluation metrics commonly used as benchmarks for evaluating AIA methods. Then the performance of some typical or well-behaved models is assessed based on benchmark dataset and standard evaluation metrics. Finally, we share our viewpoints on the open issues and challenges in AIA as well as research trends in the future. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:242 / 259
页数:18
相关论文
共 99 条
[1]   Review on Statistical Approaches for Automatic Image Annotation [J].
Abd Manaf, Syaifulnizam ;
Nordin, Md Jan .
2009 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS, VOLS 1 AND 2, 2009, :56-61
[2]  
[Anonymous], 2009, ACM INT C IM VID RET
[3]  
[Anonymous], INT C MACH LEARN
[4]  
[Anonymous], 2007, THESIS
[5]  
[Anonymous], 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), DOI DOI 10.1109/CVPR.2006.167
[6]  
[Anonymous], 2010, P ACM MULTIMEDIA
[7]  
[Anonymous], P 5 NAT C COMP VIS P
[8]  
[Anonymous], 2013, ARXIV13124894
[9]   Hidden-Concept Driven Multilabel Image Annotation and Label Ranking [J].
Bao, Bing-Kun ;
Li, Teng ;
Yan, Shuicheng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) :199-210
[10]  
Bar-Hillel AB, 2005, J MACH LEARN RES, V6, P937