Automatic image annotation using model fusion and multi-label selection algorithm

被引:4
作者
Wang, Liqin [1 ,2 ]
Zhang, Aofan [1 ]
Wang, Peng [1 ,2 ]
Dong, Yongfeng [1 ,2 ]
机构
[1] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin, Peoples R China
[2] Hebei Univ Technol, Hebei Prov Key Lab Big Data Calculat, Tianjin, Peoples R China
关键词
Automatic image annotation; deep learning; CNN; model fusion; multi-label selection;
D O I
10.3233/JIFS-182587
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Image Annotation (AIA) aims to provide a semantic description for the content of image by assigning a set of textual labels. The recent approaches mainly focus on the improvement of single model and neglect the potential advantages of different models. In order to make full use of the advantages of different annotation models, Dual Model based on Multi-Label Selection Algorithm(DM-SA) is proposed in this research which combines a discriminative model with a nearest-neighbor-based model. The algorithm takes consideration of the advantages of each model, thus provides better annotation performance. A deep Convolutional Neural Network (CNN) is used to obtain visual representation of images first, then a discriminative model, CNN with Label Smoothing (CNN-LS), and a nearest-neighbor-based model, 2PKNN with Canonical Correlation Analysis (2PKNN-CCA) generate candidate label set respectively. Finally, a multi-label selection algorithm based on inverse document frequency is adopted to assign the final labels from two candidate label sets. Experimental results based on Corel5K and IAPRTC-12 datasets show that the proposed method can achieve state-of-the-art performance for average recall, 0.52 and 0.42 on Corel5K and IAPRTC-12 respectively.
引用
收藏
页码:4999 / 5008
页数:10
相关论文
共 26 条
[1]   Automatic image annotation using semi-supervised generative modeling [J].
Amiri, S. Hamid ;
Jamzad, Mansour .
PATTERN RECOGNITION, 2015, 48 (01) :174-188
[2]  
[Anonymous], 2018, SCIENTOMETRICS, DOI [DOI 10.1007/S11192-017-2571-Z, 10.1007/s11192-017-2571-z]
[3]  
[Anonymous], ADV NEURAL INFORM PR
[4]  
[Anonymous], INT TRENDS 2018 COD
[5]  
[Anonymous], 2016, P 7 S INF COMM, DOI DOI 10.1145/3011077.3011118
[6]   Image Super-Resolution Using Deep Convolutional Networks [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307
[7]  
Feng SL, 2004, PROC CVPR IEEE, P1002
[8]  
Glorot X., 2010, P 13 INT C ART INT S, P249
[9]   A Survey on Ensemble Learning for Data Stream Classification [J].
Gomes, Heitor Murilo ;
Barddal, Jean Paul ;
Enembreck, Fabricio ;
Bifet, Albert .
ACM COMPUTING SURVEYS, 2017, 50 (02)
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778