CNN-feature based automatic image annotation method

被引：0

作者：

Yanchun Ma

Yongjian Liu

Qing Xie

Lin Li

机构：

[1] Wuhan University of Technology,School of Computer Science and Technology

来源：

Multimedia Tools and Applications | 2019年 / 78卷

关键词：

Image annotation; Convolutional neural network; Feature extraction; Semantic extension;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Automatic image annotation(AIA) methods are considered as a kind of efficient schemes to solve the problem of semantic-gap between the original images and their semantic information. However, traditional annotation models work well only with finely crafted manual features. To address this problem, we combined the CNN feature of an image into our proposed model which we referred as SEM by using a famous CNN model-AlexNet. We extracted a CNN feature by removing its final layer and it is proved to be useful in our SEM model. Additionally, based on the experience of the traditional KNN models, we propose a model to address the problem of simultaneously addressing the image tag refinement and assignment while maintaining the simplicity of the KNN model. The proposed model divides the images which have similar features into a semantic neighbor group. Moreover, utilizing a self-defined Bayesian-based model, we distribute the tags which belong to the neighbor group to the test images according to the distance between the test image and the neighbors. At last, the experiments are performed on three typical image datasets corel5k, espGame and laprtc12, which verify the effectiveness of the proposed model.

引用

页码：3767 / 3780

页数：13

共 86 条

[1] Cusano C(2003)Image annotation using SVM Proc SPIE 1 330-338
[2] Bicocca M(2008)Image retrieval: ideas, influences, and trends of the new age ACM Comput Surv 40 1-60
[3] Bicocca V(2004)Multiple Bernoulli relevance models for image and video annotation Proc 2004 IEEE Comput Soc Confon Comput Vis Pattern Recogn 2004 CVPR 2004 2 1002-1009
[4] Datta R(2017)ImageNet classification with deep convolutional neural networks Commun ACM 60 84-90
[5] Joshi D(2015)Weakly supervised deep metric learning for community-contributed image retrieval IEEE Trans Multimed 17 1989-1999
[6] Li J(2015)Robust structured subspace learning for data representation IEEE Trans Pattern Anal Mach Intell 37 2085-2098
[7] Wang JZ(2017)Weakly supervised deep matrix factorization for social image understanding IEEE Trans Image Process 26 276-288
[8] Feng SL(2018)Robust discrete code modeling for supervised hashing Pattern Recogn 75 128-135
[9] Manmatha R(2010)A new baselines for image annotation Int J Comput Vis 90 88-105
[10] Lavrenko V(2017)Graph self-representation method for unsupervised feature selection Neurocomputing 220 130-137

← 1 2 3 4 5 6 7 8 9 →