Automatic image annotation via category labels

被引：2

作者：

Weifeng Zhang

Hua Hu

Haiyang Hu

Jing Yu

机构：

[1] Jiaxing University,College of Mathematics, Physics and Information Engineering

[2] Hangzhou Dianzi University,School of Computer Science and Technology

[3] Hangzhou Normal University,School of Information Science and Engineering

[4] Chinese Academy of Sciences,Institute of Information Engineering

来源：

Multimedia Tools and Applications | 2020年 / 79卷

关键词：

Automatic image annotation; Image understanding; Deep learning; Sparse coding;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Automatic image annotation aims to assign relevant keywords to images and has become a research focus. Although many techniques have been proposed to solve this problem in the last decade, giving promissing performance on standard datasets, we propose a novel automatic image annotation technique in this paper. Our method uses a label transfer mechanism to automatically recommend those promising tags to each image by using the category information of images. As image representation is one of the key technique in image annotation, we use sparse coding based spatial pyramid matching and deep convolutional neural networks to model image features. And metric learning technique is further used to combine these features to achieve more effective image representation in this paper. Experimental results illustrate that the proposed method get similar or better results than the state-of-the-art methods on three standard image datasets.

引用

页码：11421 / 11435

页数：14

共 36 条

[1] Barnard K(2005)Word sense disambiguation with pictures Artif Intell 167 13-30
[2] Jordan MI(2003)Automatic linguistic indexing of pictures by a statistical modeling approach IEEE PAMI 25 1075-1088
[3] Li J(2013)Mlrank: multi-correlation learning to rank for image annotation Pattern Recogn 46 2700-2710
[4] Wang J(2007)Using large-scale web data to facilitate textual query based retrieval of consumer photos ACM MM 163 1277-1283
[5] Li Z(2016)Optimized graph learning using partial tags and multiple features for image and video annotation IEEE Trans Image Process 25 4999-5011
[6] Liu J(2018)Self-supervised video hashing with hierarchical binary auto-encoder IEEE Trans Image Process 27 3210-3221
[7] Xu C(2017)Beyond frame-level cnn: saliency-aware 3-d cnn with lstm for video action recognition IEEE Signal Process Lett 24 510-514
[8] Lu H(2018)Two-stream 3d convnet fusion for action recognition in videos with arbitrary size and length IEEE Trans Multimed 20 634-644
[9] Liu Y(2018)Training visual-semantic embedding network for boosting automatic image annotation Neural Process Lett 3 1-17
[10] Xu D(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 4 →