The image annotation algorithm using convolutional features from intermediate layer of deep learning

被引：69

作者：

Chen, Yuantao ^{[1
,2
]}

Liu, Linwu ^{[1
,2
]}

Tao, Jiajun ^{[1
,2
]}

Chen, Xi ^{[1
,2
]}

Xia, Runlong ^{[3
]}

Zhang, Qian ^{[4
]}

Xiong, Jie ^{[5
]}

Yang, Kai ^{[4
]}

Xie, Jingbo ^{[3
]}

机构：

[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Hunan, Peoples R China

[2] Changsha Univ Sci & Technol, Hunan Prov Key Lab Intelligent Proc Big Data Tran, Changsha 410114, Hunan, Peoples R China

[3] Hunan Inst Sci & Tech Informat, Changsha 411105, Hunan, Peoples R China

[4] Hunan ZOOMLION Intelligent Technol Corp Ltd, Dept Elect Prod, Changsha 410005, Hunan, Peoples R China

[5] Yangtze Univ, Elect & Informat Sch, Jingzhou 434023, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2021年 / 80卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Deep learning; Image annotation; Convolutional results; Positive mean vector; Eigenvector;

D O I：

10.1007/s11042-020-09887-2

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The automatic image annotation is an effective computer operation that predicts the annotation of an unknown image by automatically learning potential relationships between the semantic concept space and the visual feature space in the annotation image dataset. Usually, the auto-labeling image includes the processing: learning processing and labeling processing. Existing image annotation methods that employ convolutional features of deep learning methods have a number of limitations, including complex training and high space/time expenses associated with the image annotation procedure. Accordingly, this paper proposes an innovative method in which the visual features of the image are presented by the intermediate layer features of deep learning, while semantic concepts are represented by mean vectors of positive samples. Firstly, the convolutional result is directly output in the form of low-level visual features through the mid-level of the pre-trained deep learning model, with the image being represented by sparse coding. Secondly, the positive mean vector method is used to construct visual feature vectors for each text vocabulary item, so that a visual feature vector database is created. Finally, the visual feature vector similarity between the testing image and all text vocabulary is calculated, and the vocabulary with the largest similarity used for annotation. Experiments on the datasets demonstrate the effectiveness of the proposed method; in terms of F1 score, the proposed method's performance on the Corel5k dataset and IAPR TC-12 dataset is superior to that of MBRM, JEC-AF, JEC-DF, and 2PKNN with end-to-end deep features.

引用

页码：4237 / 4261

页数：25

共 37 条

[1] ConceptRank for search-based image annotation [J].

Budikova, Petra ;

Batko, Michal ;

Zezula, Pavel .

MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (07) :8847-8882

[2]

Chen YB, 2020, J CONTEMP CHINA, V29, P1, DOI [10.1080/10670564.2019.1621526, 10.1007/s12652-020-02066-z, 10.1080/01932691.2020.1791172]

[3] Freeze-Drying Formulations Increased the Adenovirus and Poxvirus Vaccine Storage Times and Antigen Stabilities [J].

Chen, Ye ;

Liao, Qibin ;

Chen, Tianyue ;

Zhang, Yuchao ;

Yuan, Weien ;

Xu, Jianqing ;

Zhang, Xiaoyan .

VIROLOGICA SINICA, 2021, 36 (03) :365-372

[4] The face image super-resolution algorithm based on combined representation learning [J].

Chen, Yuantao ;

Phonevilay, Volachith ;

Tao, Jiajun ;

Chen, Xi ;

Xia, Runlong ;

Zhang, Qian ;

Yang, Kai ;

Xiong, Jie ;

Xie, Jingbo .

MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) :30839-30861

[5] Research on image Inpainting algorithm of improved GAN based on two-discriminations networks [J].

Chen, Yuantao ;

Zhang, Haopeng ;

Liu, Linwu ;

Chen, Xi ;

Zhang, Qian ;

Yang, Kai ;

Xia, Runlong ;

Xie, Jingbo .

APPLIED INTELLIGENCE, 2021, 51 (06) :3460-3474

[6] Saliency Detection via the Improved Hierarchical Principal Component Analysis Method [J].

Chen, Yuantao ;

Tao, Jiajun ;

Zhang, Qian ;

Yang, Kai ;

Chen, Xi ;

Xiong, Jie ;

Xia, Runlong ;

Xie, Jingbo .

WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020

[7] RETRACTED: Multiscale fast correlation filtering tracking algorithm based on a feature fusion model (Retracted article. See vol. 34, 2022) [J].

Chen, Yuantao ;

Wang, Jin ;

Liu, Songjie ;

Chen, Xi ;

Xiong, Jie ;

Xie, Jingbo ;

Yang, Kai .

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (15)

[8] RETRACTED: The visual object tracking algorithm research based on adaptive combination kernel (Retracted Article) [J].

Chen, Yuantao ;

Wang, Jin ;

Xia, Runlong ;

Zhang, Qian ;

Cao, Zhouhong ;

Yang, Kai .

JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (12) :4855-4867

[9] The fire recognition algorithm using dynamic feature fusion and IV-SVM classifier [J].

Chen, Yuantao ;

Xu, Weihong ;

Zuo, Jingwen ;

Yang, Kai .

CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3) :S7665-S7675

[10] A survey and analysis on automatic image annotation [J].

Cheng, Qimin ;

Zhang, Qian ;

Fu, Peng ;

Tu, Conghuan ;

Li, Sen .

PATTERN RECOGNITION, 2018, 79 :242-259

← 1 2 3 4 →