Image classification based on improved VLAD

被引：0

作者：

Xianzhong Long

Hongtao Lu

Yong Peng

Xianzhong Wang

Shaokun Feng

机构：

[1] Nanjing University of Posts and Telecommunications,School of Computer Science & Technology, School of Software

[2] Shanghai Jiao Tong University,Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering

来源：

Multimedia Tools and Applications | 2016年 / 75卷

关键词：

Image classification; Scale-invariant feature transform; Vector of locally aggregated descriptors; K-means clustering algorithm;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Recently, a coding scheme called vector of locally aggregated descriptors (VLAD) has got tremendous successes in large scale image retrieval due to its efficiency of compact representation. VLAD employs only the nearest neighbor visual word in dictionary to aggregate each descriptor feature. It has fast retrieval speed and high retrieval accuracy under small dictionary size. In this paper, we give three improved VLAD variations for image classification: first, similar to the bag of words (BoW) model, we count the number of descriptors belonging to each cluster center and add it to VLAD; second, in order to expand the impact of residuals, squared residuals are taken into account; thirdly, in contrast with one nearest neighbor visual word, we try to look for two nearest neighbor visual words for aggregating each descriptor. Experimental results on UIUC Sports Event, Corel 10 and 15 Scenes datasets show that the proposed methods outperform some state-of-the-art coding schemes in terms of the classification accuracy and computation speed.

引用

页码：5533 / 5555

页数：22

共 50 条

[31] Image classification based on nearest neighbor basis vectors
Long, Xianzhong
Lu, Hongtao
Li, Wenbin
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (03) : 1559 - 1576
[32] Image classification based on nearest neighbor basis vectors
Xianzhong Long
Hongtao Lu
Wenbin Li
Multimedia Tools and Applications, 2014, 71 : 1559 - 1576
[33] Image Classification Based On Image Pixel Value
Arya, Mukesh Chandra
Semwal, Ashish
2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
[34] An Improved Bag of Visual Words Model for Image Classification
Guo Ye
Meng Qingchao
AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03): : 3261 - 3265
[35] IMAGE CLASSIFICATION BASED ON FOCUS
Patel, Mehul B.
Rodriguez, Jeffrey J.
Gmitro, Arthur F.
2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 397 - 400
[36] A multiscale retinex for improved performance in multispectral image classification
Thompson, B
Rahman, Z
Park, S
VISUAL INFORMATION PROCESSING IX, 2000, 4041 : 34 - 44
[37] An Improved Convolutional Neural Network Architecture for Image Classification
Ferreyra-Ramirez, A.
Aviles-Cruz, C.
Rodriguez-Martinez, E.
Villegas-Cortez, J.
Zuniga-Lopez, A.
PATTERN RECOGNITION, MCPR 2019, 2019, 11524 : 89 - 101
[38] Hyperdimensional Computing Encoding Schemes for Improved Image Classification
Miranda, Victor
d'Aliberti, Olivia
2022 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR HOMELAND SECURITY (HST), 2022,
[39] Improved Kernel Descriptors for Effective and Efficient Image Classification
Karmakarl, Priyabrata
Teng, Shyh Wei
Zhang, Dengsheng
Liu, Ying
Lu, Guojun
2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 195 - 202
[40] Improved deep learning image classification algorithm based on Swin Transformer V2
Wei, Jiangshu
Chen, Jinrong
Wang, Yuchao
Luo, Hao
Li, Wujie
PEERJ COMPUTER SCIENCE, 2023, 9

← 1 2 3 4 5 →