Image classification based on improved VLAD

被引:0
|
作者
Xianzhong Long
Hongtao Lu
Yong Peng
Xianzhong Wang
Shaokun Feng
机构
[1] Nanjing University of Posts and Telecommunications,School of Computer Science & Technology, School of Software
[2] Shanghai Jiao Tong University,Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering
来源
Multimedia Tools and Applications | 2016年 / 75卷
关键词
Image classification; Scale-invariant feature transform; Vector of locally aggregated descriptors; K-means clustering algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, a coding scheme called vector of locally aggregated descriptors (VLAD) has got tremendous successes in large scale image retrieval due to its efficiency of compact representation. VLAD employs only the nearest neighbor visual word in dictionary to aggregate each descriptor feature. It has fast retrieval speed and high retrieval accuracy under small dictionary size. In this paper, we give three improved VLAD variations for image classification: first, similar to the bag of words (BoW) model, we count the number of descriptors belonging to each cluster center and add it to VLAD; second, in order to expand the impact of residuals, squared residuals are taken into account; thirdly, in contrast with one nearest neighbor visual word, we try to look for two nearest neighbor visual words for aggregating each descriptor. Experimental results on UIUC Sports Event, Corel 10 and 15 Scenes datasets show that the proposed methods outperform some state-of-the-art coding schemes in terms of the classification accuracy and computation speed.
引用
收藏
页码:5533 / 5555
页数:22
相关论文
共 50 条
  • [31] Image classification based on nearest neighbor basis vectors
    Long, Xianzhong
    Lu, Hongtao
    Li, Wenbin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (03) : 1559 - 1576
  • [32] Image classification based on nearest neighbor basis vectors
    Xianzhong Long
    Hongtao Lu
    Wenbin Li
    Multimedia Tools and Applications, 2014, 71 : 1559 - 1576
  • [33] Image Classification Based On Image Pixel Value
    Arya, Mukesh Chandra
    Semwal, Ashish
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [34] An Improved Bag of Visual Words Model for Image Classification
    Guo Ye
    Meng Qingchao
    AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03): : 3261 - 3265
  • [35] IMAGE CLASSIFICATION BASED ON FOCUS
    Patel, Mehul B.
    Rodriguez, Jeffrey J.
    Gmitro, Arthur F.
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 397 - 400
  • [36] A multiscale retinex for improved performance in multispectral image classification
    Thompson, B
    Rahman, Z
    Park, S
    VISUAL INFORMATION PROCESSING IX, 2000, 4041 : 34 - 44
  • [37] An Improved Convolutional Neural Network Architecture for Image Classification
    Ferreyra-Ramirez, A.
    Aviles-Cruz, C.
    Rodriguez-Martinez, E.
    Villegas-Cortez, J.
    Zuniga-Lopez, A.
    PATTERN RECOGNITION, MCPR 2019, 2019, 11524 : 89 - 101
  • [38] Hyperdimensional Computing Encoding Schemes for Improved Image Classification
    Miranda, Victor
    d'Aliberti, Olivia
    2022 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR HOMELAND SECURITY (HST), 2022,
  • [39] Improved Kernel Descriptors for Effective and Efficient Image Classification
    Karmakarl, Priyabrata
    Teng, Shyh Wei
    Zhang, Dengsheng
    Liu, Ying
    Lu, Guojun
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 195 - 202
  • [40] Improved deep learning image classification algorithm based on Swin Transformer V2
    Wei, Jiangshu
    Chen, Jinrong
    Wang, Yuchao
    Luo, Hao
    Li, Wujie
    PEERJ COMPUTER SCIENCE, 2023, 9