Image classification based on improved VLAD

被引:0
|
作者
Xianzhong Long
Hongtao Lu
Yong Peng
Xianzhong Wang
Shaokun Feng
机构
[1] Nanjing University of Posts and Telecommunications,School of Computer Science & Technology, School of Software
[2] Shanghai Jiao Tong University,Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Department of Computer Science and Engineering
来源
Multimedia Tools and Applications | 2016年 / 75卷
关键词
Image classification; Scale-invariant feature transform; Vector of locally aggregated descriptors; K-means clustering algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, a coding scheme called vector of locally aggregated descriptors (VLAD) has got tremendous successes in large scale image retrieval due to its efficiency of compact representation. VLAD employs only the nearest neighbor visual word in dictionary to aggregate each descriptor feature. It has fast retrieval speed and high retrieval accuracy under small dictionary size. In this paper, we give three improved VLAD variations for image classification: first, similar to the bag of words (BoW) model, we count the number of descriptors belonging to each cluster center and add it to VLAD; second, in order to expand the impact of residuals, squared residuals are taken into account; thirdly, in contrast with one nearest neighbor visual word, we try to look for two nearest neighbor visual words for aggregating each descriptor. Experimental results on UIUC Sports Event, Corel 10 and 15 Scenes datasets show that the proposed methods outperform some state-of-the-art coding schemes in terms of the classification accuracy and computation speed.
引用
收藏
页码:5533 / 5555
页数:22
相关论文
共 50 条
  • [1] Image classification based on improved VLAD
    Long, Xianzhong
    Lu, Hongtao
    Peng, Yong
    Wang, Xianzhong
    Feng, Shaokun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (10) : 5533 - 5555
  • [2] Scene Image Classification Based on Improved VLAD Reprensentation
    Zhang, Zhiyi
    Long, Xianzhong
    Li, Yun
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] VLAD Encoding Based on LLC for Image Classification
    Cheng, Cheng
    Long, Xianzhong
    Li, Yun
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 417 - 422
  • [4] A New VLAD Method with Dense SIFT Selection Application in Image Classification
    Qian, Zhi
    Hong, Qijun
    Huang, Gang
    Liu, Pingping
    Yan, Yuanjie
    Xie, Min
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ARTIFICIAL INTELLIGENCE (CAAI 2017), 2017, 134 : 568 - 574
  • [5] Image Classification Based on Improved Random Forest Algorithm
    Man, Weishi
    Ji, Yuanyuan
    Zhang, Zhiyu
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 346 - 350
  • [6] An Improved Method for Pneumonia Image Classification Based on CoAtNet
    You, Siyu
    Qin, Yunhan
    Yan, Longcheng
    Zhang, Houpeng
    Zhu, Jiaxian
    Yu, Shiyu
    Wu, Zhengyi
    Toe, Teoh Teik
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 1052 - 1056
  • [7] Fusion of Multiple VLAD Vectors Based on Different Features for Image Retrieval
    Wang, Yanhong
    Cen, Yigang
    Zhao, Ruizhen
    Kan, Shichao
    Hu, Shaohai
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 742 - 746
  • [8] Garbage Image Classification Based on Improved Residual Neural Networks
    Zhao, Ling-ying
    Zhao, Hong-yu
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 7 - 13
  • [9] A Classification of Remote Sensing Image Based on Improved Compound Kernels of Svm
    Zhao, Jianing
    Gao, Wanlin
    Liu, Zili
    Mou, Guifen
    Lu, Lin
    Yu, Lina
    COMPUTER AND COMPUTING TECHNOLOGIES IN AGRICULTURE III, 2010, 317 : 15 - +
  • [10] Improved Tamura Features for Image Classification using Kernel based Descriptors
    Karmakar, Priyabrata
    Teng, Shyh Wei
    Zhang, Dengsheng
    Liu, Ying
    Lu, Guojun
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 461 - 467