Compressing Deep Neural Networks for Recognizing Places

被引:0
|
作者
Saha, Soham [1 ]
Varma, Girish [1 ]
Jawahar, C. V. [1 ]
机构
[1] Int Inst Informat Technol, KCIS, CVIT, Hyderabad, India
来源
PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR) | 2017年
关键词
Visual Place Recognition; Model Compression; Image Retrieval;
D O I
10.1109/ACPR.2017.154
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual place recognition on low memory devices such as mobile phones and robotics systems is a challenging problem. The state of the art models for this task uses deep learning architectures having close to 100 million parameters which takes over 400MB of memory. This makes these models infeasible to be deployed in low memory devices and gives rise to the need of compressing them. Hence we study the effectiveness of model compression techniques like trained quantization and pruning for reducing the number of parameters on one of the best performing image retrieval models called NetVLAD. We show that a compressed network can be created by starting with a model pre-trained for the task of visual place recognition and then fine-tuning it via trained pruning and quantization. The compressed model is able to produce the same mAP as the original uncompressed network. We achieve almost 50% parameter pruning with no loss in mAP and 70% pruning with close to 2% mAP reduction, while also performing 8-bit quantization. Furthermore, together with 5-bit quantization, we perform about 50% parameter reduction by pruning and get only about 3% reduction in mAP. The resulting compressed networks have sizes of around 30MB and 65MB which makes them easily usable in memory constrained devices.
引用
收藏
页码:352 / 357
页数:6
相关论文
共 50 条
  • [11] Performance evaluation of stochastic quantization methods for compressing the deep neural network model
    Choi J.-Y.
    Yoo J.
    Journal of Institute of Control, Robotics and Systems, 2019, 25 (09) : 775 - 781
  • [12] EvoPrunerPool: An Evolutionary Pruner using Pruner Pool for Compressing Convolutional Neural Networks
    Subramanian, S. Sujit
    Arvindram, K.
    Velayutham, C. Shunmuga
    Sathya, Madhusoodhan
    Sengodan, Nathiya
    Kosuri, Divesh
    Satvik, Arvapalli Sai
    Thangavelu, S.
    Jeyakumar, G.
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 2136 - 2143
  • [13] Neural self-compressor: Collective interpretation by compressing multi-layered neural networks into non-layered networks
    Kamimura, Ryotaro
    NEUROCOMPUTING, 2019, 323 : 12 - 36
  • [14] Filter pruning with a feature map entropy importance criterion for convolution neural networks compressing
    Wang, Jielei
    Jiang, Ting
    Cui, Zongyong
    Cao, Zongjie
    NEUROCOMPUTING, 2021, 461 : 41 - 54
  • [15] Compressing Deep Reinforcement Learning Networks With a Dynamic Structured Pruning Method for Autonomous Driving
    Su, Wensheng
    Li, Zhenni
    Xu, Minrui
    Kang, Jiawen
    Niyato, Dusit
    Xie, Shengli
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (12) : 18017 - 18030
  • [16] Re-training and parameter sharing with the Hash trick for compressing convolutional neural networks
    Gou, Xu
    Qing, Linbo
    Wang, Yi
    Xin, Mulin
    Wang, Xianmin
    APPLIED SOFT COMPUTING, 2020, 97
  • [17] Automatic Selection of Tensor Decomposition for Compressing Convolutional Neural Networks A Case Study on VGG-type Networks
    Liang, Chia-Chun
    Lee, Che-Rung
    2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 770 - 778
  • [18] DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems with a Compressor-Critic Framework
    Yao, Shuochao
    Zhao, Yiran
    Zhang, Aston
    Su, Lu
    Abdelzaher, Tarek
    PROCEEDINGS OF THE 15TH ACM CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS (SENSYS'17), 2017,
  • [19] A survey of model compression for deep neural networks
    Li J.-Y.
    Zhao Y.-K.
    Xue Z.-E.
    Cai Z.
    Li Q.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2019, 41 (10): : 1229 - 1239
  • [20] Review of Lightweight Deep Convolutional Neural Networks
    Chen, Fanghui
    Li, Shouliang
    Han, Jiale
    Ren, Fengyuan
    Yang, Zhen
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (04) : 1915 - 1937