MG-ViT: A Multi-Granularity Method for Compact and Efficient Vision Transformers

被引:0
|
作者
Zhang, Yu [1 ]
Liu, Yepeng [2 ]
Miao, Duoqian [1 ]
Zhang, Qi [1 ]
Shi, Yiwei [3 ]
Hu, Liang [1 ]
机构
[1] Tongji Univ, Shanghai, Peoples R China
[2] Univ Florida, Gainesville, FL 32611 USA
[3] Univ Bristol, Bristol BS81TH, Avon, England
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision Transformer (ViT) faces obstacles in wide application due to its huge computational cost. Almost all existing studies on compressing ViT adopt the manner of splitting an image with a single granularity, with very few exploration of splitting an image with multi-granularity. As we know, important information often randomly concentrate in few regions of an image, necessitating multi-granularity attention allocation to an image. Enlightened by this, we introduce the multi-granularity strategy to compress ViT, which is simple but effective. We propose a two-stage multi-granularity framework, MG-ViT, to balance ViT's performance and computational cost. In single-granularity inference stage, an input image is split into a small number of patches for simple inference. If necessary, multi-granularity inference stage will be instigated, where the important patches are further subsplit into multi-finer-grained patches for subsequent inference. Moreover, prior studies on compression only for classification, while we extend the multi-granularity strategy to hierarchical ViT for downstream tasks such as detection and segmentation. Extensive experiments Prove the effectiveness of the multi-granularity strategy. For instance, on ImageNet, without any loss of performance, MG-ViT reduces 47% FLOPs of LV-ViT-S and 56% FLOPs of DeiT-S.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Zoom method for association rules in multi-granularity formal context
    Lihui Niu
    Jusheng Mi
    Yuzhang Bai
    Zhongling Li
    Meizheng Li
    Soft Computing, 2025, 29 (2) : 613 - 627
  • [22] Multi-granularity hazard detection method for electrical power system
    Xu X.
    Qian P.
    Wang Y.
    Zhou X.
    Xu H.
    Xu L.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2021, 47 (03): : 520 - 530
  • [23] Study on fusion method of multi-granularity linguistic term sets
    School of Business Administration, Northeastern University, Shenyang 110004, China
    不详
    Dongbei Daxue Xuebao, 2007, 11 (1669-1672):
  • [24] A Multi-granularity Decision Fusion Method Based on Category Hierarchy
    Mi, Jian-Xun
    Huang, Ke-Yang
    Li, Nuo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 149 - 161
  • [25] A Multi-Granularity FPGA with Hierarchical Interconnects for Efficient and Flexible Mobile Computing
    Wang, Cheng C.
    Yuan, Fang-Li
    Yu, Tsung-Han
    Markovic, Dejan
    2014 IEEE INTERNATIONAL SOLID-STATE CIRCUITS CONFERENCE DIGEST OF TECHNICAL PAPERS (ISSCC), 2014, 57 : 460 - +
  • [26] Efficient Service-oriented Encapsulation of Multi-granularity Heterogeneous Resources
    You, Kun
    Xun, Zhide
    Ding, Feng
    2015 IEEE THIRD INTERNATIONAL CONFERENCE ON MOBILE SERVICES MS 2015, 2015, : 376 - 382
  • [27] Multi-granularity vision transformer via semantic token for hyperspectral image classification
    Li, Bin
    Ouyang, Er
    Hu, Wenjing
    Zhang, Guoyun
    Zhao, Lin
    Wu, Jianhui
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (17) : 6538 - 6560
  • [28] Efficient multi-granularity network for fine-grained image classification
    Jiabao Wang
    Yang Li
    Hang Li
    Xun Zhao
    Rui Zhang
    Zhuang Miao
    Journal of Real-Time Image Processing, 2022, 19 : 853 - 866
  • [29] A multi-granularity NC program optimization approach for energy efficient machining
    Li, X. X.
    Li, W. D.
    He, F. Z.
    ADVANCES IN ENGINEERING SOFTWARE, 2018, 115 : 75 - 86
  • [30] Efficient multi-granularity network for fine-grained image classification
    Wang, Jiabao
    Li, Yang
    Li, Hang
    Zhao, Xun
    Zhang, Rui
    Miao, Zhuang
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2022, 19 (05) : 853 - 866