COMPRESSING AUDIO CNNS WITH GRAPH CENTRALITY BASED FILTER PRUNING

被引:0
作者
King, James A. [1 ]
Singh, Arshdeep [1 ]
Plumbley, Mark D. [1 ]
机构
[1] Univ Surrey, CVSSP, Guildford, Surrey, England
来源
2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA | 2023年
基金
英国工程与自然科学研究理事会;
关键词
Convolutional Neural Network; Pruning; Audio classification; PANNs; DCASE;
D O I
10.1109/WASPAA58266.2023.10248103
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Convolutional neural networks (CNNs) are popular in high-performing solutions to many real-world problems, such as audio classification. CNNs have many parameters and filters, with some having a larger impact on the performance than others. This means that networks may contain many unnecessary filters, increasing a CNN's computation and memory requirements while providing limited performance benefits. To make CNNs more efficient, we propose a pruning framework that eliminates filters with the highest "commonality". We measure this commonality using the graph-theoretic concept of centrality. We hypothesise that a filter with a high centrality should be eliminated as it represents commonality and can be replaced by other filters without affecting the performance of a network much. An experimental evaluation of the proposed framework is performed on acoustic scene classification and audio tagging. On the DCASE 2021 Task 1A baseline network, our proposed method reduces computations per inference by 71% with 50% fewer parameters with less than a two percentage point drop in accuracy compared to the original network. For large-scale CNNs such as PANNs designed for audio tagging, our method reduces computations per inference by 24% with 41% fewer parameters at a slight improvement in performance.
引用
收藏
页数:5
相关论文
共 27 条
  • [1] Denton E, 2014, ADV NEUR IN, V27
  • [2] Gemmeke JF, 2017, INT CONF ACOUST SPEE, P776, DOI 10.1109/ICASSP.2017.7952261
  • [3] Han S, 2016, Arxiv, DOI [arXiv:1510.00149, DOI 10.48550/ARXIV.1510.00149]
  • [4] Han S, 2015, ADV NEUR IN, V28
  • [5] He Y, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2234
  • [6] Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
    He, Yang
    Liu, Ping
    Wang, Ziwei
    Hu, Zhilan
    Yang, Yi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4335 - 4344
  • [7] Heittola T, 2020, ACOUSTIC SCENE CLASS, P56
  • [8] Hu HY, 2016, Arxiv, DOI [arXiv:1607.03250, DOI 10.48550/ARXIV.1607.03250]
  • [9] PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
    Kong, Qiuqiang
    Cao, Yin
    Iqbal, Turab
    Wang, Yuxuan
    Wang, Wenwu
    Plumbley, Mark D.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2880 - 2894
  • [10] Lacoste A, 2019, Arxiv, DOI [arXiv:1910.09700, 10.48550/arXiv.1910.09700, DOI 10.48550/ARXIV.1910.09700]