An Adversarial Feature Distillation Method for Audio Classification

被引:13
作者
Gao, Liang [1 ]
Mi, Haibo [1 ]
Zhu, Boqing [1 ]
Feng, Dawei [1 ]
Li, Yicong [1 ]
Peng, Yuxing [1 ]
机构
[1] Natl Univ Def Technol, Natl Key Lab Parallel & Distributed Proc, Changsha 410073, Hunan, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Convolutional neural networks; audio tagging; knowledge distillation; model compression;
D O I
10.1109/ACCESS.2019.2931656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The audio classification task aims to discriminate between different audio signal types. In this task, deep neural networks have achieved better performance than the traditional shallow architecture-based machine-learning method. However, deep neural networks often require huge computational and storage requirements that hinder the deployment in embedded devices. In this paper, we proposed a distillation method which transfers knowledge from well-trained networks to a small network, and the method can compress model size while improving audio classification precision. The contributions of the proposed method are two folds: a multi-level feature distillation method was proposed and an adversarial learning strategy was employed to improve the knowledge transfer. The extensive experiments are conducted on three audio classification tasks, audio scene classification, general audio tagging, and speech command recognition. The experimental results demonstrate that: the small network can provide better performance while achieves the calculated amount of floating-point operations per second (FLOPS) compression ratio of 76:1 and parameters compression ratio of 3:1.
引用
收藏
页码:105319 / 105330
页数:12
相关论文
共 38 条
[1]  
Anil Rohan, 2018, P 6 INT C LEARN REPR
[2]  
[Anonymous], 2015, ICLR
[3]  
[Anonymous], 2014, P INTERSPEECH
[4]   MEAL: Multi-Model Ensemble via Adversarial Learning [J].
Shen, Zhiqiang ;
He, Zhankui ;
Xue, Xiangyang .
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, :4886-4893
[5]  
[Anonymous], 2016, P 24 ACM INT C MULT, DOI DOI 10.1145/2964284.2964310
[6]  
[Anonymous], 2008, Int. Symp. on Music Information Retrieval (ISMIR)
[7]  
[Anonymous], P INT SOC MUS INF RE
[8]  
[Anonymous], 2017, CoRR
[9]  
[Anonymous], ARXIV180403209
[10]  
Ba LJ, 2014, ADV NEUR IN, V27