Multi-LCNN: A Hybrid Neural Network based on Integrated Time-Frequency Characteristics for Acoustic Scene Classification

被引:3
作者
Lei, Jin [1 ]
Wang, Changjian [1 ]
Zhu, Boqing [1 ]
Huang, Zhen [1 ]
Peng, Yuxing [1 ]
Lv, Qin [2 ]
机构
[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Lab, Changsha, Hunan, Peoples R China
[2] Univ Colorado Boulder, Dept Comp Sci, Boulder, CO 80309 USA
来源
2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) | 2018年
关键词
Acoustic Scene Classification; Time-Frequency Characteristics; Deep Learning;
D O I
10.1109/ICTAI.2018.00019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acoustic scene classification (ASC) is an important task in audio signal processing and can be useful in many real-world applications. Recently, several deep neural network models have been proposed for ASC, such as LSTMs based on temporal analysis and CNNs based on frequency spectrum, as well as hybrid models of LSTM and CNN to further improve classification performance. However, existing hybrid models fail to properly preserve the temporal information when transferring data between different models. In this work, we first analyze the cause of such temporal information loss. We then propose Multi-LCNN, a new hybrid model with two important mechanisms: (1) a LCNN architecture to effectively preserve temporal information; and (2) a multi-channel feature fusion mechanism (MCFF) that combines enhanced temporal information and frequency spectrogram information to learn highly integrated and discriminative features for ASC. Evaluations on the TUT ASC 2016 dataset show that our model can achieve an improvement of 10.23% over the baseline method, and is currently the best performing end-to-end model on this dataset.
引用
收藏
页码:52 / 59
页数:8
相关论文
共 32 条
[1]  
Allegro Sylvia, 2005, US Patent, Patent No. [6,910,013, 6910013]
[2]  
[Anonymous], ARXIV160702857
[3]  
[Anonymous], IEEE AASP CHALLENGE
[4]  
[Anonymous], 2015, IEEE INT WORKS MACH
[5]  
[Anonymous], 2016, FEATURE LEARNING RAW
[6]  
[Anonymous], 1997, Neural Computation
[7]  
[Anonymous], BIGLEARN NIPS WORKSH
[8]  
[Anonymous], 2013, 2013 IEEE WORKSHOP A, DOI DOI 10.1109/WASPAA.2013.6701857
[9]  
[Anonymous], 2016, IEEE AASP CHALLENGE
[10]  
Bae SH, 2016, DCASE, P11