Acoustic event recognition using cochleagram image and convolutional neural networks

被引:41
|
作者
Sharan, Roneel V. [1 ]
Moir, Tom J. [2 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Auckland Univ Technol, Sch Engn, Private Bag 92006, Auckland 1142, New Zealand
关键词
Acoustic event recognition; Cochleagram; Convolutional neural network; Mel-spectrogram; Spectrogram; FEATURES; CLASSIFICATION;
D O I
10.1016/j.apacoust.2018.12.006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Convolutional neural networks (CNN) have produced encouraging results in image classification tasks and have been increasingly adopted in audio classification applications. However, in using CNN for acoustic event recognition, the first hurdle is finding the best image representation of an audio signal. In this work, we evaluate the performance of four time-frequency representations for use with CNN. Firstly, we consider the conventional spectrogram image. Secondly, we apply moving average to the spectrogram along the frequency domain to obtain what we refer as the smoothed spectrogram. Thirdly, we use the mel-spectrogram which utilizes the mel-filter, as in mel-frequency cepstral coefficients. Finally, we propose the use of a cochleagram image the frequency components of which are based on the frequency selectivity property of the human cochlea. We test the proposed techniques on an acoustic event database containing 50 sound classes. The results show that the proposed cochleagram time-frequency image representation gives the best classification performance when used with CNN. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:62 / 66
页数:5
相关论文
共 50 条
  • [41] Speech Emotion Recognition Using Convolution Neural Networks and Multi-Head Convolutional Transformer
    Ullah, Rizwan
    Asif, Muhammad
    Shah, Wahab Ali
    Anjam, Fakhar
    Ullah, Ibrar
    Khurshaid, Tahir
    Wuttisittikulkij, Lunchakorn
    Shah, Shashi
    Ali, Syed Mansoor
    Alibakhshikenari, Mohammad
    SENSORS, 2023, 23 (13)
  • [42] Semantic image segmentation for sea ice parameters recognition using deep convolutional neural networks
    Zhang, Chengqian
    Chen, Xiaodong
    Ji, Shunying
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 112
  • [43] Electroencephalography Image Classification Using Convolutional Neural Networks
    Galety, Mohammad Gouse
    Al-Mukhtar, Firas
    Rofoo, Fanar
    Sriharsha, A., V
    Maaroof, Rebaz
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INNOVATIONS IN COMPUTING RESEARCH (ICR'22), 2022, 1431 : 42 - 52
  • [44] Image Augmentation-Based Food Recognition with Convolutional Neural Networks
    Pan, Lili
    Qin, Jiaohua
    Chen, Hao
    Xiang, Xuyu
    Li, Cong
    Chen, Ran
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 59 (01): : 297 - 313
  • [45] Recognition of Furniture Wood Image Species Based on Convolutional Neural Networks
    Miao Y.
    Zhu S.
    Pu J.
    Li J.
    Ma L.
    Huang H.
    Linye Kexue/Scientia Silvae Sinicae, 2023, 59 (08): : 133 - 140
  • [46] Trademark Image Retrieval Using a Combination of Deep Convolutional Neural Networks
    Perez, Claudio A.
    Estevez, Pablo A.
    Galdames, Francisco J.
    Schulz, Daniel A.
    Perez, Juan P.
    Bastias, Diego
    Vilar, Daniel R.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [47] Deep Tessellated Retinal Image Detection using Convolutional Neural Networks
    Lyu, Xingzheng
    Li, Hai
    Zhen, Yi
    Ji, Xin
    Zhang, Sanyuan
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 676 - 680
  • [48] RECOGNITION OF ACOUSTIC EVENTS USING DEEP NEURAL NETWORKS
    Gencoglu, Oguzhan
    Virtanen, Tuomas
    Huttunen, Heikki
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 506 - 510
  • [49] Skeletal Maturity Recognition Using a Fully Automated System With Convolutional Neural Networks
    Wang, Shuqiang
    Shen, Yanyan
    Shi, Changhong
    Yin, Peng
    Wang, Zuhui
    Cheung, Prudence Wing-Hang
    Cheung, Jason Pui Yin
    Luk, Keith Dip-Kei
    Hu, Yong
    IEEE ACCESS, 2018, 6 : 29979 - 29993
  • [50] Emotion recognition by assisted learning with convolutional neural networks
    He, Xuanyu
    Zhang, Wei
    NEUROCOMPUTING, 2018, 291 : 187 - 194