Environmental sound recognition: a survey

Cited by: 65
Authors
Chachada, Sachin [1]
Kuo, C.-C. Jay [1]
Affiliations
[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90089 USA
Keywords
environmental sound recognition; audio signal processing; feature extraction; nonstationary ESR techniques; environmental sound processing schemes; signal spectral characteristics; signal temporal characteristics
DOI
10.1017/ATSIP.2014.12
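Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract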
Although research in audio recognition has traditionally focused on speech and music signals, the problem of environmental sound recognition (ESR) has received growing attention, and research on ESR has increased significantly in the past decade. Recent work has focused on the appraisal of non-stationary aspects of environmental sounds, and several new features predicated on non-stationary characteristics have been proposed. These features strive to maximize their information content pertaining to a signal's temporal and spectral characteristics. Furthermore, sequential learning methods have been used to capture the long-term variation of environmental sounds. In this survey, we offer a qualitative and elucidatory overview of recent developments. It includes four parts: (i) basic environmental sound-processing schemes, (ii) stationary ESR techniques, (iii) non-stationary ESR techniques, and (iv) performance comparison of selected methods. Finally, concluding remarks and future research and development trends in the ESR field are given.
Pages: 15