Nonnegative features of spectro-temporal sounds for classification

被引:47
|
作者
Cho, YC [1 ]
Choi, SJ [1 ]
机构
[1] Pohang Univ Sci & Technol, Dept Comp Sci, Pohang 790784, South Korea
关键词
acoustic feature extraction; general sound recognition; nonnegative matrix factorization;
D O I
10.1016/j.patrec.2004.11.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A parts-based representation is a way of understanding object recognition in the brain. The nonnegative matrix factorization (NMF) is an algorithm which is able to learn a parts-based representation by allowing only non-subtractive combinations [Lee, D.D., Seung, H.S., 1999. Learning the parts of objects by non-negative matrix factorization. Nature 401, 788-791]. In this paper we incorporate a parts-based representation of spectro-temporal sounds into the acoustic feature extraction, which leads to nonnegative features. We present a method of inferring encoding variables in the framework of NMF and show that the method produces robust acoustic features in the presence of noise in the task of general sound classification.. Experimental results confirm that the proposed feature extraction method improves the classification performance, especially in the presence of noise, compared to independent component analysis (ICA) which produces holistic features. (c) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:1327 / 1336
页数:10
相关论文
共 50 条
  • [1] Spectro-temporal features for environmental sound classification
    Thwe, Khine Zar
    Thaw, Mie Mie
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (02) : 179 - 189
  • [2] Spectro-temporal features applied to the automatic classification of volcanic seismic events
    Soto, Ricardo
    Huenupan, Fernando
    Meza, Pablo
    Curilem, Millaray
    Franco, Luis
    JOURNAL OF VOLCANOLOGY AND GEOTHERMAL RESEARCH, 2018, 358 : 194 - 206
  • [3] Development of spectro-temporal features of speech in children
    Gautam S.
    Singh L.
    Gautam, Sumanlata (suman.gautam82@gmail.com), 1600, Springer Science and Business Media, LLC (20): : 543 - 551
  • [4] SPECTRO-TEMPORAL GABOR FEATURES FOR SPEAKER RECOGNITION
    Lei, Howard
    Meyer, Bernd T.
    Mirghafori, Nikki
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4241 - 4244
  • [5] FFT-BASED SPECTRO-TEMPORAL ANALYSIS AND SYSTNESIS OF SOUNDS
    Hsu, Chung-Chien
    Lin, Ting-Han
    Chi, Tai-Shih
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5388 - 5391
  • [6] Spectro-Temporal Features for Howling Frequency Detection
    Lee, Jae-Won
    Choi, Seung Ho
    COMPUTER APPLICATIONS FOR WEB, HUMAN COMPUTER INTERACTION, SIGNAL AND IMAGE PROCESSING AND PATTERN RECOGNITION, 2012, 342 : 25 - +
  • [7] CLASSIFICATION OF HUMAN COUGH SIGNALS USING SPECTRO-TEMPORAL GABOR FILTERBANK FEATURES
    Schroeder, Jens
    Anemueller, Joern
    Goetze, Stefan
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6455 - 6459
  • [8] A Closer Look on Hierarchical Spectro-Temporal Features (HIST)
    Heckmann, Martin
    Domont, Xavier
    Joublin, Frank
    Goerick, Christian
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 894 - 897
  • [9] Learning spectro-temporal representations of complex sounds with parameterized neural networksa)
    Riad, Rachid
    Karadayi, Julien
    Bachoud-Levi, Anne-Catherine
    Dupoux, Emmanuel
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (01): : 353 - 366
  • [10] Analysis of the four heart sounds statistical study and spectro-temporal characteristics
    Debbal S.M.E.A.
    Debbal, Sidi Mohammed El Amine, 1600, Taylor and Francis Ltd. (44): : 396 - 410