Trends in audio signal feature extraction methods

被引：223

作者：

Sharma, Garima ^{[1
]}

Umapathy, Kartikeyan ^{[1
]}

Krishnan, Sridhar ^{[1
]}

机构：

[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada

来源：

APPLIED ACOUSTICS | 2020年 / 158卷

关键词：

Audio; Speech; Signal; Feature extraction; Survey; Machine learning; SPECTRAL-ANALYSIS; SPEECH ANALYSIS; CLASSIFICATION; TIME; RECOGNITION; MUSIC; BINARY; DISCRIMINATION; PREDICTION; RETRIEVAL;

D O I：

10.1016/j.apacoust.2019.107020

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Audio signal processing algorithms generally involves analysis of signal, extracting its properties, predicting its behaviour, recognizing if any pattern is present in the signal, and how a particular signal is correlated to another similar signals. Audio signal includes music, speech and environmental sounds. Over the last few decades, audio signal processing has grown significantly in terms of signal analysis and classification. And it has been proven that solutions of many existing issues can be solved by integrating the modern machine learning (ML) algorithms with the audio signal processing techniques. The performance of any ML algorithm depends on the features on which the training and testing is done. Hence feature extraction is one of the most vital part of a machine learning process. The aim of this study is to summarize the literature of the audio signal processing specially focusing on the feature extraction techniques. In this survey the temporal domain, frequency domain, cepstral domain, wavelet domain and time-frequency domain features are discussed in detail. (C) 2019 Elsevier Ltd. All rights reserved.

引用

页数：21

共 143 条

[1] Spectrotemporal Analysis Using Local Binary Pattern Variants for Acoustic Scene Classification
Abidin, Shamsiah
Togneri, Roberto
Sohel, Ferdous
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (11) : 2112 - 2121
[2] Fall detection through acoustic Local Ternary Patterns
Adnan, Syed M.
Irtaza, Aun
Aziz, Sumair
Ullah, M. Obaid
Javed, Ali
Mahmood, Muhammad Tariq
[J]. APPLIED ACOUSTICS, 2018, 140 : 296 - 300
[3] Musical instrument timbres classification with spectral features
Agostini, G
Longari, M
Pollastri, E
[J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (01) : 5 - 14
[4] Ahrendt Peter, 2004, 2004 12th European Signal Processing Conference (EUSIPCO), P1293
[5] Al-Shoshan A. I., 2006, J. King Saud Univ. Eng. Sci, V19, P95, DOI [DOI 10.1016/S1018-3639(18)30850-X, 10.1016/S1018-3639(18)30850-X]
[6] ANDO Y, 2013, J ACOUST SOC AM, V133, P3292, DOI DOI 10.1121/1.4805418
[7] [Anonymous], 2016, SPEECH COMMUN, DOI DOI 10.1016/j.specom.2016.10.007
[8] [Anonymous], 1963, PROC S TIME SER ANAL
[9] [Anonymous], 2009, Proc. ACM International Confence on Multimedia
[10] [Anonymous], 2003, Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval SIGIR 03, DOI [DOI 10.1109/ISSPA.2003.1224828, DOI 10.1145/860484.860487, 10.1145/860435.860487, DOI 10.1145/860435.860487, 10.1109/ISSPA.2003.1224828]

← 1 2 3 4 5 6 7 8 9 10 →