Automatic Scene Recognition through Acoustic Classification for Behavioral Robotics

被引:27
作者
Aziz, Sumair [1 ]
Awais, Muhammad [2 ]
Akram, Tallha [2 ]
Khan, Umar [1 ]
Alhussein, Musaed [3 ]
Aurangzeb, Khursheed [3 ]
机构
[1] Univ Engn & Technol Taxila, Dept Elect Engn, Taxila 47080, Pakistan
[2] COMSATS Univ Islamabad, Dept Elect & Comp Engn, Wah Campus, Wah Cantt 47040, Pakistan
[3] King Saud Univ, Coll Comp & Informat Sci, Comp Engn Dept, Riyadh 11543, Saudi Arabia
关键词
feature extraction; sound classification; support vector machine; sound processing; robotics; MFCC; ENVIRONMENTAL SOUND CLASSIFICATION;
D O I
10.3390/electronics8050483
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classification of complex acoustic scenes under real time scenarios is an active domain which has engaged several researchers lately form the machine learning community. A variety of techniques have been proposed for acoustic patterns or scene classification including natural soundscapes such as rain/thunder, and urban soundscapes such as restaurants/streets, etc. In this work, we present a framework for automatic acoustic classification for behavioral robotics. Motivated by several texture classification algorithms used in computer vision, a modified feature descriptor for sound is proposed which incorporates a combination of 1-D local ternary patterns (1D-LTP) and baseline method Mel-frequency cepstral coefficients (MFCC). The extracted feature vector is later classified using a multi-class support vector machine (SVM), which is selected as a base classifier. The proposed method is validated on two standard benchmark datasets i.e., DCASE and RWCP and achieves accuracies of <mml:semantics>97.38%</mml:semantics> and <mml:semantics>94.10%</mml:semantics>, respectively. A comparative analysis demonstrates that the proposed scheme performs exceptionally well compared to other feature descriptors.
引用
收藏
页数:17
相关论文
共 46 条
[1]   Fall detection through acoustic Local Ternary Patterns [J].
Adnan, Syed M. ;
Irtaza, Aun ;
Aziz, Sumair ;
Ullah, M. Obaid ;
Javed, Ali ;
Mahmood, Muhammad Tariq .
APPLIED ACOUSTICS, 2018, 140 :296-300
[2]  
Amarappa S., 2014, International Journal of Electronics and Computer Science Engineering, V3, P435
[3]  
[Anonymous], 2012, P 26 ANN C NEUR INF
[4]  
[Anonymous], 2000, P 2 INT C LANG RES E
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
[Anonymous], PATTERN RECOGNITION
[7]  
[Anonymous], 2013, IEEE Workshop on Application of Signal Processing to Audio and Features, DOI [10.1109/WASPAA.2013.6701857, DOI 10.1109/WASPAA.2013.6701857]
[8]  
[Anonymous], P 10 INT WORKSH FRON
[9]  
[Anonymous], P INT WORKSH SOC LEA
[10]   Environmental sound classification with dilated convolutions [J].
Chen, Yan ;
Guo, Qian ;
Liang, Xinyan ;
Wang, Jiang ;
Qian, Yuhua .
APPLIED ACOUSTICS, 2019, 148 :123-132