MULTI-LEVEL AUDIO CLASSIFICATION ARCHITECTURE

被引:0
作者
Vavrek, Jozef [1 ]
Juhar, Jozef [1 ]
机构
[1] Tech Univ Kosice, Fac Elect Engn & Informat, Dept Elect & Multimedia Telecommun, Pk Komenskeho 13, Kosice 04200, Slovakia
关键词
Audio data classification; binary discrimination architecture; support vector machine;
D O I
10.15598/aeee.v13i4.1454
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A multi-level classification architecture for solving binary discrimination problem is proposed in this paper. The main idea of proposed solution is derived from the fact that solving one binary discrimination problem multiple times can reduce the overall miss-classification error. We aimed our effort towards building the classification architecture employing the combination of multiple binary SVM (Support Vector Machine) classifiers for solving two-class discrimination problem. Therefore, we developed a binary discrimination architecture employing the SVM classifier (BDASVM) with intention to use it for classification of broadcast news (BN) audio data. The fundamental element of BDASVM is the binary decision (BD) algorithm that performs discrimination between each pair of acoustic classes utilizing decision function modeled by separating hyperplane. The overall classification accuracy is conditioned by finding the optimal parameters for discrimination function resulting in higher computational complexity. The final form of proposed BDASVM is created by combining four BDSVM discriminators supplemented by decision table. Experimental results show that the proposed classification architecture can decrease the overall classification error in comparison with binary decision trees SVM (BDTSVM) architecture.
引用
收藏
页码:310 / 315
页数:6
相关论文
共 15 条
[1]  
Abe S, 2010, ADV PATTERN RECOGNIT, P443, DOI 10.1007/978-1-84996-098-4
[2]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[3]   Mixed type audio classification with Support Vector Machine [J].
Chen, Lei ;
Gunduz, Sule ;
Ozsu, M. Tamer .
2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, :781-+
[4]   Classification of audio signals using SVM and RBFNN [J].
Dhanalakshmi, P. ;
Palanivel, S. ;
Ramalingam, V. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :6069-6075
[5]   An introduction to ROC analysis [J].
Fawcett, Tom .
PATTERN RECOGNITION LETTERS, 2006, 27 (08) :861-874
[6]  
Hsu C. W., 2003, TECHNICAL REPORT
[7]   Online Speech/Music Segmentation Based on the Variance Mean of Filter Bank Energy [J].
Kos, Marko ;
Grasic, Matej ;
Kacic, Zdravko .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009,
[8]   A Decision-Tree-Based Algorithm for Speech/Music Classification and Segmentation [J].
Lavner, Yizhar ;
Ruinskiy, Dima .
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2009,
[9]   Content-based audio classification and segmentation by using support vector machines [J].
Lu, L ;
Zhang, HJ ;
Li, SZ .
MULTIMEDIA SYSTEMS, 2003, 8 (06) :482-491
[10]  
Mahale PMB, 2008, PROCEEDINGS OF THE 40TH SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY, P198