Multi-class SVM for stressed speech recognition

Cited by: 0
Authors
Besbes, Salsabil [1]
Lachiri, Zied [2]
Affiliations
[1] Univ Tunis El Manar, Natl Sch Engineers Tunis, Signal Image & Informat Technol Lab, BP 37 Le Belvédère, Tunis 1002, Tunisia
[2] Univ Tunis El Manar, Natl Sch Engineers Tunis, BP 37 Le Belvédère, Tunis 1002, Tunisia
Source
2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP) | 2016
Keywords
speech recognition; multi-class support vector machines; stressed context; SUSAS database; GFCC;
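DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract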
This paper presents a new automatic stressed speech recognition system based on kernel classification. We extract advanced acoustic features from the stressed signals and employ multi-class Support Vector Machines with different kernels to recognize speech utterances under stress. Gammatone Frequency Cepstral Coefficients (GFCC) are also considered. The implemented system is tested on isolated words from the SUSAS database with four classes: Neutral, Angry, Lombard, and Loud. Experimental results show that the best performance is obtained when the auditory feature is combined with different descriptors, although the outcome depends on the type of kernel used.
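To make the two ideas in the abstract concrete (kernel choice and multi-class decisions over four stress classes), the following is a minimal sketch, not the authors' implementation. It assumes three common SVM kernels (linear, polynomial, RBF) and a one-vs-one voting scheme. The abstract does not say which kernels or which multi-class decomposition were used, so the kernel parameters, the function names, and the `pairwise_winner` callback (a stand-in for trained binary SVMs) are all hypothetical.

```python
import math
from collections import Counter
from itertools import combinations

# The four SUSAS classes named in the abstract.
CLASSES = ["Neutral", "Angry", "Lombard", "Loud"]


def linear_kernel(x, y):
    """Plain dot product between two feature vectors."""
    return sum(a * b for a, b in zip(x, y))


def polynomial_kernel(x, y, degree=2, coef0=1.0):
    """Polynomial kernel; degree and coef0 are illustrative defaults."""
    return (linear_kernel(x, y) + coef0) ** degree


def rbf_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel; gamma is an illustrative default."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def one_vs_one_vote(pairwise_winner, classes=CLASSES):
    """Combine binary decisions into one label by majority vote.

    pairwise_winner(a, b) is a hypothetical stand-in for a trained
    binary SVM that returns whichever of the two labels it prefers.
    With 4 classes this queries 6 binary classifiers.
    """
    votes = Counter(pairwise_winner(a, b) for a, b in combinations(classes, 2))
    return votes.most_common(1)[0][0]
```

In a real system each feature vector would hold acoustic descriptors such as GFCCs, and each pairwise decision would come from an SVM trained with one of the kernels above; this sketch only shows how the pieces fit together.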
Pages: 782-787
Page count: 6