A Reassigned Front-End for Speech Recognition

Cited by: 0
Authors
Tryfou, Georgina [1]
Omologo, Maurizio [1]
Affiliations
[1] Fdn Bruno Kessler, Via Sommarive 18, Trento, Italy
Source
2017 25th European Signal Processing Conference (EUSIPCO) | 2017
Keywords
TIME-FREQUENCY; REPRESENTATIONS; SCALE
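DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract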
This paper introduces the use of the TFRCC features, a time-frequency reassigned feature set, as a front-end for speech recognition. Compared to the power spectrogram, the time-frequency reassigned version is particularly helpful in describing the temporal and spectral features of speech signals simultaneously, as it offers an improved visualization of the various components. This property is exploited by the cepstral reassigned features, which are incorporated in a state-of-the-art speech recognizer. Experiments investigate the proposed features in various scenarios, starting from recognition of close-talk signals and gradually increasing the complexity of the task. The results show that these features outperform an MFCC baseline.
Pages: 553-557
Page count: 5