A Reassigned Front-End for Speech Recognition

Cited by: 0

Authors:

Tryfou, Georgina [1]

Omologo, Maurizio [1]

Affiliation:

[1] Fondazione Bruno Kessler, Via Sommarive 18, Trento, Italy

Source:

2017 25th European Signal Processing Conference (EUSIPCO), 2017

Keywords:

TIME-FREQUENCY; REPRESENTATIONS; SCALE;

DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

This paper introduces TFRCC features, a time-frequency reassigned feature set, as a front-end for speech recognition. Compared to the power spectrogram, the time-frequency reassigned version is particularly helpful in describing the temporal and spectral features of speech signals simultaneously, as it offers an improved visualization of the various components. This property is exploited by the cepstral reassigned features, which are incorporated into a state-of-the-art speech recognizer. The experiments evaluate the proposed features in various scenarios, starting from the recognition of close-talk signals and gradually increasing the complexity of the task. The results show that these features outperform an MFCC baseline.

Pages: 553-557

Page count: 5
