Time-frequency distributions for automatic speech recognition

被引：36

作者：

Potamianos, A ^{[1
]}

Maragos, P ^{[1
]}

机构：

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001年 / 9卷 / 03期

基金：

美国国家科学基金会;

关键词：

speech analysis; speech processing; speech recognition; time-frequency analysis;

D O I：

10.1109/89.905994

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The use of general time-frequency distributions as features for automatic speech recognition (ASR) is discussed in the context of hidden Markov classifiers. Short-time averages of quadratic operators, e.g., energy spectrum, generalized first spectral moments, and short-time averages of the instantaneous frequency, are compared to the standard front end features, and applied to ASR. Theoretical and experimental results indicate a close relationship among these feature sets.

引用

页码：196 / 200

页数：5

共 50 条

[1] Time-frequency analysis and auditory modeling for automatic recognition of speech
Pitton, JW
Wang, KS
Juang, BH
PROCEEDINGS OF THE IEEE, 1996, 84 (09) : 1199 - 1215
[2] Weighting Time-Frequency Representation of Speech using Auditory Saliency for Automatic Speech Recognition
Cong-Thanh Do
Stylianou, Yannis
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1591 - 1595
[3] Optimizing time-frequency distributions for automatic classification
Atlas, L
Droppo, J
McLaughlin, J
ADVANCED SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS VII, 1997, 3162 : 161 - 171
[4] Robust Automatic Speech Recognition System Based on Using Adaptive Time-Frequency Masking
Gouda, Ahmed Mostafa
Tamazin, Mohamed
Khedr, Mohamed
PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2016, : 181 - 186
[5] Applications of Positive Time-Frequency Distributions to Speech Processing
Pitton, James W.
Atlas, Les E.
Loughlin, Patrick J.
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04): : 554 - 566
[6] Automatic feature-finding for time-frequency distributions
Owsley, L
McLaughlin, J
Bernard, G
PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS, 1996, : 333 - 336
[7] Speech recognition with localized time-frequency pattern detectors
Schutte, Ken
Glass, James
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 341 - 346
[8] TIME-FREQUENCY CONVOLUTIONAL NETWORKS FOR ROBUST SPEECH RECOGNITION
Mitra, Vikramjit
Franco, Horacio
2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 317 - 323
[9] TIME-FREQUENCY REASSIGNED FEATURES FOR AUTOMATIC CHORD RECOGNITION
Khadkevich, Maksim
Omologo, Maurizio
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 181 - 184
[10] Time-frequency representation based cepstral processing for speech recognition
Fineberg, AB
Yu, KC
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 25 - 28

← 1 2 3 4 5 →