Missing-feature approaches in speech recognition

Cited by: 152
Authors
Raj, B
Stern, RM
Affiliations
[1] Mitsubishi Electric Research Laboratories, Cambridge, MA
[2] Carnegie Mellon University, Electrical and Computer Engineering Department, Language Technologies Institute, Pittsburgh, PA
Funding
National Science Foundation (USA)
DOI
10.1109/MSP.2005.1511828
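Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract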
Approaches that have been conspicuously effective in ameliorating the effects of transient maskers are reviewed. The focus is on two broad classes of missing-feature algorithms, namely feature-vector imputation algorithms and classifier-modification algorithms. The mathematics of four major missing-feature techniques is reviewed, including the feature-imputation techniques of cluster-based reconstruction and covariance-based reconstruction, and the classifier-modification methods of class-conditional imputation and marginalization. The difficult task of estimating the spectrographic masks is also outlined.
Pages: 101-116
Page count: 16
Related papers
36 entries in total
[1]  
AHMED S, 1993, ADV NEURAL INFORMATI, P393
[2]  
[Anonymous], 1996, THESIS CARNEGIE MELL
[3]  
[Anonymous], 2000, ICSLP 2000
[4]   EFFECTIVENESS OF LINEAR PREDICTION CHARACTERISTICS OF SPEECH WAVE FOR AUTOMATIC SPEAKER IDENTIFICATION AND VERIFICATION [J].
ATAL, BS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (06) :1304-1312
[5]  
BARKER J, 2001, P WORKSH CONS REL AC
[6]  
BARKER J, 2001, P EUR 2001 ESCA, P213
[7]   Decoding speech in the presence of other sources [J].
Barker, JP ;
Cooke, MP ;
Ellis, DPW .
SPEECH COMMUNICATION, 2005, 45 (01) :5-25
[8]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[9]   The auditory organization of speech and other sources in listeners and computational models [J].
Cooke, M ;
Ellis, DPW .
SPEECH COMMUNICATION, 2001, 35 (3-4) :141-177
[10]   Robust automatic speech recognition with missing and unreliable acoustic data [J].
Cooke, M ;
Green, P ;
Josifovski, L ;
Vizinho, A .
SPEECH COMMUNICATION, 2001, 34 (03) :267-285