On the integration of time-frequency masking speech separation and recognition in underdetermined environments

被引：0

作者：

Jafari, Ingrid ^{[1
]}

Haque, Serajul ^{[1
]}

Togneri, Roberto ^{[1
]}

Nordholm, Sven ^{[2
]}

机构：

[1] Univ Western Australia, Crawley, WA 6009, Australia

[2] Curtin Univ, Perth, WA 6845, Australia

来源：

2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR) | 2012年

基金：

澳大利亚研究理事会;

关键词：

SPARSE SOURCE SEPARATION; BLIND;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The successful application of automatic speech recognition systems in the real world is conditional on its ability to handle realistic environments with unfavorable conditions such as reverberation and multiple sources of inteference. Previous research has identified time-frequency masking based approaches to blind source separation as a viable approach for multisource reverberant source separation. It is proposed the use of such separation techniques as a front-end to speech recognition will encourage greater recognition accuracy. Experimental evaluations confirmed the hypothesis with an improvement in recognition accuracy of over 20% at a reverberation time of RT60 = 300ms; this is indicative of the potential for future research in this field.

引用

页码：1613 / 1617

页数：5

共 50 条

[1] Evaluations on underdetermined blind source separation in adverse environments using time-frequency masking
Jafari, Ingrid
Haque, Serajul
Togneri, Roberto
Nordholm, Sven
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
[2] Evaluations on underdetermined blind source separation in adverse environments using time-frequency masking
Ingrid Jafari
Serajul Haque
Roberto Togneri
Sven Nordholm
EURASIP Journal on Advances in Signal Processing, 2013
[3] Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reju, Vaninirappuputhenpurayil Gopalan
Koh, Soo Ngee
Soon, Ing Yann
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 101 - 116
[4] A TIME-FREQUENCY BLIND SEPARATION METHOD FOR UNDERDETERMINED SPEECH MIXTURES
Lv Yao Li Shuangtian(Institute of Acoustics
JournalofElectronics(China), 2008, (05) : 702 - 708
[5] Robust speech separation using time-frequency masking
Aarabi, P
Shi, GJ
Jahromi, O
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 741 - 744
[6] Blind separation of underdetermined Convolutive speech mixtures by time-frequency masking with the reduction of musical noise of separated signals
Zohrevandi, Mahbanou
Setayeshi, Saeed
Rabiee, Azam
Reshadi, Midia
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12601 - 12618
[7] A NEW TIME-FREQUENCY APPROACH FOR UNDERDETERMINED CONVOLUTIVE BLIND SPEECH SEPARATION
Bouafif, Mariem
Lachiri, Zied
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 3226 - 3230
[8] Blind separation of speech mixtures via time-frequency masking
Yilmaz, Ö
Rickard, S
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) : 1830 - 1847
[9] Underdetermined Blind Source Separation using Binary Time-Frequency Masking with Variable Frequency Resolution
Anandkumar, Amod J. G.
Ghosh, Aneesh T. A.
Damodaram, B. Teja
David, Sumam S.
2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 949 - 954
[10] Time-Frequency Masking For Large Scale Robust Speech Recognition
Wang, Yuxuan
Misra, Ananya
Chine, Kean K.
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2469 - 2473

← 1 2 3 4 5 →