On the integration of time-frequency masking speech separation and recognition in underdetermined environments

被引:0
|
作者
Jafari, Ingrid [1 ]
Haque, Serajul [1 ]
Togneri, Roberto [1 ]
Nordholm, Sven [2 ]
机构
[1] Univ Western Australia, Crawley, WA 6009, Australia
[2] Curtin Univ, Perth, WA 6845, Australia
基金
澳大利亚研究理事会;
关键词
SPARSE SOURCE SEPARATION; BLIND;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The successful application of automatic speech recognition systems in the real world is conditional on its ability to handle realistic environments with unfavorable conditions such as reverberation and multiple sources of inteference. Previous research has identified time-frequency masking based approaches to blind source separation as a viable approach for multisource reverberant source separation. It is proposed the use of such separation techniques as a front-end to speech recognition will encourage greater recognition accuracy. Experimental evaluations confirmed the hypothesis with an improvement in recognition accuracy of over 20% at a reverberation time of RT60 = 300ms; this is indicative of the potential for future research in this field.
引用
收藏
页码:1613 / 1617
页数:5
相关论文
共 50 条
  • [1] Evaluations on underdetermined blind source separation in adverse environments using time-frequency masking
    Jafari, Ingrid
    Haque, Serajul
    Togneri, Roberto
    Nordholm, Sven
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [2] Evaluations on underdetermined blind source separation in adverse environments using time-frequency masking
    Ingrid Jafari
    Serajul Haque
    Roberto Togneri
    Sven Nordholm
    EURASIP Journal on Advances in Signal Processing, 2013
  • [3] Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
    Reju, Vaninirappuputhenpurayil Gopalan
    Koh, Soo Ngee
    Soon, Ing Yann
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 101 - 116
  • [4] A TIME-FREQUENCY BLIND SEPARATION METHOD FOR UNDERDETERMINED SPEECH MIXTURES
    Lv Yao Li Shuangtian(Institute of Acoustics
    JournalofElectronics(China), 2008, (05) : 702 - 708
  • [5] Robust speech separation using time-frequency masking
    Aarabi, P
    Shi, GJ
    Jahromi, O
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 741 - 744
  • [6] Blind separation of underdetermined Convolutive speech mixtures by time-frequency masking with the reduction of musical noise of separated signals
    Zohrevandi, Mahbanou
    Setayeshi, Saeed
    Rabiee, Azam
    Reshadi, Midia
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12601 - 12618
  • [7] A NEW TIME-FREQUENCY APPROACH FOR UNDERDETERMINED CONVOLUTIVE BLIND SPEECH SEPARATION
    Bouafif, Mariem
    Lachiri, Zied
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 3226 - 3230
  • [8] Blind separation of speech mixtures via time-frequency masking
    Yilmaz, Ö
    Rickard, S
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) : 1830 - 1847
  • [9] Underdetermined Blind Source Separation using Binary Time-Frequency Masking with Variable Frequency Resolution
    Anandkumar, Amod J. G.
    Ghosh, Aneesh T. A.
    Damodaram, B. Teja
    David, Sumam S.
    2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 949 - 954
  • [10] Time-Frequency Masking For Large Scale Robust Speech Recognition
    Wang, Yuxuan
    Misra, Ananya
    Chine, Kean K.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2469 - 2473