Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion

Cited by: 3
Authors
Ondusko, Russell [1]
Marbach, Matthew [2]
Ramachandran, Ravi P. [3]
Head, Linda M. [3]
Affiliations
[1] Navsea, 9500 MacArthur Blvd, Bethesda, MD 20817 USA
[2] Lockheed Martin, 5600 W Sand Lake Rd, Orlando, FL 32819 USA
[3] Rowan Univ, Dept Elect & Comp Engn, 201 Mullica Hill Rd, Glassboro, NJ 08028 USA
Source
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2017 / Vol. 89 / No. 02
Funding
National Science Foundation (USA);
Keywords
Blind estimation; Linear predictive features; Vector quantizer classifier; Estimation combination; Overall average absolute error; RECOGNITION; ALGORITHM; FEATURES;
DOI
10.1007/s11265-016-1200-z
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
A blind approach for estimating the signal-to-noise ratio (SNR) of a speech signal corrupted by additive noise is proposed. The method follows a pattern recognition paradigm that uses several linear-predictive-based features, a vector quantizer classifier, and estimation combination. Blind SNR estimation is useful in speaker identification systems in which a confidence metric is determined along with the speaker identity. The confidence metric is partially based on the mismatch between the training and testing conditions of the speaker identification system, and SNR estimation is important for evaluating the degree of this mismatch. The aim is to correctly estimate SNR values from 0 to 30 dB, a range that is both practical and crucial for speaker identification systems. Experiments consider (1) artificially generated additive white Gaussian noise, pink noise, and bandpass noise and (2) fifteen noise types from the NOISEX database. Four features are combined to get the best results. The average SNR estimation error depends on the type of noise: a relatively low error results for pink noise and jet cockpit noise, and a high error results for destroyer operations room noise and military vehicle noise. For both artificially generated noise and the NOISEX data, the error is lower than that achieved by the IMCRA method, which uses SNR estimation for speech enhancement. Combining the four features with IMCRA lowers the error for 8 of the 15 noise types from NOISEX.
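The abstract names the building blocks (features, a vector quantizer classifier, estimation combination) but not their details, so the following is only a minimal sketch of how such a pipeline could look, not the authors' implementation. It assumes one codebook per candidate SNR value, a minimum-average-distortion decision rule, plain k-means for codebook design, and a simple mean to combine per-feature estimates. All function names and parameters (`train_codebook`, `avg_distortion`, `classify_snr`, `fuse`, `size`, `iters`) are hypothetical, and the feature vectors are taken as given rather than computed from speech.

```python
import numpy as np


def train_codebook(vectors, size, iters=20, seed=0):
    """Design a VQ codebook with plain k-means (an assumed stand-in for
    whatever codebook design algorithm the paper actually uses)."""
    rng = np.random.default_rng(seed)
    start = rng.choice(len(vectors), size, replace=False)
    codebook = vectors[start].copy()
    for _ in range(iters):
        # Squared Euclidean distance from every vector to every codeword.
        dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = dists.argmin(axis=1)
        for k in range(size):
            members = vectors[nearest == k]
            if len(members):  # leave empty cells unchanged
                codebook[k] = members.mean(axis=0)
    return codebook


def avg_distortion(vectors, codebook):
    """Mean distance from each feature vector to its nearest codeword."""
    dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return float(dists.min(axis=1).mean())


def classify_snr(vectors, codebooks):
    """Pick the SNR label (dB) whose codebook quantizes the test
    vectors with the lowest average distortion.

    codebooks: dict mapping an SNR value in dB to its codebook array.
    """
    return min(codebooks, key=lambda snr: avg_distortion(vectors, codebooks[snr]))


def fuse(estimates):
    """Combine per-feature SNR estimates; a plain mean is assumed here."""
    return float(np.mean(estimates))
```

In use, one codebook dictionary would be trained per feature type, `classify_snr` would be run once per feature on the test utterance, and `fuse` would merge the resulting estimates into a single SNR value in dB.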
Pages: 335-345
Number of pages: 11