Voice Activity Detection Using an Improved Unvoiced Feature Normalization Process in Noisy Environments

被引:5
作者
Chung, Kyungyong [1 ]
Oh, Sang Yeob [2 ]
机构
[1] Sangji Univ, Sch Comp Informat Engn, 83 Sangjidae Gil, Wonju 220702, Gangwon Do, South Korea
[2] Gachon Univ, Dept Comp Engn, Songnam 461701, Gyeonggi Do, South Korea
关键词
Voice recognition; Voice detection; Noise elimination; Feature extraction; UFN; Normalization;
D O I
10.1007/s11277-015-3169-5
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Noise-elimination technology is used to eliminate noise, including environmental noise, from voice signals in order to increase voice recognition rates. Noise estimation is the most important factor in noise-elimination technology. One of the effective estimation methods is voice activity detection, which is based on the statistical properties of noise and voice. This method is a way of estimating noise using the statistical properties of both noise and voice, which have an independent Gaussian distribution. In cases of severe differences in a statistical property, like white noise, the method is very reliable but limited to signals having a low signal-to-noise ratio (SNR) or having speech shape noise, which has statistical properties similar to voice signals. Methods to increase the voice recognition rate suffer from decreasing voice recognition performance due to distortion of the voice spectrum and to missing voice frames, because noise remains if there has been incorrect estimation of the noise. Degradation in voice recognition performance emerges in the differences between the model training environment and the voice recognition environment. In order to decrease environmental discordance, various silence feature normalization methods are used. Existing silence feature normalization suffers from degradation of recognition performance because the classification accuracy for the voiced and unvoiced signals decreases by an increasing energy level in the silence section of a low SNR. This paper proposes a robust voice characteristic detection method for noisy environments using feature extraction and unvoiced feature normalization for a classification relative to the voiced and unvoiced signals. The suggested method constitutes a model for recognition by extracting the characteristics for classification of the voiced and unvoiced signals in a high SNR environment. Also, the model affects noise for voice characteristics less, and recognition performance improves by using the Cepstrum feature distribution property of voiced and unvoiced signals with a low SNR. The model was checked for its ability to improve recognition performance relative to the existing method based on recognition experiment results.
引用
收藏
页码:747 / 759
页数:13
相关论文
共 30 条
  • [1] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [2] Choi GK, 2009, J ACOUST SOC KOREA, V28, P447
  • [3] Interactive Design Recommendation Using Sensor Based Smart Wear and Weather WebBot
    Chung, Kyung-Yong
    Na, Young-Joo
    Lee, Jung-Hyun
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2013, 73 (02) : 243 - 256
  • [4] Knowledge based decision support system
    Chung, Kyungyong
    Boutaba, Raouf
    Hariri, Salim
    [J]. INFORMATION TECHNOLOGY & MANAGEMENT, 2016, 17 (01) : 1 - 3
  • [5] Recent Trends in Digital Convergence Information System Preface
    Chung, Kyungyong
    Boutaba, Raouf
    Hariri, Salim
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2014, 79 (04) : 2409 - 2413
  • [6] Auditory patterns
    Fletcher, H
    [J]. REVIEWS OF MODERN PHYSICS, 1940, 12 (01) : 0047 - 0065
  • [7] Hirsch H.-G., 2000, 6 INT C SPOKEN LANGU, P181
  • [8] P2P context awareness based sensibility design recommendation using color and bio-signal analysis
    Jung, Hoill
    Chung, Kyungyong
    [J]. PEER-TO-PEER NETWORKING AND APPLICATIONS, 2016, 9 (03) : 546 - 557
  • [9] Knowledge-based dietary nutrition recommendation for obese management
    Jung, Hoill
    Chung, Kyungyong
    [J]. INFORMATION TECHNOLOGY & MANAGEMENT, 2016, 17 (01) : 29 - 42
  • [10] Ontology-driven slope modeling for disaster management service
    Jung, Hoill
    Chung, Kyungyong
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 677 - 692