Voice-Activity Detection Using Long-Term Sub-Band Entropy Measure

被引:0
作者
Wang, Kun-Ching [1 ]
机构
[1] Shin Chien Univ, Taipei, Taiwan
关键词
voice activity detection; long-term spectral analysis; sub-band entropy; variable-level noise; SPEECH RECOGNITION; WORD RECOGNITION; NOISE;
D O I
10.1587/transfun.E95.A.1606
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A novel long-term sub-band entropy (LT-SubEntropy) measure, which uses improved long-term spectral analysis and sub-band entropy, is proposed for voice activity detection (VAD). Based on the measure, we can accurately exploit the inherent nature of the formant structure on speech spectrogram (the well-known as voiceprint). Results show that the proposed VAD is superior to existing standard VAD methods at low SNR levels, especially at variable-level noise.
引用
收藏
页码:1606 / 1609
页数:4
相关论文
共 13 条
  • [1] ITU-T recommendation G.729 Annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
    Benyassine, A
    Shlomot, E
    Su, HY
    Massaloux, D
    Lamblin, C
    Petit, JP
    [J]. IEEE COMMUNICATIONS MAGAZINE, 1997, 35 (09) : 64 - 73
  • [2] *ETSI EN, 1999, 301708 ETSI EN
  • [3] KAISER JF, 1990, INT CONF ACOUST SPEE, P381, DOI 10.1109/ICASSP.1990.115702
  • [4] Kato H., 2008, SPEECH COMMUN, V50, P476
  • [5] AUTOMATIC WORD RECOGNITION IN CARS
    MOKBEL, CE
    CHOLLET, GFA
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (05): : 346 - 356
  • [6] An effective subband OSF-based VAD with noise reduction for robust speech recognition
    Ramírez, J
    Segura, JC
    Benítez, C
    de la Torre, A
    Rubio, A
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (06): : 1119 - 1129
  • [7] Efficient voice activity detection algorithms using long-term speech information
    Ramírez, J
    Segura, JC
    Benítez, C
    de la Torre, A
    Rubio, A
    [J]. SPEECH COMMUNICATION, 2004, 42 (3-4) : 271 - 287
  • [8] AN IMPROVED END-POINT DETECTOR FOR ISOLATED WORD RECOGNITION - COMMENT
    REAVES, B
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (02) : 526 - 527
  • [9] Sarikaya R., 1998, NORSIG'98. 3rd IEEE Nordic Signal Processing Symposium, P81
  • [10] A ROBUST ALGORITHM FOR ACCURATE ENDPOINTING OF SPEECH SIGNALS
    SAVOJI, MH
    [J]. SPEECH COMMUNICATION, 1989, 8 (01) : 45 - 60