Voice activity detection algorithm based on long-term pitch information

Cited by: 5
Authors
Yang, Xu-Kui [1, 2]
He, Liang [3]
Qu, Dan [1]
Zhang, Wei-Qiang [3]
Affiliations
[1] Zhengzhou Informat Sci & Technol Inst, Zhengzhou, Peoples R China
[2] State Key Lab Integrated Serv Networks, Beijing, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
Source
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2016
Funding
National Natural Science Foundation of China
Keywords
Voice activity detection; Non-stationary noise; Long-term pitch envelope; Long-term pitch divergence; NOISE
DOI
10.1186/s13636-016-0092-y
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
A new voice activity detection algorithm based on long-term pitch divergence is presented. The long-term pitch divergence not only decomposes speech signals with a bionic decomposition but also makes full use of long-term information. It is more discriminative than other feature sets, such as long-term spectral divergence. Experimental results show that, among the six algorithms analyzed, the proposed algorithm performs best, achieving the highest non-speech hit rate and a reasonably high speech hit rate.
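The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes the usual frame-level definitions (speech hit rate = fraction of reference speech frames labelled speech; non-speech hit rate = fraction of reference non-speech frames labelled non-speech); the abstract itself does not spell these out. The function name `hit_rates` and the toy labels are hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple


def hit_rates(reference: Sequence[int], predicted: Sequence[int]) -> Tuple[float, float]:
    """Return (speech_hit_rate, non_speech_hit_rate) for binary frame labels.

    Labels are assumed to be 1 for speech and 0 for non-speech (an
    illustrative convention, not one stated in the abstract).
    """
    if len(reference) != len(predicted):
        raise ValueError("reference and predicted must have the same length")

    speech_total = sum(1 for r in reference if r == 1)
    non_speech_total = len(reference) - speech_total
    if speech_total == 0 or non_speech_total == 0:
        raise ValueError("reference must contain both speech and non-speech frames")

    # Count frames where the detector agrees with the reference, per class.
    speech_hits = sum(1 for r, p in zip(reference, predicted) if r == 1 and p == 1)
    non_speech_hits = sum(1 for r, p in zip(reference, predicted) if r == 0 and p == 0)

    return speech_hits / speech_total, non_speech_hits / non_speech_total


# Toy example: 4 speech frames (all detected), 4 non-speech frames (one false alarm).
reference = [0, 0, 1, 1, 1, 1, 0, 0]
predicted = [0, 1, 1, 1, 1, 1, 0, 0]
speech_hr, non_speech_hr = hit_rates(reference, predicted)
print(speech_hr, non_speech_hr)
```

A detector that labels every frame as speech would score a perfect speech hit rate and a zero non-speech hit rate, which is why the two figures are reported together.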
Pages: 9
Related papers
50 in total
  • [21] Voice-Activity Detection Using Long-Term Sub-Band Entropy Measure
    Wang, Kun-Ching
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (09) : 1606 - 1609
  • [22] The Long-Term Prognosis of Voice Pitch Change in Female Patients After Thyroid Surgery
    Jun-Ook Park
    Ja-Sung Bae
    So-Hee Lee
    Mi-Ran Shim
    Yeon-Shin Hwang
    Young-Hoon Joo
    Young Hak Park
    Dong-Il Sun
    World Journal of Surgery, 2016, 40 : 2382 - 2390
  • [23] Long-Term Spectral Statistics for Voice Presentation Attack Detection
    Muckenhirn, Hannah
    Korshunov, Pavel
    Magimai-Doss, Mathew
    Marcel, Sebastien
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (11) : 2098 - 2111
  • [24] Voice Activity Detector (VAD) Based on Long-Term Mel Frequency Band Features
    Salishev, Sergey
    Barabanov, Andrey
    Kocharov, Daniil
    Skrelin, Pavel
    Moiseev, Mikhail
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 352 - 358
  • [26] ROBUST VOICE ACTIVITY DETECTION BASED ON PITCH AND SUB-BAND ENERGY
    Zhang, Zhihao
    Lin, Jinlong
    SIGMAP 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2009, : 44 - 48
  • [27] Long-term Tracking Algorithm Based on Dimensionality Reduction and Re-Detection
    Xia L.
    Zhang Y.
    Huang Y.
    Jia H.
    Zhang, Ya (zhangya@aust.edu.cn), 1600, Institute of Computing Technology (33): : 385 - 394
  • [28] Smartphone-based detection of voice disorders by long-term monitoring of neck acceleration features
    Mehta, Daryush D.
    Zanartu, Matias
    Van Stan, Jarrad H.
    Feng, Shengran W.
    Cheyne, Harold A., II
    Hillman, Robert E.
    2013 IEEE INTERNATIONAL CONFERENCE ON BODY SENSOR NETWORKS (BSN), 2013,
  • [29] A Multimicrophone Voice Activity Detection System Based on Mutual Information
    Talantzis, Fotios
    Constantinides, Anthony G.
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2009, 57 (11) : 937 - 950
  • [30] A new algorithm for voice activity detection based on wavelet transform
    Jiang, SJ
    Guo, HT
    Yin, FL
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 222 - 225