Noisy Speech Endpoint Detection Using Robust Feature

被引:0
|
作者
Ouzounov, Atanas [1 ]
机构
[1] Inst Informat & Commun Technol, Sofia, Bulgaria
来源
关键词
Teager energy; Speech activity detection; Group delay spectrum; ALGORITHMS;
D O I
10.1007/978-3-319-13386-7_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper a new robust feature for speech endpoint detection is proposed. It combines the properties of the Modified Group Delay Spectrum (MGDS) and the Mean Delta (MD) approach in order to obtain the more robust endpoint detection. This feature is named as Group Delay Mean Delta (GDMD) feature. The effectiveness of proposed feature and other three features for trajectory- based endpoint detection is experimentally evaluated in the fixed-text Dynamic Time Warping (DTW) - based speaker verification task with short phrases of telephone speech. The analysed features are - Modified Teager Energy (MTE), Energy-Entropy (EE) feature and MD feature. The results of the experiments have shown that the GDMD feature demonstrates the best performance in endpoint detection tests in terms of verification rate.
引用
收藏
页码:105 / 117
页数:13
相关论文
共 50 条
  • [1] A novel algorithm to robust speech endpoint detection in noisy environments
    Yi, Li
    Yingle, Fan
    ICIEA 2007: 2ND IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-4, PROCEEDINGS, 2007, : 1555 - 1558
  • [2] A robust endpoint detection of speech for noisy environments with application to automatic speech recognition
    Bou-Ghazale, SE
    Assaleh, K
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3808 - 3811
  • [3] Robust endpoint detection for speech recognition based on discriminative feature extraction
    Yamamoto, Koichi
    Jabloun, Firas
    Reinhard, Klaus
    Kawamura, Akinori
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 805 - 808
  • [4] Endpoint detection of noisy speech by the use of cepstrum
    Wei, Xiaodong
    Hu, Guangrui
    Ren, Xiaolin
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2000, 34 (02): : 185 - 188
  • [5] Endpoint detection of noisy speech based on cepstrum
    Hu, Guangrui
    Wei, Xiaodong
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2000, 28 (10): : 95 - 97
  • [6] Robust Speech Endpoint Detection in Noisy Environments for HRI (Human-Robot Interface)
    Park, Jin-Soo
    Ko, Han-Seok
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (02): : 147 - 156
  • [7] Speech Endpoint Detection in Noisy Environment Using Spectrogram Boundary Factor
    Wu, Di
    Tao, Zhi
    Wu, Yuanbo
    Shen, Cheng
    Xiao, Zhongzhe
    Zhang, Xiaojun
    Wu, Di
    Zhao, Heming
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 964 - 968
  • [8] Endpoint detection method of noisy Chinese speech recognition
    Wang, Peng
    Ta, Weina
    Chen, Shuzhong
    Jisuanji Gongcheng/Computer Engineering, 2003, 29 (17):
  • [9] Robust speech endpoint detection based on MP3 file in various noisy environments
    Wang, Fang
    Huang, Xianglin
    Yang, Lifang
    Liu, Tao
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 670 - 675