EMD-Based Method to Improve the Efficiency of Speech/Pause Segmentation

被引:2
|
作者
Alimuradov, Alan K. [1 ]
Tychkov, Alexander Yu [2 ]
机构
[1] Penza State Univ, Res & Prod Business Incubator, Penza, Russia
[2] Penza State Univ, Res Inst Fundamental & Appl Studies, Penza, Russia
来源
INTERNATIONAL SIBERIAN CONFERENCE ON CONTROL AND COMMUNICATIONS (SIBCON 2021 ) | 2021年
关键词
speech signal processing; speech segmentation; voiced and unvoiced speech; empirical mode decomposition; EMPIRICAL MODE DECOMPOSITION; CLASSIFICATION;
D O I
10.1109/SIBCON50419.2021.9438905
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech/pause segmentation is classification of informative sections into voiced and unvoiced speech, and pauses. Accurate detection of the boundaries of the beginning and the end of informative sections of speech signals is one of the most important tasks in speech applications. The article presents a method for increasing the efficiency of speech/pause segmentation based on empirical mode decomposition. The proposed method is based on the use of decomposition in pre-processing of the original speech signals to form a set of new investigated signals containing the most reliable information about the boundaries of the beginning and the end of the sections of voiced and unvoiced speech, and pauses. The research has been carried out to evaluate the effect of the decomposition method and the duration of the fragments of the studied signals on the efficiency of speech/pause segmentation. The methods based on zero-crossing rate, short-time energy, and the analysis of one-dimensional Mahalanobis distance, were used for segmentation. The obtained research results have shown an increase in the efficiency of speech/pause segmentation by an average of 11.44 % for the first and second kind errors.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Speech enhancement using ICA with EMD-based reference
    Zheng, YR
    Lin, QH
    Yin, FL
    Liang, HL
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 739 - 746
  • [2] Speech Enhancement Using Spectral Weighting with EMD-Based Post Filtering
    Akter, Sheuli
    Islam, Md. Rabiul
    Rashed-Al-Mahfuz, Md.
    Molla, M. K. I.
    2013 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE 2013), 2013, : 29 - +
  • [3] An EMD-based recognition method for Chinese fonts and styles
    Yang, Zhihua
    Yang, Lihua
    Qi, Dongxu
    Suen, Ching Y.
    PATTERN RECOGNITION LETTERS, 2006, 27 (14) : 1692 - 1701
  • [4] EMD-Based Borehole Radar Signal Preprocessing Method
    Zou N.
    Jin Y.
    Guo C.
    Tao S.
    Song H.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2022, 51 (06): : 875 - 883
  • [5] EMD-Based signal filtering
    Boudraa, Abdel-Ouahab
    Cexus, Jean-Christophe
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2007, 56 (06) : 2196 - 2202
  • [6] EMD-Based Filtering (EMDF) of Low-Frequency Noise for Speech Enhancement
    Chatlani, Navin
    Soraghan, John J.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1158 - 1166
  • [7] A Hybrid EMD-Based Signal Denosing Method for Gyrowheel System
    Zhao, Hui
    Zhao, Yuyu
    Huo, Xin
    Yao, Yu
    PROCEEDINGS OF 2019 IEEE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY (ICEICT 2019), 2019, : 210 - 214
  • [8] EMD-Based Denoising By Customized Thresholding
    Mohguen, Wahiba
    Bekka, Rais El'hadi
    2017 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND DIAGNOSIS (ICCAD), 2017, : 19 - 23
  • [9] EMD-Based Signal Noise Reduction
    Boudraa, A. O.
    Cexus, J. C.
    Saidi, Z.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 2, 2005, 2 : 93 - 96
  • [10] Why do EMD-based methods improve prediction? A multiscale complexity perspective
    Dong, Jichang
    Dai, Wei
    Tang, Ling
    Yu, Lean
    JOURNAL OF FORECASTING, 2019, 38 (07) : 714 - 731