A Voice Activity Detection Algorithm Using Sparse Non-negative Matrix Factorization-based Model Learning in Spectro-Temporal Domain

Cited by: 0
Author
Mavaddati, S. [1]
Affiliation
[1] Univ Mazandaran, Fac Engn & Technol, Babolsar, Iran
Source
INTERNATIONAL JOURNAL OF ENGINEERING | 2023 / Vol. 36 / Issue 08
Keywords
Voice Activity Detector; Spectro-temporal Domain; Spectro-temporal Sparse Structured Principal Component Analysis; Sparse Non-negative Matrix Factorization; RECOGNITION; NOISE
DOI
10.5829/ije.2023.36.08b.08
Chinese Library Classification (CLC) number
T [Industrial Technology]
Subject classification code
08
Abstract
Voice activity detectors separate the silence and speech segments of a speech signal so that different background noise signals can be eliminated. This paper proposes a novel voice activity detector that uses spectro-temporal features extracted from the auditory model of the speech signal. After the scale, rate, and frequency features are extracted from this feature space, a sparse structured principal component analysis algorithm captures the basic components of these features and reduces the dimension of the learning data. The resulting feature vectors are then used to learn the models with the sparse non-negative matrix factorization algorithm. The model learning procedure represents each feature vector with a proper sparse rate based on the selected atoms. Voice activity detection is performed by computing the energy of the sparse representation of each input frame over the composite model. If the calculated energy exceeds a specified threshold, the input frame has a structure similar to the atoms of the learned models, and the frame is concluded to have voice content. The proposed detector was compared with other baseline methods and classifiers in this processing field. The results in the presence of stationary, non-stationary, and periodic noises show that the proposed method, based on model learning with spectro-temporal features, can correctly detect silence/speech activity.
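The detection rule described in the abstract (learn atoms with sparse NMF, encode each input frame, threshold the energy of its sparse code) can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the function names, the L1-penalized multiplicative updates, the synthetic data, and the threshold value are all assumptions chosen for illustration, and the auditory-model feature extraction and the sparse structured PCA step are omitted. A single learned dictionary stands in for the composite model.

```python
import numpy as np

EPS = 1e-9  # guards against division by zero

def sparse_nmf(V, n_atoms, sparsity=0.1, n_iter=200, seed=0):
    """Factor non-negative V (features x frames) as W @ H with an L1 penalty on H.

    Assumption: Euclidean-cost multiplicative updates; the paper's exact
    sparse NMF variant is not given in the abstract.
    """
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], n_atoms)) + EPS
    H = rng.random((n_atoms, V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + sparsity + EPS)
        W *= (V @ H.T) / (W @ H @ H.T + EPS)
        # Unit-norm atoms keep the L1 penalty on H meaningful.
        W /= np.linalg.norm(W, axis=0, keepdims=True) + EPS
    return W, H

def encode(W, X, sparsity=0.1, n_iter=200):
    """Sparse non-negative codes of the frames in X over a fixed dictionary W."""
    H = np.full((W.shape[1], X.shape[1]), 0.5)
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + sparsity + EPS)
    return H

def detect_voice(W, X, threshold, sparsity=0.1):
    """Flag a frame as voiced when the energy of its sparse code exceeds threshold."""
    H = encode(W, X, sparsity)
    energy = np.sum(H ** 2, axis=0)
    return energy > threshold, energy

# Synthetic stand-in data (hypothetical; real inputs would be spectro-temporal features).
rng = np.random.default_rng(1)
basis = rng.random((20, 4))                      # hidden "speech" patterns
train = basis @ rng.random((4, 100))             # training frames
W, _ = sparse_nmf(train, n_atoms=4)

voiced = basis @ (0.5 + rng.random((4, 5)))      # frames built from the patterns
quiet = 1e-3 * rng.random((20, 5))               # near-silent frames
flags, energy = detect_voice(W, np.hstack([voiced, quiet]), threshold=1.0)
print(flags)
```

In this sketch the L1 term drives the codes of near-silent frames to zero, so their energy falls below the threshold, while frames that resemble the learned atoms keep large coefficients. The threshold of 1.0 is arbitrary and would need tuning on real data.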
Pages: 1478-1488
Page count: 11