An improved model for unsupervised voice activity detection

被引:0
|
作者
Sharma, Shilpa [1 ,2 ]
Malhotra, Rahul [3 ]
Sharma, Anurag [4 ]
机构
[1] CT Grp Inst, Comp Sci & Engn, Jalandhar, India
[2] Lovely Profess Univ, Phagwara 144411, Punjab, India
[3] CT Grp Inst, Elect & Telecommun Engn, Jalandhar 144020, India
[4] GNA Univ, Dept Comp Sci & Engn, Phagwara 144401, India
关键词
voice activity detector; artificial neural network; SVM; support vector machine; K-means; unsupervised learning; machine learning; TIMIT database;
D O I
10.1504/IJNT.2023.131117
中图分类号
TB3 [工程材料学];
学科分类号
0805 ; 080502 ;
摘要
The antique way to express our self is speech and nowadays speech is being used in many applications especially in machine communication. As the application of speech is increasing at rapid rate, therefore various techniques are evolving to separate out the speech signals from audio signal which is mixture of noise and speech. The method to distinguish voice and noise is known as voice activity detection. This method is gaining huge popularity as it removes background noise and acceptable approach in the area of speech coding, audio surveillance and monitoring. In this manuscript, hybrid model of unsupervised classifier is investigated. The proposed approach is tested at different levels of noise signal and overlap window size. To validate the proposed approach, a comparison with existing artificial neural network and support vector machine (SVM) is presented. The outcomes of the proposed method are observed better than the existing methods with the accuracy of 99.73% along with better SNR of 25.61 dB. Also proposed model LFV-KANN efficiently handles increase in noise power by hybridisation of two classifiers: ANN and K-means clustering.
引用
收藏
页码:235 / 258
页数:25
相关论文
共 50 条
  • [1] Unsupervised voice activity detection with improved signal-to-noise ratio in noisy environment
    Sharma, Shilpa
    Malhotra, Rahul
    Sharma, Anurag
    Bala, Jeevan
    Rattan, Punam
    Vashisht, Sheveta
    INTERNATIONAL JOURNAL OF NANOTECHNOLOGY, 2023, 20 (1-4) : 421 - 432
  • [2] Voice Activity Detection Based on an Unsupervised Learning Framework
    Ying, Dongwen
    Yan, Yonghong
    Dang, Jianwu
    Soong, Frank K.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2624 - 2632
  • [3] Online Unsupervised Classification With Model Comparison in the Variational Bayes Framework for Voice Activity Detection
    Cournapeau, David
    Watanabe, Shinji
    Nakamura, Atsushi
    Kawahara, Tatsuya
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (06) : 1071 - 1083
  • [4] USING ONLINE MODEL COMPARISON IN THE VARIATIONAL BAYES FRAMEWORK FOR ONLINE UNSUPERVISED VOICE ACTIVITY DETECTION
    Cournapeau, David
    Watanabe, Shinji
    Nakamura, Atsushi
    Kawahara, Tatsuya
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4462 - 4465
  • [5] Innovative Method for Unsupervised Voice Activity Detection and Classification of Audio Segments
    Ali, Zulfiqar
    Talha, Muhammad
    IEEE ACCESS, 2018, 6 : 15494 - 15504
  • [6] Using Variational Bayes free energy for unsupervised voice activity detection
    Cournapeau, David
    Kawahara, Tatsuya
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4429 - 4432
  • [7] rVAD: An unsupervised segment-based robust voice activity detection method
    Tan, Zheng-Hua
    Sarkar, Achintya Kr
    Dehak, Najim
    COMPUTER SPEECH AND LANGUAGE, 2020, 59 : 1 - 21
  • [8] UNSUPERVISED DOMAIN ADAPTATION FOR DEEP NEURAL NETWORK BASED VOICE ACTIVITY DETECTION
    Zhang, Xiao-Lei
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [9] Voice Activity Detection by Upper Body Motion Analysis and Unsupervised Domain Adaptation
    Shahid, Muhammad
    Beyan, Cigdem
    Murino, Vittorio
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1260 - 1269
  • [10] A Fusion Model for Robust Voice Activity Detection
    Wang, Guan-Bo
    Zhang, Wei-Qiang
    2019 IEEE 19TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2019), 2019,