An improved model for unsupervised voice activity detection

被引:0
作者
Sharma, Shilpa [1 ,2 ]
Malhotra, Rahul [3 ]
Sharma, Anurag [4 ]
机构
[1] CT Grp Inst, Comp Sci & Engn, Jalandhar, India
[2] Lovely Profess Univ, Phagwara 144411, Punjab, India
[3] CT Grp Inst, Elect & Telecommun Engn, Jalandhar 144020, India
[4] GNA Univ, Dept Comp Sci & Engn, Phagwara 144401, India
关键词
voice activity detector; artificial neural network; SVM; support vector machine; K-means; unsupervised learning; machine learning; TIMIT database;
D O I
10.1504/IJNT.2023.131117
中图分类号
TB3 [工程材料学];
学科分类号
0805 ; 080502 ;
摘要
The antique way to express our self is speech and nowadays speech is being used in many applications especially in machine communication. As the application of speech is increasing at rapid rate, therefore various techniques are evolving to separate out the speech signals from audio signal which is mixture of noise and speech. The method to distinguish voice and noise is known as voice activity detection. This method is gaining huge popularity as it removes background noise and acceptable approach in the area of speech coding, audio surveillance and monitoring. In this manuscript, hybrid model of unsupervised classifier is investigated. The proposed approach is tested at different levels of noise signal and overlap window size. To validate the proposed approach, a comparison with existing artificial neural network and support vector machine (SVM) is presented. The outcomes of the proposed method are observed better than the existing methods with the accuracy of 99.73% along with better SNR of 25.61 dB. Also proposed model LFV-KANN efficiently handles increase in noise power by hybridisation of two classifiers: ANN and K-means clustering.
引用
收藏
页码:235 / 258
页数:25
相关论文
共 50 条
  • [1] Unsupervised voice activity detection with improved signal-to-noise ratio in noisy environment
    Sharma, Shilpa
    Malhotra, Rahul
    Sharma, Anurag
    Bala, Jeevan
    Rattan, Punam
    Vashisht, Sheveta
    INTERNATIONAL JOURNAL OF NANOTECHNOLOGY, 2023, 20 (1-4) : 421 - 432
  • [2] Voice Activity Detection Based on an Unsupervised Learning Framework
    Ying, Dongwen
    Yan, Yonghong
    Dang, Jianwu
    Soong, Frank K.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2624 - 2632
  • [3] Innovative Method for Unsupervised Voice Activity Detection and Classification of Audio Segments
    Ali, Zulfiqar
    Talha, Muhammad
    IEEE ACCESS, 2018, 6 : 15494 - 15504
  • [4] Support Vector Machine based Voice Activity Detection
    Baig, M.
    Masud, S.
    Awais, M.
    2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 295 - 298
  • [5] A Voice Activity Detection Model Composed of Bidirectional LSTM and Attention Mechanism
    Yu, Yeonguk
    Kim, Yoon-Joong
    2018 IEEE 10TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2018,
  • [6] Unsupervised structural damage detection based on an improved generative adversarial network and cloud model
    Luo, Yongpeng
    Guo, Xu
    Wang, Lin-kun
    Zheng, Jin-ling
    Liu, Jing-liang
    Liao, Fei-yu
    JOURNAL OF LOW FREQUENCY NOISE VIBRATION AND ACTIVE CONTROL, 2023, 42 (03) : 1501 - 1518
  • [7] Voice Activity Detection based on Statistical Model Employing Deep Neural Network
    Hwang, Inyoung
    Chang, Joon-Hyuk
    2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014, : 582 - 585
  • [8] Modeling Inhalation in Voice Activity Detection
    Aguiar Pontes, Josafa de Jesus
    2019 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND SOFTWARE TECHNOLOGIES (ICI2ST), 2019, : 24 - 31
  • [9] An unsupervised approach for human activity detection and recognition
    Department of Electrical Engineering and Information, Systems, Graduate School of Engineering, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku
    Tokyo, Japan
    不详
    CA, United States
    Int. J. Simul. Syst. Sci. Technol., 5 (42-49): : 42 - 49
  • [10] Voice activity detection based on statistical models and machine learning approaches
    Shin, Jong Won
    Chang, Joon-Hyuk
    Kim, Nam Soo
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (03) : 515 - 530