An improved model for unsupervised voice activity detection

被引:0
|
作者
Sharma, Shilpa [1 ,2 ]
Malhotra, Rahul [3 ]
Sharma, Anurag [4 ]
机构
[1] CT Grp Inst, Comp Sci & Engn, Jalandhar, India
[2] Lovely Profess Univ, Phagwara 144411, Punjab, India
[3] CT Grp Inst, Elect & Telecommun Engn, Jalandhar 144020, India
[4] GNA Univ, Dept Comp Sci & Engn, Phagwara 144401, India
关键词
voice activity detector; artificial neural network; SVM; support vector machine; K-means; unsupervised learning; machine learning; TIMIT database;
D O I
10.1504/IJNT.2023.131117
中图分类号
TB3 [工程材料学];
学科分类号
0805 ; 080502 ;
摘要
The antique way to express our self is speech and nowadays speech is being used in many applications especially in machine communication. As the application of speech is increasing at rapid rate, therefore various techniques are evolving to separate out the speech signals from audio signal which is mixture of noise and speech. The method to distinguish voice and noise is known as voice activity detection. This method is gaining huge popularity as it removes background noise and acceptable approach in the area of speech coding, audio surveillance and monitoring. In this manuscript, hybrid model of unsupervised classifier is investigated. The proposed approach is tested at different levels of noise signal and overlap window size. To validate the proposed approach, a comparison with existing artificial neural network and support vector machine (SVM) is presented. The outcomes of the proposed method are observed better than the existing methods with the accuracy of 99.73% along with better SNR of 25.61 dB. Also proposed model LFV-KANN efficiently handles increase in noise power by hybridisation of two classifiers: ANN and K-means clustering.
引用
收藏
页码:235 / 258
页数:25
相关论文
共 50 条
  • [21] Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
    Sarkar, Eklavya
    Prasad, RaviShankar
    Doss, Mathew Magimai
    INTERSPEECH 2022, 2022, : 4626 - 4630
  • [22] An Unsupervised Visual-only Voice Activity Detection Approach Using Temporal Orofacial Features
    Tao, Fei
    Hansen, John H. L.
    Busso, Carlos
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2302 - 2306
  • [23] Unsupervised Singing Voice Detection Using Dictionary Learning
    Pikrakis, Aggelos
    Kopsinis, Yannis
    Kroher, Nadine
    Diaz-Banez, Jose-Miguel
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1212 - 1216
  • [24] Improved unsupervised anomaly detection algorithm
    Luo, Na
    Yuan, Fuyu
    Zuo, Wanli
    He, Fengling
    Zhou, Zhiguo
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2008, 5009 : 532 - +
  • [25] Improved autoencoder for unsupervised anomaly detection
    Cheng, Zhen
    Wang, Siwei
    Zhang, Pei
    Wang, Siqi
    Liu, Xinwang
    Zhu, En
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (12) : 7103 - 7125
  • [26] Voice activity detection using haircell model in noisy environment
    Huang, CY
    Wang, HC
    Wang, JF
    ICES 2002: 9TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS I-111, CONFERENCE PROCEEDINGS, 2002, : 999 - 1002
  • [27] AN EFFICIENT TRANSFORMER-BASED MODEL FOR VOICE ACTIVITY DETECTION
    Zhao, Yifei
    Champagne, Benoit
    2022 IEEE 32ND INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2022,
  • [28] Noise robust model-based Voice Activity Detection
    de la Torre, Angel
    Ramirez, Javier
    Benitez, Carmen
    Segura, Jose C.
    Garcia, Luz
    Rubio, Antonio J.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1954 - 1957
  • [29] Voice Activity Detection Using an Adaptive Context Attention Model
    Kim, Juntae
    Hahn, Minsoo
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (08) : 1181 - 1185
  • [30] Voice activity detection using Laplacian model and UMP test
    Jang, Keun Won
    Kim, Dong Kook
    Chang, Joon-Hyuk
    PROCEEDING OF THE 11TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS: COMPUTER SCIENCE AND TECHNOLOGY, VOL 4, 2007, : 480 - +