An improved model for unsupervised voice activity detection

Cited by: 0

Authors:

Sharma, Shilpa [1,2]

Malhotra, Rahul [3]

Sharma, Anurag [4]

Affiliations:

[1] CT Grp Inst, Comp Sci & Engn, Jalandhar, India

[2] Lovely Profess Univ, Phagwara 144411, Punjab, India

[3] CT Grp Inst, Elect & Telecommun Engn, Jalandhar 144020, India

[4] GNA Univ, Dept Comp Sci & Engn, Phagwara 144401, India

Source:

INTERNATIONAL JOURNAL OF NANOTECHNOLOGY | 2023 / Vol. 20 / Issue 1-4

Keywords:

voice activity detector; artificial neural network; SVM; support vector machine; K-means; unsupervised learning; machine learning; TIMIT database;

DOI:

10.1504/IJNT.2023.131117

Chinese Library Classification (CLC) number:

TB3 [Engineering Materials Science];

Discipline classification code:

0805 ; 080502 ;

Abstract:

Speech is the oldest way of expressing ourselves, and it is now used in many applications, especially in communication with machines. As the use of speech grows rapidly, various techniques are evolving to separate speech from audio signals that mix speech and noise. The method of distinguishing voice from noise is known as voice activity detection. It is gaining popularity because it removes background noise and is an accepted approach in speech coding, audio surveillance and monitoring. In this manuscript, a hybrid model of unsupervised classifiers is investigated. The proposed approach is tested at different noise levels and overlap window sizes. To validate it, a comparison with existing artificial neural network (ANN) and support vector machine (SVM) methods is presented. The proposed method outperforms the existing methods, with an accuracy of 99.73% and a better SNR of 25.61 dB. The proposed model, LFV-KANN, also handles increases in noise power efficiently by hybridising two classifiers: ANN and K-means clustering.


Pages: 235-258

Page count: 25
