An improved model for unsupervised voice activity detection

Cited by: 0

Authors:

Sharma, Shilpa ^{[1,2]}

Malhotra, Rahul ^{[3]}

Sharma, Anurag ^{[4]}

Affiliations:

[1] CT Grp Inst, Comp Sci & Engn, Jalandhar, India

[2] Lovely Profess Univ, Phagwara 144411, Punjab, India

[3] CT Grp Inst, Elect & Telecommun Engn, Jalandhar 144020, India

[4] GNA Univ, Dept Comp Sci & Engn, Phagwara 144401, India

Source:

INTERNATIONAL JOURNAL OF NANOTECHNOLOGY | 2023 / Vol. 20 / Issue 1-4

Keywords:

voice activity detector; artificial neural network; SVM; support vector machine; K-means; unsupervised learning; machine learning; TIMIT database;

DOI:

10.1504/IJNT.2023.131117

Chinese Library Classification (CLC) number:

TB3 [Engineering Materials Science];

Discipline classification code:

0805 ; 080502 ;

Abstract:

Speech is the oldest means of self-expression, and it is now used in many applications, especially machine communication. As the use of speech grows rapidly, various techniques are evolving to separate speech from an audio signal that is a mixture of noise and speech. The method of distinguishing voice from noise is known as voice activity detection. It is gaining popularity because it removes background noise and is an accepted approach in speech coding, audio surveillance and monitoring. In this manuscript, a hybrid model of unsupervised classifiers is investigated. The proposed approach is tested at different noise levels and overlap window sizes. To validate the proposed approach, a comparison with an existing artificial neural network (ANN) and a support vector machine (SVM) is presented. The proposed method outperforms the existing methods, with an accuracy of 99.73% and a better SNR of 25.61 dB. The proposed model, LFV-KANN, also handles increases in noise power efficiently through the hybridisation of two classifiers: ANN and K-means clustering.


Pages: 235-258

Page count: 25

