Robust speaker detection using Neural Networks

Cited by: 0

Authors:

Shell, John R. [1]

Affiliation:

[1] So Illinois Univ, Dept Elect & Comp Engn, Carbondale, IL 62901 USA

Source:

PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING | 2006

Keywords:

Neural Networks; speech recognition; modeling;

DOI:

Not available

Chinese Library Classification (CLC) code:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

This paper uses neural networks to distinguish speech patterns. The feature extractor is a standard Linear Predictive Coding (LPC) cepstrum coder that converts the incoming speech signal, captured through a Matlab interface, into LPC cepstrum feature space. A neural network maps each variable-length LPC trajectory of an isolated word to a fixed-length trajectory, producing the fixed-length feature vector that is fed into a recognizer. The recognizer is a feed-forward (FF) network trained with back propagation (BP); it was tested with varying hidden-layer configurations and with hyperbolic tangent and sigmoid transfer functions to evaluate recognition of the feature vectors of isolated words. The feature vector was normalized and decorrelated by pruning techniques. Training uses momentum to seek the global minimum of the error surface while avoiding oscillation around local minima. The goal is to correctly identify, 100% of the time, a randomly chosen speech pattern from samples of four different speakers uttering the same phrase, and to verify the effectiveness of neural networks as a valid method for pattern recognition.
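The recognizer training described in the abstract can be illustrated with a minimal sketch: a feed-forward network with a hyperbolic tangent hidden layer and a sigmoid output layer, trained by back propagation with a momentum term. This is not the author's implementation (the abstract mentions a Matlab interface); the layer size, learning rate, momentum coefficient, 12-dimensional feature vectors, and the synthetic four-speaker data below are all assumptions chosen for illustration. Momentum damps oscillation of the weight updates; it does not by itself guarantee reaching a global minimum.

```python
import numpy as np


def make_toy_data(n_speakers=4, n_features=12, per_speaker=20, seed=0):
    """Synthetic stand-in for fixed-length cepstral feature vectors (assumption)."""
    rng = np.random.default_rng(seed)
    centers = rng.normal(0.0, 1.0, (n_speakers, n_features))
    labels = np.repeat(np.arange(n_speakers), per_speaker)
    X = centers[labels] + rng.normal(0.0, 0.1, (labels.size, n_features))
    y = np.eye(n_speakers)[labels]  # one-hot targets, one column per speaker
    return X, y, labels


def forward(params, X):
    """Forward pass: tanh hidden layer, sigmoid output layer."""
    W1, b1, W2, b2 = params
    h = np.tanh(X @ W1 + b1)
    out = 1.0 / (1.0 + np.exp(-(h @ W2 + b2)))
    return h, out


def train_momentum(X, y, hidden=8, lr=0.1, momentum=0.9, epochs=500, seed=0):
    """Back propagation on mean squared error with a momentum term."""
    rng = np.random.default_rng(seed)
    params = [
        rng.normal(0.0, 0.5, (X.shape[1], hidden)),
        np.zeros(hidden),
        rng.normal(0.0, 0.5, (hidden, y.shape[1])),
        np.zeros(y.shape[1]),
    ]
    velocity = [np.zeros_like(p) for p in params]
    losses = []
    for _ in range(epochs):
        h, out = forward(params, X)
        err = out - y
        losses.append(float(np.mean(err ** 2)))
        # Gradients of the mean squared error through sigmoid, then tanh.
        d_out = err * out * (1.0 - out) * (2.0 / err.size)
        d_h = (d_out @ params[2].T) * (1.0 - h ** 2)
        grads = [X.T @ d_h, d_h.sum(axis=0), h.T @ d_out, d_out.sum(axis=0)]
        for p, v, g in zip(params, velocity, grads):
            v *= momentum   # keep a fraction of the previous update
            v -= lr * g     # add the current gradient step
            p += v          # apply the combined update in place
    return params, losses


def predict(params, X):
    """Return the index of the most active output unit for each row of X."""
    _, out = forward(params, X)
    return out.argmax(axis=1)
```

Calling `make_toy_data()` and passing the resulting `X` and `y` to `train_momentum` yields the trained parameters and the per-epoch loss history; `predict` then returns one speaker index per feature vector.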


Pages: 414-419

Page count: 6

