Robust speaker detection using Neural Networks

Authors
Shell, John R. [1]
Affiliation
[1] So Illinois Univ, Dept Elect & Comp Engn, Carbondale, IL 62901 USA
Source
PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING | 2006
Keywords
Neural Networks; speech recognition; modeling
DOI
Not available
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
This paper uses neural networks to distinguish speech patterns. The feature extractor is a standard Linear Predictive Coding (LPC) cepstrum coder that converts the incoming speech signal, captured through a Matlab interface, into LPC cepstrum feature space. A neural network converts each variable-length LPC trajectory of an isolated word into a fixed-length LPC trajectory, producing the fixed-length feature vector that is fed into a recognizer. The recognizer is a feed-forward (FF) network trained with back propagation (BP); it was tested with a varying number of hidden layers and with hyperbolic tangent and sigmoid transfer functions to evaluate recognition of the feature vectors of isolated words. The feature vector was normalized and decorrelated by pruning techniques. Training uses momentum to seek the global minimum of the error surface while avoiding oscillation in local minima. The goal of the work is to correctly identify, 100% of the time, a randomly chosen speech pattern from the samples of four different speakers uttering the same phrase, and to verify the effectiveness of neural networks as a valid method for pattern recognition.
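
The LPC cepstrum features named in the abstract can be illustrated with the standard recursion that maps LPC predictor coefficients to cepstral coefficients. The following is a minimal sketch, not the paper's implementation (which the abstract says was built around a Matlab interface). The function name `lpc_to_cepstrum`, the sign convention H(z) = 1 / (1 - sum_k a_k z^-k), and the omission of the gain term c_0 are all assumptions made for illustration.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC predictor coefficients to cepstral coefficients c_1..c_n_ceps.

    Assumed convention (hypothetical, not taken from the paper):
    all-pole model H(z) = 1 / (1 - sum_{k=1}^{p} a[k-1] * z^-k).
    The gain-dependent term c_0 is omitted.
    """
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] is a placeholder and is never used
    for n in range(1, n_ceps + 1):
        # The direct term a_n exists only while n <= p.
        acc = a[n - 1] if n <= p else 0.0
        # Recursive term: sum of (k/n) * c_k * a_{n-k} over all k with 1 <= n-k <= p.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]


if __name__ == "__main__":
    # Single-pole example: for a_1 = 0.5 the closed form is c_n = 0.5**n / n.
    print(lpc_to_cepstrum([0.5], 4))
```

For a single-pole model with a_1 = 0.5, the recursion reproduces the closed form c_n = 0.5^n / n, which gives a quick sanity check. In a pipeline like the one the abstract describes, one such cepstral vector would be computed per analysis frame, giving the variable-length trajectory that is later mapped to a fixed length.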
Pages: 414-419
Page count: 6