Endpoint detection of speech signal using neural network

被引:0
作者
Hussain, A [1 ]
Samad, SA [1 ]
Fah, LB [1 ]
机构
[1] Univ Kebangsaan Malaysia, Fac Engn, Dept Elect Elect & Syst Engn, Multimedia Signal Proc Res Grp, Bangi 43600, Malaysia
来源
IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM | 2000年
关键词
speech segmentation; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper highlights the artificial neural network (ANN) approach to perform the endpoint detection process, which involves the segmentation of speech signals from non-speech signals. Two ANN models have been proposed to perform endpoint detections of isolated digit utterances spoken in the Malay Language: Multilayer Perceptron (MLP) and Adaptive Linear Network (ADALINE). Results obtained from the ANN models are acoustically verified, visually checked and compared to the conventional method of endpoint detection. It was found that the endpoint detection accuracy using the MLP approach is very high and encouraging.
引用
收藏
页码:271 / 274
页数:4
相关论文
共 50 条
  • [31] A Study on Speech Recognition by a Neural Network Based on English Speech Feature Parameters
    Mao, Congmin
    Liu, Sujing
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (03) : 679 - 684
  • [32] Voice Activity Detection Using Speech Recognizer Feedback
    Thambiratnam, Kit
    Zhu, Weiwu
    Seide, Frank
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1490 - 1493
  • [33] Robust speech recognition using time boundary detection
    Mohajer, K
    Hu, ZM
    [J]. MULTISENSOR, MULTISOURCE INFORMATION FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2003, 2003, 5099 : 335 - 343
  • [34] SPEECH RECOGNITION USING NEURAL NETWORKS
    Kumar, T. Lalith
    Kumar, T. Kishore
    Rajan, K. Soundar
    [J]. PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2009, : 248 - +
  • [35] Bayesian Neural Network Language Modeling for Speech Recognition
    Xue, Boyang
    Hu, Shoukang
    Xu, Junhao
    Geng, Mengzhe
    Liu, Xunying
    Meng, Helen
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2900 - 2917
  • [36] An on-line adaptive neural network for speech recognition
    Zhang L.-P.
    Li L.M.
    Chi Z.
    [J]. International Journal of Speech Technology, 1998, 2 (3) : 241 - 248
  • [37] A new Dynamic Synapse Neural Network for speech recognition
    Namarvar, HH
    Liaw, JS
    Berger, TW
    [J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 2985 - 2990
  • [38] A NETWORK OF DEEP NEURAL NETWORKS FOR DISTANT SPEECH RECOGNITION
    Ravanelli, Mirco
    Brakel, Philemon
    Omologo, Maurizio
    Bengio, Yoshua
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4880 - 4884
  • [39] A Fuzzy Neural Network Applied in the Speech Recognition System
    Zhang, Xueying
    Wang, Peng
    Li, Gaoyun
    Hou, Wenjun
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2008, : 14 - +
  • [40] The spiking neural network based on fMRI for speech recognition
    Song, Yihua
    Guo, Lei
    Man, Menghua
    Wu, Youxi
    [J]. PATTERN RECOGNITION, 2024, 155