Endpoint detection of speech signal using neural network

被引:0
作者
Hussain, A [1 ]
Samad, SA [1 ]
Fah, LB [1 ]
机构
[1] Univ Kebangsaan Malaysia, Fac Engn, Dept Elect Elect & Syst Engn, Multimedia Signal Proc Res Grp, Bangi 43600, Malaysia
来源
IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM | 2000年
关键词
speech segmentation; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper highlights the artificial neural network (ANN) approach to perform the endpoint detection process, which involves the segmentation of speech signals from non-speech signals. Two ANN models have been proposed to perform endpoint detections of isolated digit utterances spoken in the Malay Language: Multilayer Perceptron (MLP) and Adaptive Linear Network (ADALINE). Results obtained from the ANN models are acoustically verified, visually checked and compared to the conventional method of endpoint detection. It was found that the endpoint detection accuracy using the MLP approach is very high and encouraging.
引用
收藏
页码:271 / 274
页数:4
相关论文
共 50 条
[41]   Indonesian speech recognition based on Deep Neural Network [J].
Yang, Ruolin ;
Yang, Jian ;
Lu, Yu .
2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, :36-41
[42]   An Optimal Method for Speech Recognition Based on Neural Network [J].
Ishak, Mohamad Khairi ;
Madsen, Dag oivind ;
Al-Zahrani, Fahad Ahmed .
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (02) :1951-1961
[43]   Primi Speech Recognition Based on Deep Neural Network [J].
Hu, Wenjun ;
Fu, Meijun ;
Pan, Wenlin .
2016 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS (IS), 2016, :667-671
[44]   Multiresolution Convolutional Neural Network For Robust Speech Recognition [J].
Naderi, Navid ;
Nasersharif, Babak .
2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, :1459-1464
[45]   Neural Network Phone Duration Model for Speech Recognition [J].
Alumae, Tanel .
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, :1204-1208
[46]   Contextual Speech Recognition in End-to-End Neural Network Systems using Beam Search [J].
Williams, Ian ;
Kannan, Anjuli ;
Aleksci, Petar ;
Rybach, David ;
Sainath, Tara N. .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :2227-2231
[47]   SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping [J].
Kothapally, Vinay ;
Xia, Wei ;
Ghorbani, Shahram ;
Hansen, John H. L. ;
Xue, Wei ;
Huang, Jing .
INTERSPEECH 2020, 2020, :3935-3939
[48]   Recurrent Neural Network Language Model with Part-of-speech for Mandarin Speech Recognition [J].
Gong, Caixia ;
Li, Xiangang ;
Wu, Xihong .
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, :459-463
[49]   End-to-End Speech Endpoint Detection Utilizing Acoustic and Language Modeling Knowledge for Online Low-Latency Speech Recognition [J].
Hwang, Inyoung ;
Chang, Joon-Hyuk .
IEEE ACCESS, 2020, 8 :161109-161123
[50]   A Neural Network Based Regression Approach for Recognizing Simultaneous Speech [J].
Li, Weifeng ;
Kunatani, Kenichi ;
Dines, John ;
Maginad-Doss, Mathew ;
Bourlard, Herve .
MACHINE LEARNING FOR MULTIMODAL INTERACTION, PROCEEDINGS, 2008, 5237 :110-118