SPEECH RECOGNITION BY AN ARTIFICIAL NEURAL NETWORK USING FINDINGS ON THE AFFERENT AUDITORY-SYSTEM

被引:13
作者
KUROGI, S
机构
[1] Division of Control Engineering, Kyushu Institute of Technology, Kitakyushu, 804, Sensuicho, Tobata
关键词
D O I
10.1007/BF00201985
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An artificial neural network which uses anatomical and physiological findings on the afferent pathway from the ear to the cortex is presented and the roles of the constituent functions in recognition of continuous speech are examined. The network deals with successive spectra of speech sounds by a cascade of several neural layers: lateral excitation layer (LEL), lateral inhibition layer (LIL), and a pile of feature detection layers (FDL's). These layers are shown to be effective for recognizing spoken words. Namely, first, LEL reduces the distortion of sound spectrum caused by the pitch of speech sounds. Next, LIL emphasizes the major energy peaks of sound spectrum, the formants. Last, FDL's detect syllables and words in successive formants, where two functions, time-delay and strong adaptation, play important roles: time-delay makes it possible to retain the pattern of formant changes for a period to detect spoken words successively; strong adaptation contributes to removing the time-warp of formant changes. Digital computer simulations show that the network detect isolated syllables, isolated words, and connected words in continuous speech, while reproducing the fundamental responses found in the auditory system such as ON, OFF, ON-OFF, and SUSTAINED patterns.
引用
收藏
页码:243 / 249
页数:7
相关论文
共 50 条
[21]   A NEURAL NET FOR PROCESSING OF STATIONARY SIGNALS IN THE AUDITORY-SYSTEM [J].
BERTHOMMIER, F ;
DEMONGEOT, J ;
SCHWARTZ, JL .
FIRST IEE INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1989, :287-291
[22]   RESPONSES TO SPEECH SIGNALS IN THE NORMAL AND PATHOLOGICAL PERIPHERAL AUDITORY-SYSTEM [J].
PALMER, AR ;
MOORJANI, PA .
PROGRESS IN BRAIN RESEARCH, 1993, 97 :107-115
[23]   A novel system for effective speech recognition based on artificial neural network and opposition artificial bee colony algorithm [J].
Shilpi Shukla ;
Madhu Jain .
International Journal of Speech Technology, 2019, 22 :959-969
[24]   A novel system for effective speech recognition based on artificial neural network and opposition artificial bee colony algorithm [J].
Shukla, Shilpi ;
Jain, Madhu .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (04) :959-969
[25]   Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network [J].
Dua, Sakshi ;
Kumar, Sethuraman Sambath ;
Albagory, Yasser ;
Ramalingam, Rajakumar ;
Dumka, Ankur ;
Singh, Rajesh ;
Rashid, Mamoon ;
Gehlot, Anita ;
Alshamrani, Sultan S. ;
AlGhamdi, Ahmed Saeed .
APPLIED SCIENCES-BASEL, 2022, 12 (12)
[26]   Pain recognition using artificial neural network [J].
Monwar, Md. Maruf ;
Rezaei, Siamak .
2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, :28-+
[27]   A Speech Recognition System Based on Fuzzy Neural Network trained by Artificial Bee Colony Algorithm [J].
Ning, Aiping ;
Zhang, Xueying .
2011 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND CONTROL (ICECC), 2011, :2488-2491
[28]   Audio-Visual Speech Recognition System Using Recurrent Neural Network [J].
Goh, Yeh-Huann ;
Lau, Kai-Xian ;
Lee, Yoon-Ket .
PROCEEDINGS OF THE 2019 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (INCIT): ENCOMPASSING INTELLIGENT TECHNOLOGY AND INNOVATION TOWARDS THE NEW ERA OF HUMAN LIFE, 2019, :38-43
[29]   A multi-lingual speech recognition system using a neural network approach [J].
Chen, OTC ;
Chen, CY ;
Chang, HT ;
Hsu, FR ;
Yang, HL ;
Lee, YG .
ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, :1576-1581
[30]   Hierarchical Expert Neural Network System for Speech Recognition [J].
Rocha, Priscila ;
Silva, Washington ;
Barros, Allan .
JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2019, 30 (03) :347-359