SPEECH RECOGNITION BY AN ARTIFICIAL NEURAL NETWORK USING FINDINGS ON THE AFFERENT AUDITORY-SYSTEM

Cited by: 13

Author:

KUROGI, S

Affiliation:

[1] Division of Control Engineering, Kyushu Institute of Technology, Kitakyushu, 804, Sensuicho, Tobata

Source:

BIOLOGICAL CYBERNETICS | 1991 / Vol. 64 / Issue 03

DOI:

10.1007/BF00201985

Chinese Library Classification number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

An artificial neural network which uses anatomical and physiological findings on the afferent pathway from the ear to the cortex is presented and the roles of the constituent functions in recognition of continuous speech are examined. The network deals with successive spectra of speech sounds by a cascade of several neural layers: lateral excitation layer (LEL), lateral inhibition layer (LIL), and a pile of feature detection layers (FDL's). These layers are shown to be effective for recognizing spoken words. Namely, first, LEL reduces the distortion of sound spectrum caused by the pitch of speech sounds. Next, LIL emphasizes the major energy peaks of sound spectrum, the formants. Last, FDL's detect syllables and words in successive formants, where two functions, time-delay and strong adaptation, play important roles: time-delay makes it possible to retain the pattern of formant changes for a period to detect spoken words successively; strong adaptation contributes to removing the time-warp of formant changes. Digital computer simulations show that the network detects isolated syllables, isolated words, and connected words in continuous speech, while reproducing the fundamental responses found in the auditory system such as ON, OFF, ON-OFF, and SUSTAINED patterns.
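The first two stages described in the abstract can be illustrated with a toy sketch. This is not the model from the paper: the abstract gives no equations, so the kernel shapes, the neighbourhood widths, the rectification step, and all function names below are assumptions chosen only to show the general idea of smoothing pitch-related ripple (LEL) and then emphasizing spectral peaks (LIL).

```python
# Illustrative sketch only: kernels and widths are assumptions, not values from the paper.

def weighted_neighbourhood_sum(values, kernel):
    """Apply an odd-length, symmetric kernel across `values` (zeros assumed outside the array)."""
    half = len(kernel) // 2
    result = []
    for i in range(len(values)):
        total = 0.0
        for k, weight in enumerate(kernel):
            j = i + k - half
            if 0 <= j < len(values):
                total += weight * values[j]
        result.append(total)
    return result


def lateral_excitation(spectrum):
    """Each unit pools its own input with its two nearest neighbours, smoothing harmonic ripple."""
    return weighted_neighbourhood_sum(spectrum, [0.25, 0.5, 0.25])


def lateral_inhibition(spectrum, surround=2, strength=1.0):
    """Each unit is suppressed by the mean of its 2*surround nearest neighbours, then rectified."""
    kernel = [-strength / (2 * surround)] * (2 * surround + 1)
    kernel[surround] = 1.0  # the centre unit passes its own input unchanged
    return [max(0.0, v) for v in weighted_neighbourhood_sum(spectrum, kernel)]


# Toy spectrum: harmonics on every other bin under an envelope peaking at bin 5.
spectrum = [0, 1, 0, 3, 0, 6, 0, 3, 0, 1, 0]
smoothed = lateral_excitation(spectrum)
sharpened = lateral_inhibition(smoothed)
peak_bin = max(range(len(sharpened)), key=sharpened.__getitem__)
print(peak_bin, sharpened)
```

On this toy input the excitation step fills in the gaps between harmonics, and after the inhibition step only bins 4 to 6 remain non-zero, with the maximum at bin 5, the envelope peak standing in for a formant.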

Pages: 243-249

Number of pages: 7

50 entries in total

[31] A Fuzzy Neural Network Applied in the Speech Recognition System [J].

Zhang, Xueying ;

Wang, Peng ;

Li, Gaoyun ;

Hou, Wenjun .

ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2008, :14-+

[32] BILINGUAL SPEECH RECOGNITION SYSTEM FOR ISOLATED WORDS USING DEEP NEURAL NETWORK [J].

Bharathi, B. ;

Kavitha, S. ;

Sugapriya, S. .

2018 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, AND SIGNAL PROCESSING (ICCCSP): SPECIAL FOCUS ON TECHNOLOGY AND INNOVATION FOR SMART ENVIRONMENT, 2018, :78-81

[33] Automatic Gender Recognition using Linear Prediction Coefficients and Artificial Neural Network on Speech Signal [J].

Yusnita, M. A. ;

Hafiz, A. M. ;

Fadzilah, Nor M. ;

Zulhanip, Aida Zulia ;

Idris, Mohaiyedin .

2017 7TH IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE), 2017, :372-377

[34] Speech Controlled Robotics using Artificial Neural Network [J].

Joshi, Neha ;

Kumar, Anil ;

Chakraborty, Pavan ;

Kala, Rahul .

2015 THIRD INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2015, :526-530

[35] Artificial neural network based method for handwriting recognition to speech generation [J].

Magoules, Frederic ;

Marquevielle, Vincent ;

Dutilleul, Pierre-Arnaud .

JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2009, 3 (01) :45-58

[36] Artificial Neural Network for Arabic Speech Recognition in Humanoid Robotic Systems [J].

Al-Abdullah, A. ;

Al-Ajmi, A. ;

Al-Mutairi, A. ;

Al-Mousa, N. ;

Al-Daihani, S. ;

Karar, A. S. ;

Alkork, S. .

2019 3RD INTERNATIONAL CONFERENCE ON BIO-ENGINEERING FOR SMART TECHNOLOGIES (BIOSMART), 2019,

[37] Artificial Neural Network based Emotion Classification and Recognition from Speech [J].

Iqbal, Mudasser ;

Raza, Syed Ali ;

Abid, Muhammad ;

Majeed, Furgan ;

Hussain, Ans Ali .

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (12) :434-444

[38] CENTRAL AUDITORY-SYSTEM PLASTICITY ASSOCIATED WITH SPEECH-DISCRIMINATION TRAINING [J].

KRAUS, N ;

MCGEE, T ;

CARRELL, TD ;

KING, C ;

TREMBLAY, K ;

NICOL, T .

JOURNAL OF COGNITIVE NEUROSCIENCE, 1995, 7 (01) :25-32

[39] Speech Recognition from PSD using Neural Network [J].

Saheli, Amin Ashouri ;

Abdali, Gholam Ali ;

Suratgar, Amir Abolfazl .

IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, :174-+

[40] Speech recognition using pulse coupled neural network [J].

Chandrasekaran, P ;

Bodruzzaman, M ;

Yuen, G ;

Malkani, M .

THIRTIETH SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY (SSST), 1998, :515-519