SPEECH RECOGNITION BY AN ARTIFICIAL NEURAL NETWORK USING FINDINGS ON THE AFFERENT AUDITORY-SYSTEM

被引：13

作者：

KUROGI, S

机构：

[1] Division of Control Engineering, Kyushu Institute of Technology, Kitakyushu, 804, Sensuicho, Tobata

来源：

BIOLOGICAL CYBERNETICS | 1991年 / 64卷 / 03期

关键词：

D O I：

10.1007/BF00201985

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

An artificial neural network which uses anatomical and physiological findings on the afferent pathway from the ear to the cortex is presented and the roles of the constituent functions in recognition of continuous speech are examined. The network deals with successive spectra of speech sounds by a cascade of several neural layers: lateral excitation layer (LEL), lateral inhibition layer (LIL), and a pile of feature detection layers (FDL's). These layers are shown to be effective for recognizing spoken words. Namely, first, LEL reduces the distortion of sound spectrum caused by the pitch of speech sounds. Next, LIL emphasizes the major energy peaks of sound spectrum, the formants. Last, FDL's detect syllables and words in successive formants, where two functions, time-delay and strong adaptation, play important roles: time-delay makes it possible to retain the pattern of formant changes for a period to detect spoken words successively; strong adaptation contributes to removing the time-warp of formant changes. Digital computer simulations show that the network detect isolated syllables, isolated words, and connected words in continuous speech, while reproducing the fundamental responses found in the auditory system such as ON, OFF, ON-OFF, and SUSTAINED patterns.

引用

页码：243 / 249

页数：7

共 50 条

[1] Speech Recognition using Artificial Neural Network
Gupta, Arpita
Joshi, Akshay
PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 68 - 71
[2] Isolated Assamese Speech Recognition Using Artificial Neural Network
Medhi, Bhargab
Talukdar, P. H.
2015 INTERNATIONAL SYMPOSIUM ON ADVANCED COMPUTING AND COMMUNICATION (ISACC), 2015, : 141 - 148
[3] AFFERENT REGULATION OF NEURONS IN THE BRAIN-STEM AUDITORY-SYSTEM
RUBEL, EW
HYSON, RL
DURHAM, D
JOURNAL OF NEUROBIOLOGY, 1990, 21 (01): : 169 - &
[4] Speech recognition using artificial neural networks
Lim, CP
Woo, SC
Loh, AS
Osman, R
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL I, 2000, : 419 - 423
[5] Speech recognition using elman artificial neural network and linear predictive coding
Khajehasani S.
Dehyadegari L.
Recent Advances in Computer Science and Communications, 2020, 13 (04) : 650 - 656
[6] Text Dependent Speaker Identification and Speech Recognition Using Artificial Neural Network
Swamy, Suma
Shalini, T.
Nagabhushan, Sindhu P.
Nawaz, Sumaiah
Ramakrishnan, K. V.
GLOBAL TRENDS IN COMPUTING AND COMMUNICATION SYSTEMS, PT 1, 2012, 269 : 160 - +
[7] Auditory-based wavelet packet filterbank for speech recognition using neural network
Gandhiraj, R.
Sathidevi, P. S.
ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 666 - +
[8] Artificial Auditory Perception Pattern Recognition System Based on Spatiotemporal Convolutional Neural Network
Fang, Xia
Fang, Han
Feng, Zhan
Wang, Jie
Zhou, Libin
APPLIED SCIENCES-BASEL, 2020, 10 (01):
[9] DISCRIMINATION OF NEURAL CODING PARAMETERS IN AUDITORY-SYSTEM
SANDERSON, AC
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1975, 5 (05): : 533 - 542
[10] DIGITAL ENCODING OF SPEECH BASED ON PROPERTIES OF THE AUDITORY-SYSTEM
KRASNER, MA
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S100 - S100

← 1 2 3 4 5 →