SPEECH RECOGNITION BY AN ARTIFICIAL NEURAL NETWORK USING FINDINGS ON THE AFFERENT AUDITORY-SYSTEM

被引:13
|
作者
KUROGI, S
机构
[1] Division of Control Engineering, Kyushu Institute of Technology, Kitakyushu, 804, Sensuicho, Tobata
关键词
D O I
10.1007/BF00201985
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An artificial neural network which uses anatomical and physiological findings on the afferent pathway from the ear to the cortex is presented and the roles of the constituent functions in recognition of continuous speech are examined. The network deals with successive spectra of speech sounds by a cascade of several neural layers: lateral excitation layer (LEL), lateral inhibition layer (LIL), and a pile of feature detection layers (FDL's). These layers are shown to be effective for recognizing spoken words. Namely, first, LEL reduces the distortion of sound spectrum caused by the pitch of speech sounds. Next, LIL emphasizes the major energy peaks of sound spectrum, the formants. Last, FDL's detect syllables and words in successive formants, where two functions, time-delay and strong adaptation, play important roles: time-delay makes it possible to retain the pattern of formant changes for a period to detect spoken words successively; strong adaptation contributes to removing the time-warp of formant changes. Digital computer simulations show that the network detect isolated syllables, isolated words, and connected words in continuous speech, while reproducing the fundamental responses found in the auditory system such as ON, OFF, ON-OFF, and SUSTAINED patterns.
引用
收藏
页码:243 / 249
页数:7
相关论文
共 50 条
  • [1] Speech Recognition using Artificial Neural Network
    Gupta, Arpita
    Joshi, Akshay
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 68 - 71
  • [2] Isolated Assamese Speech Recognition Using Artificial Neural Network
    Medhi, Bhargab
    Talukdar, P. H.
    2015 INTERNATIONAL SYMPOSIUM ON ADVANCED COMPUTING AND COMMUNICATION (ISACC), 2015, : 141 - 148
  • [3] AFFERENT REGULATION OF NEURONS IN THE BRAIN-STEM AUDITORY-SYSTEM
    RUBEL, EW
    HYSON, RL
    DURHAM, D
    JOURNAL OF NEUROBIOLOGY, 1990, 21 (01): : 169 - &
  • [4] Speech recognition using artificial neural networks
    Lim, CP
    Woo, SC
    Loh, AS
    Osman, R
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL I, 2000, : 419 - 423
  • [5] Speech recognition using elman artificial neural network and linear predictive coding
    Khajehasani S.
    Dehyadegari L.
    Recent Advances in Computer Science and Communications, 2020, 13 (04) : 650 - 656
  • [6] Text Dependent Speaker Identification and Speech Recognition Using Artificial Neural Network
    Swamy, Suma
    Shalini, T.
    Nagabhushan, Sindhu P.
    Nawaz, Sumaiah
    Ramakrishnan, K. V.
    GLOBAL TRENDS IN COMPUTING AND COMMUNICATION SYSTEMS, PT 1, 2012, 269 : 160 - +
  • [7] Auditory-based wavelet packet filterbank for speech recognition using neural network
    Gandhiraj, R.
    Sathidevi, P. S.
    ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 666 - +
  • [8] Artificial Auditory Perception Pattern Recognition System Based on Spatiotemporal Convolutional Neural Network
    Fang, Xia
    Fang, Han
    Feng, Zhan
    Wang, Jie
    Zhou, Libin
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [9] DISCRIMINATION OF NEURAL CODING PARAMETERS IN AUDITORY-SYSTEM
    SANDERSON, AC
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1975, 5 (05): : 533 - 542
  • [10] DIGITAL ENCODING OF SPEECH BASED ON PROPERTIES OF THE AUDITORY-SYSTEM
    KRASNER, MA
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S100 - S100