SPEECH RECOGNITION BY AN ARTIFICIAL NEURAL NETWORK USING FINDINGS ON THE AFFERENT AUDITORY-SYSTEM

被引:13
|
作者
KUROGI, S
机构
[1] Division of Control Engineering, Kyushu Institute of Technology, Kitakyushu, 804, Sensuicho, Tobata
关键词
D O I
10.1007/BF00201985
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An artificial neural network which uses anatomical and physiological findings on the afferent pathway from the ear to the cortex is presented and the roles of the constituent functions in recognition of continuous speech are examined. The network deals with successive spectra of speech sounds by a cascade of several neural layers: lateral excitation layer (LEL), lateral inhibition layer (LIL), and a pile of feature detection layers (FDL's). These layers are shown to be effective for recognizing spoken words. Namely, first, LEL reduces the distortion of sound spectrum caused by the pitch of speech sounds. Next, LIL emphasizes the major energy peaks of sound spectrum, the formants. Last, FDL's detect syllables and words in successive formants, where two functions, time-delay and strong adaptation, play important roles: time-delay makes it possible to retain the pattern of formant changes for a period to detect spoken words successively; strong adaptation contributes to removing the time-warp of formant changes. Digital computer simulations show that the network detect isolated syllables, isolated words, and connected words in continuous speech, while reproducing the fundamental responses found in the auditory system such as ON, OFF, ON-OFF, and SUSTAINED patterns.
引用
收藏
页码:243 / 249
页数:7
相关论文
共 50 条
  • [31] A Fuzzy Neural Network Applied in the Speech Recognition System
    Zhang, Xueying
    Wang, Peng
    Li, Gaoyun
    Hou, Wenjun
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2008, : 14 - +
  • [32] BILINGUAL SPEECH RECOGNITION SYSTEM FOR ISOLATED WORDS USING DEEP NEURAL NETWORK
    Bharathi, B.
    Kavitha, S.
    Sugapriya, S.
    2018 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, AND SIGNAL PROCESSING (ICCCSP): SPECIAL FOCUS ON TECHNOLOGY AND INNOVATION FOR SMART ENVIRONMENT, 2018, : 78 - 81
  • [33] Speech Controlled Robotics using Artificial Neural Network
    Joshi, Neha
    Kumar, Anil
    Chakraborty, Pavan
    Kala, Rahul
    2015 THIRD INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2015, : 526 - 530
  • [34] Automatic Gender Recognition using Linear Prediction Coefficients and Artificial Neural Network on Speech Signal
    Yusnita, M. A.
    Hafiz, A. M.
    Fadzilah, Nor M.
    Zulhanip, Aida Zulia
    Idris, Mohaiyedin
    2017 7TH IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE), 2017, : 372 - 377
  • [35] Artificial neural network based method for handwriting recognition to speech generation
    Magoules, Frederic
    Marquevielle, Vincent
    Dutilleul, Pierre-Arnaud
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2009, 3 (01) : 45 - 58
  • [36] Artificial Neural Network for Arabic Speech Recognition in Humanoid Robotic Systems
    Al-Abdullah, A.
    Al-Ajmi, A.
    Al-Mutairi, A.
    Al-Mousa, N.
    Al-Daihani, S.
    Karar, A. S.
    Alkork, S.
    2019 3RD INTERNATIONAL CONFERENCE ON BIO-ENGINEERING FOR SMART TECHNOLOGIES (BIOSMART), 2019,
  • [37] Artificial Neural Network based Emotion Classification and Recognition from Speech
    Iqbal, Mudasser
    Raza, Syed Ali
    Abid, Muhammad
    Majeed, Furgan
    Hussain, Ans Ali
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (12) : 434 - 444
  • [38] Speech Recognition from PSD using Neural Network
    Saheli, Amin Ashouri
    Abdali, Gholam Ali
    Suratgar, Amir Abolfazl
    IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 174 - +
  • [39] CENTRAL AUDITORY-SYSTEM PLASTICITY ASSOCIATED WITH SPEECH-DISCRIMINATION TRAINING
    KRAUS, N
    MCGEE, T
    CARRELL, TD
    KING, C
    TREMBLAY, K
    NICOL, T
    JOURNAL OF COGNITIVE NEUROSCIENCE, 1995, 7 (01) : 25 - 32
  • [40] Speech recognition using pulse coupled neural network
    Chandrasekaran, P
    Bodruzzaman, M
    Yuen, G
    Malkani, M
    THIRTIETH SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY (SSST), 1998, : 515 - 519