TONE RECOGNITION OF CONTINUOUS MANDARIN SPEECH BASED ON NEURAL NETWORKS

Authors
CHEN, SH
WANG, YR
Affiliation
[1] Natl Chiao Tung Univ, Taiwan
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995 / Vol. 3 / No. 2
Keywords
Sponsor: National Science Council (NSC); Sponsor: Ministry of Transportation and Communications (MOTC)
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer that uses features extracted from the syllable being processed is introduced first. Features extracted from neighboring syllables are then added to compensate for the coarticulation effect. The recognizer is further improved to compensate for the sandhi rules of tone pronunciation by including tone information from neighboring syllables. The recognition criterion then becomes finding the tone sequence that minimizes a total risk that simultaneously considers the tone recognition of all syllables in the input utterance. Finally, two approaches, using HCNN and HSMLP respectively, are proposed that model the intonation pattern as a hidden Markov chain to assist tone recognition. The effectiveness of these schemes was confirmed by simulations on a speaker-independent tone recognition task. A recognition rate of 86.72% was achieved.
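The sequence-level criterion mentioned in the abstract can be illustrated with a small dynamic-programming sketch. This is not the paper's formulation: it assumes the total risk decomposes into a per-syllable cost plus a cost for each pair of adjacent tones, and the names `best_tone_sequence`, `local_cost`, and `pair_cost` are hypothetical, introduced only for illustration.

```python
from typing import List, Sequence


def best_tone_sequence(
    local_cost: Sequence[Sequence[float]],
    pair_cost: Sequence[Sequence[float]],
) -> List[int]:
    """Return the tone sequence with minimum total cost.

    Assumption (not from the paper): the total cost is the sum of
    local_cost[t][k] for choosing tone k at syllable t, plus
    pair_cost[j][k] for tone j being followed by tone k.
    """
    if not local_cost:
        return []
    n_tones = len(pair_cost)

    # best[k]: minimum cost of any sequence ending in tone k so far.
    best = list(local_cost[0])
    back = []  # back[t][k]: best previous tone when syllable t+1 has tone k

    for costs in local_cost[1:]:
        new_best, pointers = [], []
        for k in range(n_tones):
            prev = min(range(n_tones), key=lambda j: best[j] + pair_cost[j][k])
            pointers.append(prev)
            new_best.append(best[prev] + pair_cost[prev][k] + costs[k])
        best = new_best
        back.append(pointers)

    # Trace back from the cheapest final tone.
    tone = min(range(n_tones), key=lambda k: best[k])
    path = [tone]
    for pointers in reversed(back):
        tone = pointers[tone]
        path.append(tone)
    path.reverse()
    return path


# Toy example with two tone classes and three syllables (made-up numbers).
local = [[0.1, 1.0], [0.6, 0.5], [0.1, 1.0]]
no_context = [[0.0, 0.0], [0.0, 0.0]]
penalize_change = [[0.0, 1.0], [1.0, 0.0]]

independent = best_tone_sequence(local, no_context)
joint = best_tone_sequence(local, penalize_change)
```

With zero pairwise cost the decoder reduces to choosing the cheapest tone for each syllable independently; a nonzero pairwise cost lets context from neighboring syllables override a weak local decision, which is the intuition behind recognizing the whole sequence jointly.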
Pages: 146-150
Page count: 5