Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables

被引:0
|
作者
Oropeza Rodriguez, Jose Luis [1 ]
Suarez Guerra, Sergio [1 ]
机构
[1] IPN, Ctr Invest Comp, Av Juan de Dios Batiz S-N Esq, Mexico City 07738, DF, Mexico
来源
COMPUTACION Y SISTEMAS | 2006年 / 9卷 / 03期
关键词
Speech recognition; Syllables recognition; Expert System; Speech processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work examines the results of incorporating into Automatic Speech Recognition the syllable units for the Spanish language. Because of the boundaries between phonemes-like units its often difficult to elicit them; the use of these has not reached a good performance in Automatic Speech Recognition. In the course of the developing the experiments three approaches for the segmentation task were examined: a) the using of the Short Term Total Energy Function, b) the Energy Function of the Cepstral High Frequency (named ERO parameter), and c) a Knowledge Based System. They represent the most important contributions of this work; they showed good results for the Continuous and Discontinuous speech corpus developed in laboratory. The Knowledge Based System and Short Term Total Energy Function were used in a digit corpus where the results achieved using Short Term Total Energy Function alone reached 90.58% recognition rate. When Short Term Total Energy Function and RO parameters were used a 94.70% recognition rate was achieved. Otherwise, in the continuous speech corpus created in the laboratory the results achieved a 78.5% recognition rate using Short Term Total Energy Function and Knowledge Based System, and 80.5% recognition rate using the three approaches mentioned above. The bigram model language and Continuous Density Hidden Markov Models with three and five states incorporating three Gaussian Mixtures for state were implemented. By further including a major number of digital filters and Artificial Intelligent techniques in the training and recognition stages respectively the results can be improved even more. This research showed the potential of the syllabic unit paradigm for the Automatic Speech Recognition for the Spanish language. Finally, the inference rules in the Knowledge Based System associated with rules for splitting words in syllables in the cited language were created.
引用
收藏
页码:270 / 286
页数:17
相关论文
共 50 条
  • [31] Exploring Alternative Data Augmentation Methods in Dysarthric Automatic Speech Recognition
    Gracelli, Ricardo
    Almeida, Jurandy
    2024 IEEE 37TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS 2024, 2024, : 243 - 248
  • [32] Depression Detection in Arabic Using Speech Language Recognition
    Alsharif, Zainab
    Elhag, Salma
    Alfakeh, Sulhi
    2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 61 - 66
  • [33] English Language Speech Recognition using MFCC and HMM
    Naithani, Kanchan
    Thakkar, V. M.
    Semwal, Ashish
    2018 IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN INTELLIGENT AND COMPUTING IN ENGINEERING (RICE III), 2018,
  • [34] Speech Recognition and System Controlling using Hindi Language
    Rathor, Sandeep
    Jadon, R. S.
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [35] Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language
    Le, Viet-Bac
    Besacier, Laurent
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (08): : 1471 - 1482
  • [36] DISCRIMINATIVE APPROACH TO LEXICAL ENTRY SELECTION FOR AUTOMATIC SPEECH RECOGNITION OF AGGLUTINATIVE LANGUAGE
    Ablimit, Mijit
    Kawahara, Tatsuya
    Hamdulla, Askar
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5009 - 5012
  • [37] Lexicon optimization based on discriminative learning for automatic speech recognition of agglutinative language
    Ablimit, Mijit
    Kawahara, Tatsuya
    Hamdulla, Askar
    SPEECH COMMUNICATION, 2014, 60 : 78 - 87
  • [38] Automatic speech recognition: a survey
    Mishaim Malik
    Muhammad Kamran Malik
    Khawar Mehmood
    Imran Makhdoom
    Multimedia Tools and Applications, 2021, 80 : 9411 - 9457
  • [39] Automatic testing of speech recognition
    Francart, Tom
    Moonen, Marc
    Wouters, Jan
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2009, 48 (02) : 80 - 90
  • [40] Turbo Automatic Speech Recognition
    Receveur, Simon
    Weiss, Robin
    Fingscheidt, Tim
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 846 - 862