Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables

被引:0
|
作者
Oropeza Rodriguez, Jose Luis [1 ]
Suarez Guerra, Sergio [1 ]
机构
[1] IPN, Ctr Invest Comp, Av Juan de Dios Batiz S-N Esq, Mexico City 07738, DF, Mexico
来源
COMPUTACION Y SISTEMAS | 2006年 / 9卷 / 03期
关键词
Speech recognition; Syllables recognition; Expert System; Speech processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work examines the results of incorporating into Automatic Speech Recognition the syllable units for the Spanish language. Because of the boundaries between phonemes-like units its often difficult to elicit them; the use of these has not reached a good performance in Automatic Speech Recognition. In the course of the developing the experiments three approaches for the segmentation task were examined: a) the using of the Short Term Total Energy Function, b) the Energy Function of the Cepstral High Frequency (named ERO parameter), and c) a Knowledge Based System. They represent the most important contributions of this work; they showed good results for the Continuous and Discontinuous speech corpus developed in laboratory. The Knowledge Based System and Short Term Total Energy Function were used in a digit corpus where the results achieved using Short Term Total Energy Function alone reached 90.58% recognition rate. When Short Term Total Energy Function and RO parameters were used a 94.70% recognition rate was achieved. Otherwise, in the continuous speech corpus created in the laboratory the results achieved a 78.5% recognition rate using Short Term Total Energy Function and Knowledge Based System, and 80.5% recognition rate using the three approaches mentioned above. The bigram model language and Continuous Density Hidden Markov Models with three and five states incorporating three Gaussian Mixtures for state were implemented. By further including a major number of digital filters and Artificial Intelligent techniques in the training and recognition stages respectively the results can be improved even more. This research showed the potential of the syllabic unit paradigm for the Automatic Speech Recognition for the Spanish language. Finally, the inference rules in the Knowledge Based System associated with rules for splitting words in syllables in the cited language were created.
引用
收藏
页码:270 / 286
页数:17
相关论文
共 50 条
  • [21] Methods for Rapid Development of Automatic Speech Recognition System for Russian
    Safarik, Radek
    Nouza, Jan
    2015 IEEE INTERNATIONAL WORKSHOP OF ELECTRONICS, CONTROL, MEASUREMENT, SIGNALS AND THEIR APPLICATION TO MECHATRONICS (ECMSM), 2015,
  • [22] Textual Data Selection for Language Modelling in the Scope of Automatic Speech Recognition
    Mezzoudj, Freha
    Langlois, David
    Jouvet, Denis
    Benyettou, Abdelkader
    1ST INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE AND SPEECH PROCESSING, 2018, 128 : 55 - 64
  • [23] K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables
    Kim, Jounghee
    Kang, Pilsung
    INTERSPEECH 2022, 2022, : 4945 - 4949
  • [24] Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations
    Pelemans, Joris
    Vanallemeersch, Tom
    Demuynck, Kris
    Van Hamme, Hugo
    Wambacq, Patrick
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2262 - 2266
  • [25] A survey on automatic speech recognition systems for Portuguese language and its variations
    de Lima, Thales Aguiar
    Da Costa-Abreu, Marjory
    COMPUTER SPEECH AND LANGUAGE, 2020, 62 (62)
  • [26] Web-based possibilistic language models for automatic speech recognition
    Oger, Stanislas
    Linares, Georges
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (04) : 923 - 939
  • [27] COMPUTER-AIDED THERAPY USING AUTOMATIC SPEECH RECOGNITION TECHNIQUE FOR DELAYED LANGUAGE DEVELOPMENT CHILDREN
    Abuelmakarem, Hala S.
    Fawzi, Sahar Ali
    Quriba, Amal
    Elbialy, Ahmed
    Kandil, Ahmed Hisham
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2022, 34 (04):
  • [28] Framework for choosing a set of syllables and phonemes for Lithuanian speech recognition
    Laurinciukaite, Sigita
    Lipeika, Antanas
    INFORMATICA, 2007, 18 (03) : 395 - 406
  • [29] Automatic speech recognition using acoustic doppler signal
    Lee, Ki-Seung
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2016, 35 (01): : 74 - 82
  • [30] Summarization of Spontaneous Speech using Automatic Speech Recognition and a Speech Prosody based Tokenizer
    Szaszak, Gyorgy
    Tundik, Mate Akos
    Beke, Andras
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 221 - 227