Framework for choosing a set of syllables and phonemes for Lithuanian speech recognition

被引:0
|
作者
Laurinciukaite, Sigita [1 ]
Lipeika, Antanas [1 ]
机构
[1] Inst Math & Informat, Recognit Proc Dept, LT-01108 Vilnius, Lithuania
关键词
speech recognition; framework for formation of set of syllables and phonemes;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a framework for making up a set of syllables and phonemes that subsequently is used in the creation of acoustic models for continuous speech recognition of Lithuanian. The target is to discover a set of syllables and phonemes that is of utmost importance in speech recognition. This framework includes operations with lexicon, and transcriptions of records. To facilitate this work, additional programs have been developed that perform word syllabification, lexicon adjustment, etc. Series of experiments were done in order to establish the framework and model syllable- and phoneme-based speech recognition. Dominance of a syllable in lexicon has improved speech recognition results and encouraged us to move away from a strict definition of syllable, i.e., a syllable becomes a simple sub-word unit derived from a syllable. Two sets of syllables and phonemes and two types of lexicons have been developed and tested. The best recognition accuracy achieved 56.67% +/- 0.33. The speech recognition system is based on Hidden Markov Models (HMM). The continuous speech corpus LRN0 was used for the speech recognition experiments.
引用
收藏
页码:395 / 406
页数:12
相关论文
共 50 条
  • [21] K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables
    Kim, Jounghee
    Kang, Pilsung
    INTERSPEECH 2022, 2022, : 4945 - 4949
  • [22] Evaluation of a set of new ORF kernel functions of SVM for speech recognition
    Zhang, Xueying
    Liu, Xiaofeng
    Wang, Zizhong John
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (10) : 2574 - 2580
  • [23] Taylor-DBN: A new framework for speech recognition systems
    Haridas, Arul Valiyavalappil
    Marimuthu, Ramalatha
    Sivakumar, V. G.
    Chakraborty, Basabi
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2021, 19 (02)
  • [24] A hierarchical duration model for speech recognition based on the ANGIE framework
    Chung, GY
    Seneff, S
    SPEECH COMMUNICATION, 1999, 27 (02) : 113 - 134
  • [25] A MODEL STRUCTURE INTEGRATION BASED ON A BAYESIAN FRAMEWORK FOR SPEECH RECOGNITION
    Shiota, Sayaka
    Hashimoto, Kei
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4813 - 4816
  • [26] Continuous Automatic Speech Recognition System using MapReduce Framework
    Vikram, M.
    Reddy, N. Sudhakar
    Madhavi, K.
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 80 - 83
  • [27] Associative memory framework for speech recognition: Adaptation of Hopfield network
    Vaishnavi, Y.
    Shreyas, R.
    Suhas, S.
    Surya, U. N.
    Ladwani, Vandana M.
    Ramasubramanian, V.
    2016 IEEE ANNUAL INDIA CONFERENCE (INDICON), 2016,
  • [28] A Bayesian Framework Using Multiple Model Structures for Speech Recognition
    Shiota, Sayaka
    Hashimoto, Kei
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 939 - 948
  • [29] A methodology for speech data analysis and a framework for adaptive speech recognition using fuzzy neural networks
    Kasabov, N
    Kozma, R
    Kilgour, R
    Laws, M
    Taylor, J
    Watts, M
    Gray, A
    PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1055 - 1060
  • [30] Lithuanian Speech Corpus Liepa for Development of Human-Computer Interfaces Working in Voice Recognition and Synthesis Mode
    Laurinciukaite, Sigita
    Telksnys, Laimutis
    Kasparaitis, Pijus
    Kliukiene, Regina
    Paukstyte, Vilma
    INFORMATICA, 2018, 29 (03) : 487 - 498