Framework for choosing a set of syllables and phonemes for Lithuanian speech recognition

被引:0
作者
Laurinciukaite, Sigita [1 ]
Lipeika, Antanas [1 ]
机构
[1] Inst Math & Informat, Recognit Proc Dept, LT-01108 Vilnius, Lithuania
关键词
speech recognition; framework for formation of set of syllables and phonemes;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a framework for making up a set of syllables and phonemes that subsequently is used in the creation of acoustic models for continuous speech recognition of Lithuanian. The target is to discover a set of syllables and phonemes that is of utmost importance in speech recognition. This framework includes operations with lexicon, and transcriptions of records. To facilitate this work, additional programs have been developed that perform word syllabification, lexicon adjustment, etc. Series of experiments were done in order to establish the framework and model syllable- and phoneme-based speech recognition. Dominance of a syllable in lexicon has improved speech recognition results and encouraged us to move away from a strict definition of syllable, i.e., a syllable becomes a simple sub-word unit derived from a syllable. Two sets of syllables and phonemes and two types of lexicons have been developed and tested. The best recognition accuracy achieved 56.67% +/- 0.33. The speech recognition system is based on Hidden Markov Models (HMM). The continuous speech corpus LRN0 was used for the speech recognition experiments.
引用
收藏
页码:395 / 406
页数:12
相关论文
共 50 条
  • [41] Using mutual information criterion to design an efficient phoneme set for Chinese speech recognition
    Zhang, Jin-Song
    Hu, Xin-Hui
    Nakamura, Satoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 508 - 513
  • [42] A probabilistic framework for landmark detection based on phonetic features for automatic speech recognition
    Juneja, Amit
    Espy-Wilson, Carol
    Journal of the Acoustical Society of America, 2008, 123 (02): : 1154 - 1168
  • [43] A novel privacy-preserving speech recognition framework using bidirectional LSTM
    Wang, Qingren
    Feng, Chuankai
    Xu, Yan
    Zhong, Hong
    Sheng, Victor S.
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2020, 9 (01):
  • [44] A Joint Speech Enhancement and Self-Supervised Representation Learning Framework for Noise-Robust Speech Recognition
    Zhu, Qiu-Shi
    Zhang, Jie
    Zhang, Zi-Qiang
    Dai, Li-Rong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1927 - 1939
  • [45] A novel privacy-preserving speech recognition framework using bidirectional LSTM
    Qingren Wang
    Chuankai Feng
    Yan Xu
    Hong Zhong
    Victor S. Sheng
    Journal of Cloud Computing, 9
  • [46] Speech Recognition for Keyword Spotting using a Set of Modulation Based Features - Preliminary Results
    Gopalan, Kaliappan
    Chu, Tao
    IMCIC 2010: INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL II, 2010, : 32 - 36
  • [47] PRIVACY ATTACKS FOR AUTOMATIC SPEECH RECOGNITION ACOUSTIC MODELS IN A FEDERATED LEARNING FRAMEWORK
    Tomashenko, Natalia
    Mdhaffar, Salima
    Tommasi, Marc
    Esteve, Yannick
    Bonastre, Jean-Francois
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6972 - 6976
  • [48] A new framework for Arabic recitation using speech recognition and the Jaro Winkler algorithm
    Larabi-Marie-Sainte, Souad
    Alnamlah, Betool S.
    Alkassim, Norah F.
    Alshathry, Sara Y.
    KUWAIT JOURNAL OF SCIENCE, 2022, 49 (01)
  • [49] EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition
    Gerczuk, Maurice
    Amiriparian, Shahin
    Ottl, Sandra
    Schuller, Bjorn W. W.
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1472 - 1487
  • [50] Development of HMM/neural network-based medium-vocabulary isolated-word Lithuanian speech recognition system
    Filipovic, M
    Lipeika, A
    INFORMATICA, 2004, 15 (04) : 465 - 474