Framework for choosing a set of syllables and phonemes for Lithuanian speech recognition

Cited by: 0
Authors
Laurinciukaite, Sigita [1]
Lipeika, Antanas [1]
Affiliations
[1] Inst Math & Informat, Recognit Proc Dept, LT-01108 Vilnius, Lithuania
Keywords
speech recognition; framework for formation of set of syllables and phonemes
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper describes a framework for constructing a set of syllables and phonemes that is subsequently used to create acoustic models for continuous speech recognition of Lithuanian. The goal is to find the set of syllables and phonemes that matters most for speech recognition. The framework comprises operations on the lexicon and on transcriptions of recordings. To facilitate this work, additional programs were developed that perform word syllabification, lexicon adjustment, etc. A series of experiments was carried out to establish the framework and to model syllable- and phoneme-based speech recognition. A predominance of syllables in the lexicon improved speech recognition results and encouraged us to move away from a strict definition of the syllable, i.e., a syllable becomes a simple sub-word unit derived from a syllable. Two sets of syllables and phonemes and two types of lexicons were developed and tested. The best recognition accuracy achieved was 56.67% ± 0.33. The speech recognition system is based on Hidden Markov Models (HMMs). The continuous speech corpus LRN0 was used for the speech recognition experiments.
Pages: 395-406
Number of pages: 12